Elriggs/hnn_transcoders

mechanistic-interpretability

hnn_transcoders/layer_23/config.yaml

Upload layer_23/config.yaml with huggingface_hub

77d9d4e verified 28 days ago

336 Bytes

dataset:
  max_length: 128
  name: monology/pile-uncopyrighted
  split: train
model:
  device: cuda
  name: EleutherAI/pythia-410m
transcoding:
  batch_size: 512
  bias: true
  debug: false
  hidden_multiplier: 4
  layer_idx: 23
  learning_rate: 0.02
  model_type: Bilinear
  n_batches: 20
  n_batches_full: 3000
  optimizer_type: Muon