https://alignmentpretraining.ai — Documentation In Progress
Geodesic Research
Team
non-profit
Models where we try out various approaches to positive alignment during midtraining
- geodesic-research/sfm-midtraining_mix_blocklist_filtered
  Text Generation • 7B • Updated • 49 • 1
- geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character
  Text Generation • 7B • Updated • 70 • 1
- geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1
  Text Generation • 7B • Updated • 71
- geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
  Text Generation • 7B • Updated • 189

- geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered
  Text Generation • 7B • Updated • 654 • 1
- geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered
  Text Generation • 7B • Updated • 694 • 1
- geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered
  Text Generation • 7B • Updated • 756 • 1
- geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered
  Text Generation • 7B • Updated • 722 • 1

- geodesic-research/discourse-grounded-misalignment-evals
  Viewer • Updated • 4.17k • 300
- geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
  Viewer • Updated • 14.9M • 99
- Kyle1668/sfm-midtraining-mix
  Viewer • Updated • 42.8M • 3
- EleutherAI/deep-ignorance-pretraining-mix
  Viewer • Updated • 410M • 2.14k • 2

- Kyle1668/sfm-midtraining_mix_unfiltered
  Text Generation • 7B • Updated • 252
- geodesic-research/sfm-midtraining_unfiltered_synthetic_misalignment_mix
  Text Generation • 7B • Updated • 175
- geodesic-research/sfm-midtraining_mix_blocklist_filtered
  Text Generation • 7B • Updated • 49 • 1
- geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
  Text Generation • 7B • Updated • 189
Here is a selection of SFM models that have undergone DPO.
- geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO
  Text Generation • 7B • Updated • 634
- geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO
  Text Generation • 7B • Updated • 524
- geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO
  Text Generation • 7B • Updated • 527
- geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO
  Text Generation • 7B • Updated • 519
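
The repo IDs listed above can be passed to the Hugging Face `transformers` library. The following is a minimal sketch, not the authors' documented workflow: it assumes the checkpoints load through the standard `AutoTokenizer` and `AutoModelForCausalLM` classes (suggested by the "Text Generation" tag, but not confirmed on this page), that `transformers` and PyTorch are installed, and that the machine can hold a 7B model. The helper names `load_model` and `generate` are hypothetical.

```python
from typing import Any, Tuple

# Repo ID copied from the DPO list above; any other listed model ID can be substituted.
MODEL_ID = "geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO"


def load_model(model_id: str = MODEL_ID) -> Tuple[Any, Any]:
    """Download a checkpoint from the Hub and return (tokenizer, model).

    Assumption: the repo is compatible with the generic Auto* classes.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model


def generate(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 64) -> str:
    """Generate a continuation of `prompt` with the chosen checkpoint."""
    tokenizer, model = load_model(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode the first (and only) sequence in the batch.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` triggers the download of the full checkpoint, so it is best run on a machine with sufficient disk space and memory.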