Jim Lai
grimjim
AI & ML interests
Experimenting primarily with 7B-12B parameter text completion models. Not all models are intended for direct end use; some are meant for research and/or educational purposes.
Recent Contributions: stabilized refusal direction ablation via Gram-Schmidt orthonormalization and norm-preserving interventions; confirmed reasoning transfer via model merging.
Recent Activity
updated a dataset 9 days ago: grimjim/llm-aes-writing-prompts-deduplicated-0.9-similarity
published a dataset 9 days ago: grimjim/llm-aes-writing-prompts-deduplicated-0.9-similarity
commented on their article 11 days ago: Norm-Preserving Biprojected Abliteration