Google ❤️ Open Source AI
Soft Instruction De-escalation Defense
Extracting alignment data in open models