Humanity's Last Exam
Paper • 2501.14249 • Published • 77
AI Safety & AI Security
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates
ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks