Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security Paper • 2507.19399 • Published Jul 25, 2025 • 1