BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions Paper • 2510.05318 • Published 28 days ago • 21
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Paper • 2509.26625 • Published Sep 30 • 43
Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System Paper • 2508.06059 • Published Aug 8 • 4
Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System Paper • 2508.06059 • Published Aug 8 • 4
Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System Paper • 2508.06059 • Published Aug 8 • 4 • 2
SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users Paper • 2504.10157 • Published Apr 14 • 17