SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark Paper • 2402.05138 • Published Feb 6, 2024 • 2
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training Paper • 2510.12831 • Published 17 days ago • 2
Large Language Model based Multi-Agents: A Survey of Progress and Challenges Paper • 2402.01680 • Published Jan 21, 2024 • 2
What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks Paper • 2305.18365 • Published May 27, 2023 • 4