RLFR: Extending Reinforcement Learning for LLMs with Flow Environment Paper • 2510.10201 • Published Oct 11, 2025 • 35
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 105k • 94
MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models Paper • 2508.13938 • Published Aug 19, 2025 • 1
microsoft/Phi-3.5-vision-instruct Image-Text-to-Text • 4B • Updated Dec 10, 2025 • 811k • 724
Skywork/Skywork-o1-Open-Llama-3.1-8B Text Generation • 8B • Updated Aug 29, 2025 • 183 • • 115
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46