Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 268
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 1 day ago • 80
Rethinking the Trust Region in LLM Reinforcement Learning Paper • 2602.04879 • Published 1 day ago • 25
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation Paper • 2602.02402 • Published 4 days ago • 31
Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ 3 days ago • 24
Article TruthTensor: LLM Evaluation in Prediction Markets Under Drift and Market Baseline 7 days ago • 18
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 8 days ago • 12
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published 7 days ago • 68
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods Paper • 2601.21821 • Published 8 days ago • 57
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published 29 days ago • 52
QIE Jan 23, 26 Collection adapter LoRA developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model • 7 items • Updated 3 days ago • 3
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 16 days ago • 71
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 15 days ago • 89
BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries Paper • 2601.15197 • Published 16 days ago • 54
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 16 days ago • 36