moonshotai/Kimi-K2-Instruct-0905 Text Generation • 1T • Updated Nov 7, 2025 • 31.7k • 648
lambertxiao/Vision-Language-Vision-Captioner-Qwen2.5-3B Image-to-Text • 5B • Updated Sep 2, 2025 • 12 • 2
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 195
Captain Cinema: Towards Short Movie Generation Paper • 2507.18634 • Published Jul 24, 2025 • 41
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models Paper • 2507.07104 • Published Jul 9, 2025 • 45
zai-org/GLM-4.1V-9B-Thinking Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 159k • 762
Play to Generalize: Learning to Reason Through Game Play Paper • 2506.08011 • Published Jun 9, 2025 • 15