Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 13 items • Updated 1 day ago • 14
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 24 days ago • 110
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 24 days ago • 60
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 38
Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated Dec 9, 2025 • 38
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 21
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 139
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 270