💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 21 items • Updated 5 days ago • 114
Multimodal GGUFs Collection Vision and audio models compatible with llama-server and llama-mtmd-cli • 13 items • Updated Aug 20 • 12
Draft Models Collection Tiny "draft" models for speculative decoding. • 32 items • Updated 1 day ago • 4
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 140
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 172
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 129
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 49 items • Updated 18 days ago • 229
Granite Quantized Models Collection Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 41 items • Updated 4 days ago • 20
Text-to-Speech (TTS) models Collection A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now! • 16 items • Updated 25 days ago • 25
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 25 days ago • 226
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 209
GGUF LoRA adapters Collection Adapters extracted from fine tuned models, using mergekit-extract-lora • 16 items • Updated Aug 20 • 4
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 647
✂️ Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 34 items • Updated Jul 25 • 127
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper • 2405.04434 • Published May 7, 2024 • 21
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 562