Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

41,821

Full-text search

Active filters: 4-bit

mlx-community/GLM-5-4bit

Text Generation • 744B • Updated 1 day ago • 451 • 6

mlx-community/Kimi-K2.5

Text Generation • Updated 17 days ago • 1.71M • 27

openbmb/MiniCPM-o-4_5-awq

Any-to-Any • 9B • Updated 8 days ago • 866 • 13

lmstudio-community/Qwen3-Coder-Next-MLX-4bit

80B • Updated 11 days ago • 385k • 9

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 119k • 598

Qwen/Qwen2.5-32B-Instruct-AWQ

Text Generation • 33B • Updated Oct 9, 2024 • 882k • 94

Qwen/Qwen3-0.6B-MLX-4bit

Text Generation • 83.9M • Updated Jul 29, 2025 • 560 • 20

EZCon/GLM-OCR-4bit-g32-mxfp4-mixed_4_8-mlx

Image-to-Text • 0.6B • Updated 9 days ago • 371 • 4

mlx-community/Voxtral-Mini-4B-Realtime-2602-4bit

Automatic Speech Recognition • 1B • Updated 3 days ago • 402 • 4

toby1991/Qwen3-Coder-Next-REAP-48B-A3B-4bit-mlx

Text Generation • 49B • Updated 5 days ago • 749 • 4

mlx-community/Qwen3-Coder-Next-4bit

Text Generation • Updated 6 days ago • 2.44k • 5

AxionLab-Co/DogeAI-v2.0-4B-Reasoning

Text Generation • 4B • Updated 5 days ago • 91 • 3

Intel/Qwen3-Coder-Next-int4-AutoRound

Text Generation • 12B • Updated 4 days ago • 170 • 3

steampunque/Qwen3-Coder-Next-Hybrid-GGUF

80B • Updated 4 days ago • 158 • 3

Qwen/Qwen2.5-Coder-14B-Instruct-AWQ

Text Generation • 15B • Updated Jan 12, 2025 • 73k • 15

ubaitur5/Ministral-3b-instruct-Q4-mlx

Text Generation • 0.5B • Updated Jan 22, 2025 • 176 • 3

mlx-community/Qwen3-0.6B-4bit

Text Generation • Updated Apr 28, 2025 • 34.8k • 11

MaziyarPanahi/Qwen3-14B-GGUF

Text Generation • 15B • Updated Apr 28, 2025 • 241k • 8

Qwen/Qwen3-14B-AWQ

Text Generation • 15B • Updated May 21, 2025 • 707k • 55

mlx-community/gpt-oss-20b-MXFP4-Q8

Text Generation • Updated Aug 29, 2025 • 659k • 30

unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit

Text Generation • Updated Sep 13, 2025 • 68k • 27

mlx-community/Qwen3-VL-8B-Instruct-4bit

Image-Text-to-Text • Updated Oct 14, 2025 • 518 • 5

mlx-community/Qwen3-VL-4B-Instruct-4bit

Image-Text-to-Text • Updated Oct 16, 2025 • 4.05k • 5

sherif1313/Arabic-handwritten-OCR-4bit-Qwen2.5-VL-3B-v2

Image-to-Text • 4B • Updated Dec 29, 2025 • 524 • 5

QuantTrio/GLM-4.7-AWQ

Text Generation • Updated Dec 29, 2025 • 18.5k • 25

Disty0/GLM-Image-SDNQ-4bit-dynamic

Text-to-Image • Updated 29 days ago • 990 • 11

mlx-community/GLM-4.7-Flash-4bit

Text Generation • Updated 19 days ago • 33.9k • 54

themindstudio/flux2-klein-4b-mlx-4bit

Text-to-Image • Updated 24 days ago • 2

QuantTrio/GLM-4.7-Flash-AWQ

Text Generation • 31B • Updated 24 days ago • 132k • 5

Intel/GLM-4.7-Flash-int4-AutoRound

1B • Updated 22 days ago • 2.78k • 7