Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

1,102

Full-text search

Active filters: llama.cpp

nawta/Heron-NVILA-Lite-2B-Q4_K_M-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 44

nawta/Heron-NVILA-Lite-2B-Q8_0-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 22

nawta/Heron-NVILA-Lite-2B-F16-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 11

nawta/Heron-NVILA-Lite-2B-Q2_K-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 28

nawta/Heron-NVILA-Lite-2B-Q3_K_M-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 13

nawta/Heron-NVILA-Lite-2B-Q5_K_M-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 20

nawta/Heron-NVILA-Lite-2B-Q6_K-GGUF

Image-Text-to-Text • 2B • Updated 15 days ago • 17

prithivMLmods/Qwen3-VisionCaption-2B

Image-Text-to-Text • 2B • Updated 13 days ago • 277 • 4

prithivMLmods/Qwen3-VisionCaption-2B-GGUF

Image-Text-to-Text • 2B • Updated 13 days ago • 4.15k • 8

hellstone1918/test-model

3B • Updated 14 days ago • 33

lefteris6/aziz-llm-llama-3.2-3B-Instruct-unsloth

3B • Updated 14 days ago • 76

mradermacher/Qwen3-VisionCaption-2B-GGUF

2B • Updated 14 days ago • 883 • 1

mradermacher/Qwen3-VisionCaption-2B-i1-GGUF

2B • Updated 9 days ago • 2.51k • 1

conff/model

1B • Updated 13 days ago • 79

Kaleemullah/deepseek-r1-distill-qwen-1.5b-gguf

2B • Updated 14 days ago • 44

Kaleemullah/deepseek-r1-distill-qwen-7b-gguf

8B • Updated 14 days ago • 39

Kaleemullah/DeepSeek-R1-Distill-Llama-8B-gguf

8B • Updated 14 days ago • 55

TeichAI/Qwen3-4B-Thinking-2507-Command-A-Reasoning-Distill-GGUF

4B • Updated 12 days ago • 173

nmnth/gemma-3-1b-extract-rating-lora-merged-Q8_0-GGUF

1.0B • Updated 13 days ago • 98

hellstone1918/Llama-3.2-3B-basic-lora-model

3B • Updated 13 days ago • 71

mburaksayici/golden_generate_qwen_0.6b_v2

Updated 12 days ago

mburaksayici/golden_generate_qwen_0.6b_v2_gguf

0.6B • Updated 12 days ago • 133

mahdishahsavari/gpt-oss-20B-finetune-gguf

21B • Updated 13 days ago • 277

tzu98/mistral-12B-wux-16

Updated 13 days ago • 74

tzu98/mistral-12B-wux-q4

Updated 13 days ago • 52

jacqueasd/Mantrika-Gemma3-4B-GGUF

4B • Updated 12 days ago • 53

jacobbista/llama3-3b-finetome

3B • Updated 12 days ago • 72

astegaras/merged_kaggle

3B • Updated 12 days ago • 258

darrellxcheng/shaderWrap-Qwen2.5CoderGGUF

15B • Updated 9 days ago • 34

Melaraby/qwen_vlm3_detect2_gguf

8B • Updated 12 days ago • 99