Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

13

Full-text search

Active filters: GPU

ManthanKulakarni/JQL_LLaMa_GGML

Text Generation • Updated Jun 26, 2023 • 2

EasyDeL/f32-EasyDeL-Llama-3.2-1B-Instruct

Updated Nov 9, 2024

EasyDeL/f32-EasyDeL-Llama-3.2-3B-Instruct

Updated Nov 9, 2024

EasyDeL/EasyDeL-Llama-3.2-3B-Instruct

Updated Nov 9, 2024

EasyDeL/EasyDeL-Llama-3.2-1B-Instruct

Updated Nov 9, 2024

EasyDeL/EasyDeL-Llama-3.1-8B-Instruct

Updated Nov 9, 2024

erfanzar/Xerxes2-3B

Updated Jan 22 • 1

erfanzar/Xerxes2-1B

Updated Jan 25 • 1 • 1

EasyDeL/Qwen2-0.5B-RewardModel

Updated Feb 7 • 1

EasyDeL/GRPO-Qwen2-0.5b-instruct

erfanzar/Marin-8B-Instruct-eformat

OpenPeerAI/FastPrint

Image Segmentation • Updated Sep 23 • 1

NexaAI/granite-4.0-micro-GGUF

Text Generation • 3B • Updated 26 days ago • 1.01k