Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
Jan 24
Upvote
-
RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
130
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
20
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
1B
•
Updated
Sep 22
•
602
•
1
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
Sep 25
•
115
•
2
RedHatAI/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 28
•
27
RedHatAI/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
Sep 22
•
45
•
1
RedHatAI/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
5
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
10
RedHatAI/granite-3.1-8b-base-FP8-dynamic
Text Generation
•
8B
•
Updated
Feb 20
•
1
RedHatAI/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 30
•
3
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
1B
•
Updated
Sep 22
•
11
•
1
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
8B
•
Updated
Feb 28
•
13
Upvote
-
Share collection
View history
Collection guide
Browse collections