Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models
KV Cache Quantization
updated
30 days ago
Collection on FP8 Quantization of Weights, Activations and KV Cache
Upvote
-
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Head
Updated
30 days ago
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
30 days ago
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Head
Updated
30 days ago
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
30 days ago
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head
Updated
30 days ago
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
30 days ago
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
30 days ago
Upvote
-
Share collection
View history
Collection guide
Browse collections