M3.2-24B-Loki-V1.0-GGUF
GGUF model files for M3.2-24B-Loki-V1.0.
This repository contains GGUF models quantized using llama.cpp.
- Base Model: CrucibleLab-TG/M3.2-24B-Loki-V1.0
- Quantization Methods Processed in this Job:
BF16,Q6_K,Q5_K_M,Q5_K_S,Q5_0,Q4_K_M,Q4_K_S,Q4_0,Q3_K_L,Q3_K_M,Q3_K_S,Q2_K,Q8_0 - Importance Matrix Used: No
This specific upload is for the Q8_0 quantization.
- Downloads last month
- 140
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support