Text Generation · Transformers · Safetensors · qwen2 · conversational · text-generation-inference
inikitin committed
Commit e27ab31 · verified · 1 parent: 24ff49f

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -219,10 +219,10 @@ These builds are compatible with **llama.cpp**, **Jan**, **LM Studio**, **Ollama
 
 | **Quantization** | **Size** | **Bit Precision** | **Description** |
 |------------------|-----------|------------------|----------------|
-| **Q4_K_M** | 8.99 GB | **4-bit** | Ultra-fast, compact variant for consumer GPUs and laptops |
-| **Q5_K_M** | 10.5 GB | **5-bit** | Lightweight deployment with strong accuracy retention |
-| **Q6_K** | 12.1 GB | **6-bit** | Balanced performance and efficiency |
 | **Q8_0** | 15.7 GB | **8-bit** | Near-full precision, for most demanding local inference |
+| **Q6_K** | 12.1 GB | **6-bit** | Balanced performance and efficiency |
+| **Q5_K_M** | 10.5 GB | **5-bit** | Lightweight deployment with strong accuracy retention |
+| **Q4_K_M** | 8.99 GB | **4-bit** | Ultra-fast, compact variant for consumer GPUs and laptops |
 
 Quant versions: [Fortytwo-Network/Strand-Rust-Coder-14B-v1-GGUF](https://huggingface.co/Fortytwo-Network/Strand-Rust-Coder-14B-v1-GGUF)
 
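
As an illustration of how the GGUF quants in the table above are typically consumed, here is a minimal sketch that downloads one build and runs it through `llama-cpp-python` (the Python bindings for llama.cpp, one of the runtimes the table names). It assumes `huggingface_hub` and `llama-cpp-python` are installed; the exact `.gguf` filename inside the linked repository is an assumption and should be checked against its file listing.

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Fetch one of the GGUF quants listed in the table above.
# NOTE: the filename below is a hypothetical guess; verify it on the repo's file list.
model_path = hf_hub_download(
    repo_id="Fortytwo-Network/Strand-Rust-Coder-14B-v1-GGUF",
    filename="strand-rust-coder-14b-v1-Q4_K_M.gguf",  # hypothetical filename
)

# Q4_K_M (~8.99 GB) is the smallest variant in the table, aimed at consumer
# GPUs and laptops; the larger quants trade memory for accuracy retention.
llm = Llama(model_path=model_path, n_ctx=4096)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a Rust function that reverses a String."}],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```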