---
## Quantized Versions
Optimized GGUF quantizations of **Strand-Rust-Coder-14B-v1** are available for local and Fortytwo Node deployment, offering a reduced memory footprint with a minimal performance trade-off.
These builds are compatible with **llama.cpp**, **Jan**, **LM Studio**, **Ollama**, and other runtimes supporting the GGUF format.
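
As an illustration, here is a minimal inference sketch using the llama-cpp-python bindings (one of several possible runtimes). The local GGUF filename, prompt, and generation settings are hypothetical placeholders, not official instructions, and assume the model file has already been downloaded:

```python
# Minimal sketch, assuming llama-cpp-python is installed
# (pip install llama-cpp-python) and a GGUF file is present locally.
from llama_cpp import Llama

llm = Llama(
    model_path="strand-rust-coder-14b-v1-Q4_K_M.gguf",  # placeholder filename
    n_ctx=4096,       # context window
    n_gpu_layers=-1,  # offload all layers to GPU when available
)

output = llm(
    "Write a Rust function that reverses a string:",
    max_tokens=256,
)
print(output["choices"][0]["text"])
```

The same file loads unchanged in Jan, LM Studio, or Ollama; only the loading mechanism differs per runtime.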
| **Quantization** | **Size** | **Bit Precision** | **Description** |
|------------------|-----------|------------------|----------------|
| **Q4_K_M** | 8.99 GB | **4-bit** | Ultra-fast, compact variant for consumer GPUs and laptops |
| **Q5_K_M** | 10.5 GB | **5-bit** | Lightweight deployment with strong accuracy retention |
| **Q6_K** | 12.1 GB | **6-bit** | Balanced performance and efficiency |
| **Q8_0** | 15.7 GB | **8-bit** | Near-full precision, for the most demanding local inference |
Quantized builds: [Fortytwo-Network/Strand-Rust-Coder-14B-v1-GGUF](https://huggingface.co/Fortytwo-Network/Strand-Rust-Coder-14B-v1-GGUF)
---
**Fortytwo – An open, networked intelligence shaped collectively by its participants**
Join the swarm: [fortytwo.network](https://fortytwo.network)