Active filters: gptq
Xu-Ouyang/pythia-2.8b-deduped-int3-step129000-GPTQ-wikitext2 • Text Generation • 0.5B • Updated
Xu-Ouyang/pythia-12b-deduped-int4-step43000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Xu-Ouyang/pythia-12b-deduped-int4-step57000-GPTQ-wikitext2 • Text Generation • 2B • Updated
hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4 • Text Generation • 59B • Updated • 68 • 15
Xu-Ouyang/pythia-12b-deduped-int4-step86000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Xu-Ouyang/pythia-12b-deduped-int4-step100000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Nkumah7/gemma-11-2b-it-ptt-lora-exp-v1-merged-4bit-gptq • Text Generation • 0.8B • Updated
rinna/llama-3-youko-8b-gptq • Text Generation • 2B • Updated • 21
rinna/llama-3-youko-8b-instruct-gptq • Text Generation • 2B • Updated • 3 • 1
rinna/llama-3-youko-70b-gptq • Text Generation • 11B • Updated • 6
rinna/llama-3-youko-70b-instruct-gptq • Text Generation • 11B • Updated • 1
Xu-Ouyang/pythia-1.4b-deduped-int3-step14000-GPTQ-wikitext2 • Text Generation • 0.3B • Updated
Xu-Ouyang/pythia-1.4b-deduped-int3-step29000-GPTQ-wikitext2 • Text Generation • 0.3B • Updated • 1
Xu-Ouyang/pythia-1.4b-deduped-int3-step43000-GPTQ-wikitext2 • Text Generation • 0.3B • Updated
Xu-Ouyang/pythia-1.4b-deduped-int3-step57000-GPTQ-wikitext2 • Text Generation • 0.3B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ • Text Generation • 1B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS • Text Generation • 4B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS • Text Generation • 4B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ • Text Generation • 1B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS • Text Generation • 7B • Updated
Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2 • Text Generation • 0.6B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ • Text Generation • 2B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS • Text Generation • 18B • Updated • 10
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ • Text Generation • 5B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ • Text Generation • 6B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS • Text Generation • 36B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ • Text Generation • 10B • Updated
Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2 • Text Generation • 0.5B • Updated
Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2 • Text Generation • 2B • Updated
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ • Text Generation • 0.7B • Updated