Models: 25

Active filters: arxiv: 2310.08659

LoftQ/Llama-2-7b-hf-4bit-64rank

Text Generation • 4B • Updated May 3, 2024 • 44 • 2

LoftQ/Llama-2-13b-hf-4bit-64rank

Text Generation • 13B • Updated Dec 19, 2023 • 2 • 2

LoftQ/Llama-2-70b-hf-4bit-64rank

Text Generation • 69B • Updated May 3, 2024 • 2 • 1

LoftQ/Mistral-7B-v0.1-4bit-32rank

Text Generation • 7B • Updated Dec 20, 2023 • 1

LoftQ/Mistral-7B-v0.1-4bit-64rank

Text Generation • 4B • Updated Apr 18, 2024 • 18 • 2

LoftQ/Llama-2-7b-hf-fp16-64rank-gsm8k

Updated Dec 20, 2023 • 3

LoftQ/phi-2-4bit-64rank

Text Generation • 3B • Updated Aug 15, 2024 • 27

LoftQ/Meta-Llama-3-8B-4bit-64rank

Text Generation • 5B • Updated May 3, 2024 • 76 • 1

LoftQ/CodeLlama-7b-hf-4bit-64rank

Text Generation • 4B • Updated Apr 20, 2024

LoftQ/CodeLlama-13b-hf-4bit-64rank

Text Generation • 7B • Updated Apr 20, 2024

LoftQ/Meta-Llama-3-8B-Instruct-4bit-64rank

Text Generation • 5B • Updated May 3, 2024 • 2 • 1

LoftQ/Meta-Llama-3-70B-4bit-64rank-1iter

Text Generation • 37B • Updated Apr 21, 2024 • 2 • 2

LoftQ/Meta-Llama-3-70B-4bit-64rank

Text Generation • 37B • Updated May 3, 2024 • 4 • 1

LoftQ/Meta-Llama-3-70B-Instruct-4bit-64rank

Text Generation • 37B • Updated May 3, 2024 • 1

LoftQ/Phi-3-mini-128k-instruct-4bit-64rank

Text Generation • 2B • Updated May 3, 2024 • 1

LoftQ/Phi-3-mini-4k-instruct-4bit-64rank

Text Generation • 2B • Updated May 3, 2024 • 2

anamikac2708/Llama3-8b-LoftQ-finetuned-investopedia-Lora-Adapters

Updated Jun 18, 2024

RichardErkhov/LoftQ_-_Llama-2-13b-hf-4bit-64rank-gguf

13B • Updated Aug 12, 2024 • 363

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-gguf

7B • Updated Aug 18, 2024 • 647

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-4bits

4B • Updated Oct 18, 2024

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-8bits

7B • Updated Oct 18, 2024

RichardErkhov/LoftQ_-_Llama-2-13b-hf-4bit-64rank-4bits

7B • Updated Oct 26, 2024

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-gguf

3B • Updated Oct 30, 2024 • 208

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-4bits

2B • Updated Oct 30, 2024

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-8bits

3B • Updated Oct 30, 2024