Is AWQ quantization possible for this model?
#17 opened by VivekMalipatel23
I am planning to run this on two 3090s with pipeline parallelism, but it looks like the 3090 doesn't support FP8. Could we get an AWQ-quantized version of this model and the other newer Qwen variants?
Check out cpatonn/Qwen3-Coder-30B-A3B-Instruct-AWQ-4bit, it works on 2 x 3090. :)
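For anyone landing here later, below is a minimal sketch of what serving that AWQ quant across two 3090s could look like with vLLM. This is an untested assumption about the setup, not a verified config: it assumes a recent vLLM build that supports pipeline parallelism in the offline `LLM` entry point (older versions only exposed it through the API server, i.e. `vllm serve --pipeline-parallel-size 2`).

```python
# Sketch: serving the AWQ 4-bit quant across two 3090s with vLLM.
# Assumes vLLM is installed with CUDA support and that this version
# supports pipeline parallelism in the offline LLM entry point.
from vllm import LLM, SamplingParams

llm = LLM(
    model="cpatonn/Qwen3-Coder-30B-A3B-Instruct-AWQ-4bit",
    quantization="awq",        # 4-bit AWQ weights
    pipeline_parallel_size=2,  # split the layer stack across the two 3090s
    dtype="float16",           # 3090 (Ampere) has no FP8 support
)

outputs = llm.generate(
    ["Write a Python function that reverses a string."],
    SamplingParams(max_tokens=128),
)
print(outputs[0].outputs[0].text)
```

With two GPUs, tensor parallelism (`tensor_parallel_size=2`) is the more common alternative; pipeline parallelism mainly helps when the cards are connected over plain PCIe without NVLink, since it needs far less inter-GPU bandwidth.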