Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
56.3
TFLOPS
3
2
11
spooner
spooner2
Follow
mondalsurojit's profile picture
21world's profile picture
jeiku's profile picture
6 followers
ยท
104 following
[email protected]
AI & ML interests
None yet
Recent Activity
liked
a model
24 days ago
Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32
reacted
to
mitkox
's
post
with ๐ฅ
5 months ago
I got 370 tokens/sec of Qwen3-30B-A3B 2507 on my desktop Z8 GPU workstation. My target is 400 t/s, and the last 10 % always tastes like victory!
reacted
to
eaddario
's
post
with ๐
6 months ago
Layer-wise and Pruned versions of Qwen/Qwen3-30B-A3B * Tesor-wise: https://huggingface.co/eaddario/Qwen3-30B-A3B-GGUF * Pruned: https://huggingface.co/eaddario/Qwen3-30B-A3B-pruned-GGUF Even though the Perplexity scores of the pruned version are 3 times higher, the ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores are holding remarkably well, considering two layers were removed (5 and 39). This seems to support Xin Men et al conclusions in ShortGPT: Layers in Large Language Models are More Redundant Than You Expect (2403.03853) Results summary in the model's card and test results in the ./scores directory. Questions/feedback is always welcomed.
View all activity
Organizations
None yet
spooner2
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
unsloth/Qwen3-32B-GGUF
8 months ago
FIXED: Failed to parse Jinja template
3
#2 opened 8 months ago by
wapxmas
New activity in
Kijai/SkyReels-V1-Hunyuan_comfy
10 months ago
Error HyVideoModelLoader
14
#7 opened 10 months ago by
Nikita661995
New activity in
HaileyStorm/FLUX.1-Merges
over 1 year ago
combined safetensors , but comfyui issue a error.
9
#3 opened over 1 year ago by
demo001s