
wyx

DecoderImmortal

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago
anon8231489123/ShareGPT_Vicuna_unfiltered
upvoted a paper 14 days ago
ProxyAttn: Guided Sparse Attention via Representative Heads
liked a Space 21 days ago
yzweak/AutoPR

Organizations

None yet

upvoted a paper 14 days ago

ProxyAttn: Guided Sparse Attention via Representative Heads

Paper • 2509.24745 • Published Sep 29 • 1
upvoted a paper 4 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 92
upvoted a collection 4 months ago

ERNIE 4.5

Collection
collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174
upvoted an article 7 months ago
Article

What is test-time compute and how to scale it?

By Kseniase and 1 other • Feb 6 • 107
upvoted a paper 7 months ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 54
upvoted a paper about 1 year ago

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

Paper • 2404.04925 • Published Apr 7, 2024 • 1