Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ads's picture
5 2

ads

sxcasf
·

AI & ML interests

None yet

Recent Activity

new activity 19 days ago
HuggingFaceTB/Countdown-Task-GOLD:Inconsistent numbers
upvoted a paper 19 days ago
Unified Video Editing with Temporal Reasoner
new activity 27 days ago
Qwen/Qwen3-1.7B:When enable_thinking=True, why doesn't the chat_template output end with "<think>?
View all activity

Organizations

Alibaba Cloud Apsara Lab 's profile picture

New activity in HuggingFaceTB/Countdown-Task-GOLD 19 days ago

Inconsistent numbers

6
#1 opened about 1 month ago by
MysticJay
upvoted a paper 19 days ago

Unified Video Editing with Temporal Reasoner

Paper • 2512.07469 • Published 20 days ago • 45
New activity in Qwen/Qwen3-1.7B 27 days ago

When enable_thinking=True, why doesn't the chat_template output end with "<think>?

#16 opened 28 days ago by
sxcasf
New activity in VityaVitalich/Qwen3-1.7B 28 days ago

When enable_thinking=True, why doesn't the chat_template output end with "<think>

#1 opened 28 days ago by
sxcasf
upvoted a paper about 1 year ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52
New activity in WizardLMTeam/WizardLM-13B-V1.0 over 1 year ago

Why is it that more than 99.5% of the parameter values in each layer are less than 1e-4

#12 opened over 1 year ago by
sxcasf
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs