Daniel Wang

DanielWang
AI & ML interests

Natural Language Processing, Machine Learning

Organizations

BitNoteGroup

upvoted 2 papers about 1 year ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published Jan 3, 2025 • 33

KV Shifting Attention Enhances Language Modeling

Paper • 2411.19574 • Published Nov 29, 2024 • 8

upvoted 6 papers almost 2 years ago

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Paper • 2403.12015 • Published Mar 18, 2024 • 70

Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13, 2024 • 15

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 51

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 44

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66