liyang

  • liyang-7

AI & ML interests

Multi-modal LLM

Organizations

gaodianjitui778

upvoted a paper 2 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15 • 109
upvoted a collection 2 months ago

Ovis2.5

Collection
Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19 • 55
upvoted a paper 4 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29 • 62
upvoted a paper 6 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 80
upvoted a collection 9 months ago

Ovis2

Collection
Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 65
upvoted 2 papers over 1 year ago

Parrot: Multilingual Visual Instruction Tuning

Paper • 2406.02539 • Published Jun 4, 2024 • 37

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Paper • 2405.20797 • Published May 31, 2024 • 30