Xiao-Ming Wu's picture

6

Xiao-Ming Wu

DravenALG

https://dravenalg.github.io/

AI & ML interests

Deep Learning, Computer Vision, Embodied AI

Recent Activity

upvoted a paper 6 days ago

ProEdit: Inversion-based Editing From Prompts Done Right

upvoted a paper 13 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

updated a dataset 28 days ago

DravenALG/GraspNet-1Billion

View all activity

Organizations

None yet

upvoted a paper 6 days ago

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published 9 days ago • 16

upvoted a paper 13 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 13 days ago • 61

upvoted a paper about 1 month ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted 2 papers 3 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 66

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 125

upvoted a paper 5 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18, 2025 • 49