Pengxiang Li's picture

Pengxiang Li

pengxiang

·

pixeli99

AI & ML interests

Video generation, Image editing, AD

Recent Activity

upvoted a paper 4 days ago

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

commented on a paper 10 days ago

Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

upvoted a paper 20 days ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

View all activity

Organizations

upvoted a paper 4 days ago

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

Paper • 2602.06040 • Published 5 days ago • 10

upvoted a paper 20 days ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Paper • 2601.14750 • Published 21 days ago • 17

upvoted 2 papers about 2 months ago

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published Dec 17, 2025 • 17

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published Dec 3, 2025 • 48

upvoted 2 papers 3 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 53

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 127

upvoted 6 papers 4 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 68

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training

Paper • 2510.15859 • Published Oct 17, 2025 • 13

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 166

Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling

Paper • 2510.01329 • Published Oct 1, 2025 • 6

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27, 2025 • 43

upvoted 8 papers 5 months ago

Thinking Augmented Pre-training

Paper • 2509.20186 • Published Sep 24, 2025 • 23

Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving

Paper • 2509.20109 • Published Sep 24, 2025 • 4

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 103

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2, 2025 • 108

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9, 2025 • 59

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196