HugoLaurencon (Hugo Laurençon)

upvoted a paper 18 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 21 days ago • 446

upvoted a paper 28 days ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published about 1 month ago • 117

upvoted 2 papers about 1 month ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 34

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Paper • 2509.07966 • Published Sep 9 • 4

upvoted a paper about 2 months ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

Paper • 2509.07558 • Published Sep 9 • 7

upvoted a paper 2 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 36

upvoted 2 papers 3 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35

upvoted a paper 4 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 42

upvoted 3 papers 5 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 73

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25 • 21

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98

upvoted a paper 6 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 135

upvoted 2 papers 7 months ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 76

upvoted a collection 7 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 648

upvoted a paper 7 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 141

upvoted an article 8 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

upvoted 2 papers 8 months ago

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 40

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 23

Hugo Laurençon

AI & ML interests

Organizations

Less is More: Recursive Reasoning with Tiny Networks

Quantile Advantage Estimation for Entropy-Safe Reasoning

ARE: Scaling Up Agent Environments and Evaluations

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

ΔL Normalization: Rethink Loss Aggregation in RLVR

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Group Sequence Policy Optimization

Scaling Laws for Optimal Data Mixtures

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Llama 4

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Open-source DeepResearch – Freeing our search agents

Nougat: Neural Optical Understanding for Academic Documents

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Hugo Laurençon

AI & ML interests

Organizations

HugoLaurencon's activity

Open-source DeepResearch – Freeing our search agents