Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Paper • 2512.13607 • Published 10 days ago • 26
Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 16 days ago • 81
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 2 days ago • 48
Tiny-A2D Collection Small diffusion language models adapted from AR models • 4 items • Updated 19 days ago • 11
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published 25 days ago • 93