Mann Patel

manncodes

AI & ML interests

NLP, Mech Interp, Reasoning, MLSystems

Recent Activity

liked a model 6 days ago

XiaomiMiMo/MiMo-V2-Flash

Organizations

None yet

upvoted a paper about 10 hours ago

When Reasoning Meets Its Laws

Paper • 2512.17901 • Published 6 days ago • 53

upvoted a paper 5 days ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published 10 days ago • 26

upvoted an article 9 days ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • 16 days ago • 81

upvoted a collection 10 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 1 day ago • 48

upvoted a collection 17 days ago

Tiny-A2D

Small diffusion language models adapted from AR models • 4 items • Updated 19 days ago • 11

upvoted a paper 17 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 24 days ago • 93

upvoted 2 papers about 1 month ago

HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Paper • 2505.11475 • Published May 16 • 4

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted a collection 2 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 315

upvoted a paper 3 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1 • 119

upvoted an article 3 months ago

PipelineRL

Article • Apr 25 • 42

upvoted a collection 3 months ago

— Long-context post-training 🧶 —

Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14 • 5

upvoted 3 papers 4 months ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 72

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 151

upvoted a paper 5 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27 • 14

upvoted an article 5 months ago

Everything About Long Context Fine-tuning

Article • May 10, 2024 • 53

upvoted a paper 5 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 300

upvoted 2 articles 7 months ago

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Article • Mar 22, 2024 • 104

KV Cache from scratch in nanoVLM

Article • Jun 4 • 106