Tu Nguyen

tumeteor

AI & ML interests

LLM/RL, Graphs, IR/NLP

Recent Activity

upvoted a paper about 1 month ago

Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning

authored a paper about 1 month ago

Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

upvoted a paper about 1 month ago

Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

Organizations

None yet

Papers 2

arxiv:2509.22921

arxiv:2401.10337

models 0

None public yet

datasets 1

tumeteor/Security-TTP-Mapping

Updated Jan 23, 2024 • 20.7k • 229 • 25