Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a model about 2 months ago

DatPySci/RLDI

updated a dataset about 2 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

published a dataset about 2 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

View all activity

Organizations

updated a model about 2 months ago

DatPySci/RLDI

2B • Updated Sep 20

updated a dataset about 2 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

Viewer • Updated Sep 16 • 161k • 6

published a dataset about 2 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

Viewer • Updated Sep 16 • 161k • 6

updated a dataset about 2 months ago

DatPySci/Qwen2.5-Math-7B-deepscaler

Viewer • Updated Sep 16 • 161k • 5

published a dataset about 2 months ago

DatPySci/Qwen2.5-Math-7B-deepscaler

Viewer • Updated Sep 16 • 161k • 5

updated a dataset about 2 months ago

DatPySci/Llama-3.2-3B-deepscaler

Viewer • Updated Sep 16 • 161k • 3

published a dataset about 2 months ago

DatPySci/Llama-3.2-3B-deepscaler

Viewer • Updated Sep 16 • 161k • 3

published a model 3 months ago

DatPySci/RLDI

2B • Updated Sep 20

updated a model 6 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3

published 2 models 6 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3

DatPySci/Llama-3.2-3B-sft-mixture

Text Generation • 3B • Updated Feb 10 • 1

updated a model 6 months ago

DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated Apr 28

updated a model 7 months ago

DatPySci/DeepSeek-Qwen-1.5B-GRPO

2B • Updated Apr 22

published 3 models 7 months ago

updated a dataset 9 months ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated Feb 10 • 140k • 5

published a dataset 9 months ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated Feb 10 • 140k • 5

updated 2 datasets 9 months ago

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated Feb 10 • 177k • 8

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated Feb 10 • 177k • 8

Nguyễn Minh Phúc

AI & ML interests

Recent Activity

Organizations

DatPySci's activity