Reasoning Transfer

classroom

AI & ML interests

None defined yet.

Recent Activity

yuexiang96

authored 4 papers 17 days ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 28

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3, 2025 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 25 days ago • 36

Ibisbill

updated a model 3 months ago

ReasoningTransferability/UniReason-Qwen3-14B-think-SFT

Text Generation • 15B • Updated Sep 28, 2025 • 12

aaabiao

authored a paper 4 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

Ibisbill

updated 2 models 4 months ago

ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT

Text Generation • 15B • Updated Aug 25, 2025 • 16 • 1

ReasoningTransferability/UniReason-Qwen3-14B-RL

Text Generation • 15B • Updated Aug 25, 2025 • 21 • 3

aaabiao

updated a dataset 6 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11, 2025 • 48.8k • 78

aaabiao

published a dataset 6 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11, 2025 • 48.8k • 78

aaabiao

authored a paper 6 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 23

Ibisbill

updated a dataset 6 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8, 2025 • 39.9k • 55 • 4

Ibisbill

published a dataset 6 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8, 2025 • 39.9k • 55 • 4

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-RL 6 months ago

Add `library_name` metadata and GitHub link to model card

#1 opened 6 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-think-SFT 6 months ago

Add library_name and prominent link to GitHub repository

#1 opened 6 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT 6 months ago

Add library name and GitHub link to model card

#1 opened 6 months ago by

nielsr

Ibisbill

published 3 models 6 months ago

yuexiang96

authored a paper 6 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17, 2025 • 39

Team members 4