5 25 2

Qingcheng Zeng

qcz

qcznlp

AI & ML interests

None yet

Recent Activity

upvoted a collection 5 days ago

Qwen3.5

submitted a paper 19 days ago

RAPTOR: Ridge-Adaptive Logistic Probes

upvoted a paper about 1 month ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

View all activity

Organizations

upvoted a collection 5 days ago

Qwen3.5

Collection

2 items • Updated 4 days ago • 186

upvoted 2 papers about 1 month ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 30

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published Jan 12 • 24

upvoted a paper 2 months ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published Dec 21, 2025 • 25

upvoted a paper 4 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 19

upvoted 5 papers 5 months ago

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Paper • 2510.04434 • Published Oct 6, 2025 • 6

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27, 2025 • 62

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8, 2025 • 15

upvoted 6 papers 7 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 183

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 130

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6, 2025 • 162

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1, 2025 • 94

Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Paper • 2507.23779 • Published Jul 31, 2025 • 45

Diversity-Enhanced Reasoning for Subjective Questions

Paper • 2507.20187 • Published Jul 27, 2025 • 26

upvoted an article 7 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29, 2025

•

214

upvoted a paper 7 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

upvoted a collection 7 months ago

AGUVIS: Unified Pure Vision GUI Agents

Collection

https://aguvis-project.github.io • 3 items • Updated Dec 20, 2024 • 7

upvoted a paper 9 months ago

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9, 2025 • 18

Qingcheng Zeng

AI & ML interests

Recent Activity

Organizations

qcz's activity

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face