chenzehao

chhao

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • Updated Aug 6, 2025 • 524k • 567

upvoted a paper 9 days ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 12 days ago • 173

upvoted a paper 10 days ago

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published 14 days ago • 47

upvoted a paper 14 days ago

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 16 days ago • 35

upvoted 2 papers 17 days ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 19 days ago • 514

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 216

upvoted a paper 20 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 261

upvoted 2 papers 28 days ago

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 207

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 241

New activity in chhao/Weak-Driven-Learning 29 days ago

Create README.md

#1 opened 29 days ago by AlexGeek

liked a model 29 days ago

DMindAI/DMind-3-nano

Text Generation • Updated 10 days ago • 6 • 57

liked 2 datasets 29 days ago

TeichAI/Pony-Alpha-15k

Viewer • Updated 26 days ago • 14.9k • 804 • 62

openbmb/UltraData-Math

Viewer • Updated 22 days ago • 181M • 69.1k • 259

liked a model about 1 month ago

chhao/Weak-Driven-Learning

Text Generation • Updated 29 days ago • 54 • 6

updated a model about 1 month ago

chhao/Weak-Driven-Learning

Text Generation • Updated 29 days ago • 54 • 6

published a model about 1 month ago

chhao/Weak-Driven-Learning

Text Generation • Updated 29 days ago • 54 • 6

upvoted a paper about 1 month ago

Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization

Paper • 2506.17252 • Published Jun 8, 2025 • 2

authored 2 papers about 1 month ago

Improving Viewpoint Consistency in 3D Generation via Structure Feature and CLIP Guidance

Paper • 2412.02287 • Published Dec 3, 2024 • 1

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 282

upvoted a paper about 1 month ago

Real-Time Aligned Reward Model beyond Semantics

Paper • 2601.22664 • Published Jan 30 • 15
