Dehao Huang
red0orange
AI & ML interests
None yet
Recent Activity
upvoted a paper about 12 hours ago
V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts
updated a dataset 3 months ago
red0orange/temp_sampling_sft