Yang Su
yang-su2000
AI & ML interests
Long-Horizon RL Agent Alignment
Recent Activity
liked
a model
3 days ago
Qwen/QwQ-32B
upvoted
a
paper
about 2 months ago
START: Self-taught Reasoner with Tools
liked
a Space
4 months ago
DontPlanToEnd/UGI-Leaderboard