Yang Su's picture

1 4 14

Yang Su

yang-su2000

·

https://alicellm.github.io/

AI & ML interests

Long-Horizon RL Agent Alignment

Recent Activity

liked a model 3 days ago

Qwen/QwQ-32B

upvoted a paper about 2 months ago

START: Self-taught Reasoner with Tools

liked a Space 4 months ago

DontPlanToEnd/UGI-Leaderboard

View all activity

Organizations

Collections 1

Papers 1

arxiv:2412.15115

models 0

None public yet

datasets 0

None public yet