Jianhao Yan's picture

5 5 4

Jianhao Yan

Elliott

·

ElliottYan

AI & ML interests

None yet

Recent Activity

commented on a paper 2 days ago

Learning to Reason under Off-Policy Guidance

commented on a paper 6 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

updated a model 13 days ago

Elliott/Qwen2.5-Math-7B-SFT

View all activity

Organizations

None yet

Collections 1

Papers 2

arxiv:2504.14945

arxiv:2503.21614

models 6

Elliott/Qwen2.5-Math-7B-SFT

Text Generation • Updated 13 days ago • 3

Elliott/Qwen2.5-Math-7B-16k-think

Text Generation • Updated 22 days ago • 2.93k • 1

Elliott/LUFFY-Qwen-Math-1.5B-Zero

Text Generation • Updated 22 days ago • 375

Elliott/LUFFY-Qwen-Instruct-7B

Text Generation • Updated 22 days ago • 10 • 1

Elliott/LUFFY-Qwen-Math-7B-Zero-On-Policy

Text Generation • Updated 22 days ago • 3

Elliott/LUFFY-Qwen-Math-7B-Zero

Text Generation • Updated 22 days ago • 135 • 1

datasets 1

Elliott/Openr1-Math-46k-8192

Viewer • Updated 22 days ago • 45.8k • 388 • 2