Jianhao Yan
Elliott
AI & ML interests
None yet
Recent Activity
commented on
a paper
2 days ago
Learning to Reason under Off-Policy Guidance
commented on
a paper
6 days ago
Reinforcement Learning for Reasoning in Large Language Models with One
Training Example
updated
a model
13 days ago
Elliott/Qwen2.5-Math-7B-SFT
Organizations
None yet
Collections
1
Papers
2
models
6
Elliott/Qwen2.5-Math-7B-SFT
Text Generation
•
Updated
•
3
Elliott/Qwen2.5-Math-7B-16k-think
Text Generation
•
Updated
•
2.93k
•
1
Elliott/LUFFY-Qwen-Math-1.5B-Zero
Text Generation
•
Updated
•
375
Elliott/LUFFY-Qwen-Instruct-7B
Text Generation
•
Updated
•
10
•
1
Elliott/LUFFY-Qwen-Math-7B-Zero-On-Policy
Text Generation
•
Updated
•
3
Elliott/LUFFY-Qwen-Math-7B-Zero
Text Generation
•
Updated
•
135
•
1