Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jianhao Yan's picture
5 5 4

Jianhao Yan

Elliott
John6666's profile picture
·
  • ElliottYan

AI & ML interests

None yet

Recent Activity

commented on a paper 2 days ago
Learning to Reason under Off-Policy Guidance
commented on a paper 6 days ago
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
updated a model 13 days ago
Elliott/Qwen2.5-Math-7B-SFT
View all activity

Organizations

None yet

Collections 1

LUFFY-RL
  • Elliott/LUFFY-Qwen-Math-7B-Zero

    Text Generation • Updated 22 days ago • 135 • 1
  • Elliott/Qwen2.5-Math-7B-16k-think

    Text Generation • Updated 22 days ago • 2.93k • 1
  • Elliott/Openr1-Math-46k-8192

    Viewer • Updated 22 days ago • 45.8k • 388 • 2
  • Learning to Reason under Off-Policy Guidance

    Paper • 2504.14945 • Published 24 days ago • 82

Papers 2

arxiv:2504.14945
arxiv:2503.21614

models 6

Elliott/Qwen2.5-Math-7B-SFT

Text Generation • Updated 13 days ago • 3

Elliott/Qwen2.5-Math-7B-16k-think

Text Generation • Updated 22 days ago • 2.93k • 1

Elliott/LUFFY-Qwen-Math-1.5B-Zero

Text Generation • Updated 22 days ago • 375

Elliott/LUFFY-Qwen-Instruct-7B

Text Generation • Updated 22 days ago • 10 • 1

Elliott/LUFFY-Qwen-Math-7B-Zero-On-Policy

Text Generation • Updated 22 days ago • 3

Elliott/LUFFY-Qwen-Math-7B-Zero

Text Generation • Updated 22 days ago • 135 • 1

datasets 1

Elliott/Openr1-Math-46k-8192

Viewer • Updated 22 days ago • 45.8k • 388 • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs