Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
raft_study
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
hendrydong
authored
a paper
3 days ago
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
hendrydong
authored
a paper
10 days ago
Scalable Chain of Thoughts via Elastic Reasoning
hendrydong
authored
a paper
13 days ago
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
View all activity
Team members
3
models
4
Sort: Recently updated
raftrsf/sfr_raft_iter5_2epoch
Text Generation
•
Updated
Jun 17, 2024
•
5
raftrsf/sfr_raft_iter4_2epoch
Text Generation
•
Updated
Jun 13, 2024
•
9
raftrsf/sfr_raft_iter4
Text Generation
•
Updated
Jun 13, 2024
•
11
raftrsf/pair_pref
Text Generation
•
Updated
May 18, 2024
•
8
datasets
8
Sort: Recently updated
raftrsf/sfr_concise_iter5_top1
Viewer
•
Updated
Jun 14, 2024
•
20k
•
18
raftrsf/sfr_concise_iter5_k32_with_rewards
Viewer
•
Updated
Jun 14, 2024
•
20k
•
21
raftrsf/sfr_concise_iter4_top1
Viewer
•
Updated
Jun 12, 2024
•
20k
•
13
raftrsf/sfr_concise_iter4_k32_with_rewards
Viewer
•
Updated
Jun 12, 2024
•
20k
•
42
raftrsf/ipo_eval_data_baseline.json
Viewer
•
Updated
May 18, 2024
•
7.62k
•
22
raftrsf/zephyr_pi0_gen_57k_for_offline_dpo_ipo
Viewer
•
Updated
May 7, 2024
•
57.5k
•
22
raftrsf/iterative_ipo_pm_iter1_n4
Viewer
•
Updated
Apr 25, 2024
•
13.5k
•
20
raftrsf/iterative_ipo_pm_iter1
Viewer
•
Updated
Apr 24, 2024
•
13.5k
•
67