raft_study

AI & ML interests

None defined yet.

Recent Activity

hendrydong authored a paper 3 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

hendrydong authored a paper 10 days ago

Scalable Chain of Thoughts via Elastic Reasoning

hendrydong authored a paper 13 days ago

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

View all activity

models 4

raftrsf/sfr_raft_iter5_2epoch

Text Generation • Updated Jun 17, 2024 • 5

raftrsf/sfr_raft_iter4_2epoch

Text Generation • Updated Jun 13, 2024 • 9

raftrsf/sfr_raft_iter4

Text Generation • Updated Jun 13, 2024 • 11

raftrsf/pair_pref

Text Generation • Updated May 18, 2024 • 8

datasets 8

raftrsf/sfr_concise_iter5_top1

Viewer • Updated Jun 14, 2024 • 20k • 18

raftrsf/sfr_concise_iter5_k32_with_rewards

Viewer • Updated Jun 14, 2024 • 20k • 21

raftrsf/sfr_concise_iter4_top1

Viewer • Updated Jun 12, 2024 • 20k • 13

raftrsf/sfr_concise_iter4_k32_with_rewards

Viewer • Updated Jun 12, 2024 • 20k • 42

raftrsf/ipo_eval_data_baseline.json

Viewer • Updated May 18, 2024 • 7.62k • 22

raftrsf/zephyr_pi0_gen_57k_for_offline_dpo_ipo

Viewer • Updated May 7, 2024 • 57.5k • 22

raftrsf/iterative_ipo_pm_iter1_n4

Viewer • Updated Apr 25, 2024 • 13.5k • 20

raftrsf/iterative_ipo_pm_iter1

Viewer • Updated Apr 24, 2024 • 13.5k • 67