hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
•
53.1k
•
1.09k
•
4
The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"