SimpleRL-Zoo

hkust-nlp's Collections

updated 19 days ago

The collection for the paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild".