Update README.md
README.md
CHANGED
@@ -1,3 +1,9 @@
+GPG: A Simple and Strong Reinforcement Learning
+Baseline for Model Reasoning
+https://arxiv.org/abs/2504.02546
+
+The RL model trained on the Open-r1 dataset based on GPG, using DeepSeek-R1-Distill-Qwen-1.5B as the baseline model.
+
 ---
 license: apache-2.0
 datasets: