Update README.md
README.md
@@ -1,3 +1,12 @@
----
-license: apache-2.0
----
+---
+license: apache-2.0
+---
+### Qwen2.5-7B-Huatuo-difficulty-SFT
+
+- Base Model: [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B)
+
+- Training Epoches: 3
+- Training Objective: SFT + RL
+- Training Data:
+- SFT Data: [ReasoningEval/Huatuo-SFT-difficulty](https://huggingface.co/datasets/ReasoningEval/Huatuo-SFT-difficulty)
+- RL Data: [ReasoningEval/Huatuo-RL](https://huggingface.co/datasets/ReasoningEval/Huatuo-RL)