Update README.md
README.md
CHANGED
@@ -52,6 +52,10 @@ With respect to general capabilities, evaluations indicate modest improvements o
 
 
 
+<figcaption style="text-align:center; font-size:0.9em; color:#666">
+Figure 2: When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use n=16 and max_tokens=32768 for mathematical tasks and n=2 for others)
+</figcaption>
+
 
 ## 4. How to Run Locally
 
@@ -181,7 +185,7 @@ We recommend adhering to the following configurations when utilizing the Zhi-wri
 
 * Set the temperature within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs.
 
-* When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` for mathematical tasks and `n=2` for others)
+* When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` and `max_tokens=32768` for mathematical tasks and `n=2` for others)
 
 * To ensure that the model engages in thorough reasoning like DeepSeek-R1 series models, we recommend enforcing the model to initiate its response with "\<think\>\n" at the beginning of every output.
 
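The evaluation settings touched by this change (temperature 0.6, `n=16` with `max_tokens=32768` for mathematical tasks, `n=2` otherwise, and a response that starts with `<think>\n`) could be wired together roughly as follows. This is a minimal sketch, not the authors' evaluation harness: `SAMPLING`, `with_think_prefix`, `averaged_accuracy`, and the `sample` and `is_correct` callables are hypothetical names introduced here for illustration. It also assumes a completion-style backend where text appended to the prompt becomes the start of the output; chat templates may need a different mechanism.

```python
from statistics import mean
from typing import Callable, List, Sequence

# Numbers are taken from the README text in the diff above.
# The dictionary layout and key names are assumptions for this sketch.
SAMPLING = {
    "math": {"n": 16, "max_tokens": 32768, "temperature": 0.6},
    "other": {"n": 2, "temperature": 0.6},
}

THINK_PREFIX = "<think>\n"


def with_think_prefix(prompt: str) -> str:
    """Append the reasoning prefix unless the prompt already ends with it."""
    return prompt if prompt.endswith(THINK_PREFIX) else prompt + THINK_PREFIX


def averaged_accuracy(
    prompts: Sequence[str],
    references: Sequence[str],
    sample: Callable[..., List[str]],        # hypothetical: returns n completions
    is_correct: Callable[[str, str], bool],  # hypothetical: task-specific grader
    task: str = "math",
) -> float:
    """Average correctness over n samples per prompt, then over all prompts."""
    cfg = SAMPLING[task]
    per_prompt = []
    for prompt, reference in zip(prompts, references):
        outputs = sample(with_think_prefix(prompt), **cfg)
        scores = [1.0 if is_correct(out, reference) else 0.0 for out in outputs]
        per_prompt.append(mean(scores))
    # statistics.mean raises StatisticsError when there are no prompts.
    return mean(per_prompt)
```

Any backend can be plugged in through `sample`, as long as it accepts the keyword arguments in `SAMPLING` and returns one string per requested sample. Averaging per prompt first keeps each prompt equally weighted.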