Parkerlambert123 commited on
Commit
7fb75cb
·
verified ·
1 Parent(s): 9ca0aa4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -52,6 +52,10 @@ With respect to general capabilities, evaluations indicate modest improvements o
52
 
53
  ![general](./images/general_score.png)
54
 
 
 
 
 
55
 
56
  ## 4. How to Run Locally
57
 
@@ -181,7 +185,7 @@ We recommend adhering to the following configurations when utilizing the Zhi-wri
181
 
182
  * Set the temperature within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs.
183
 
184
- * When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` for mathematical tasks and `n=2` for others)
185
 
186
  * To ensure that the model engages in thorough reasoning like DeepSeek-R1 series models, we recommend enforcing the model to initiate its response with "\<think\>\n" at the beginning of every output.
187
 
 
52
 
53
  ![general](./images/general_score.png)
54
 
55
+ <figcaption style="text-align:center; font-size:0.9em; color:#666">
56
+ Figure 2: When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use n=16 and max_tokens=32768 for mathematical tasks and n=2 for others)
57
+ </figcaption>
58
+
59
 
60
  ## 4. How to Run Locally
61
 
 
185
 
186
  * Set the temperature within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs.
187
 
188
+ * When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` and `max_tokens=32768` for mathematical tasks and `n=2` for others)
189
 
190
  * To ensure that the model engages in thorough reasoning like DeepSeek-R1 series models, we recommend enforcing the model to initiate its response with "\<think\>\n" at the beginning of every output.
191