Zhihu-ai/Zhi-writing-dsr1-14b-gptq-int4

Text Generation

text-generation-inference

4-bit precision


Parkerlambert123 committed 4 days ago

Commit 7fb75cb · verified · 1 Parent(s): 9ca0aa4

Update README.md

Files changed (1)

README.md +5 -1


@@ -52,6 +52,10 @@ With respect to general capabilities, evaluations indicate modest improvements o
 ![general](./images/general_score.png)
+<figcaption style="text-align:center; font-size:0.9em; color:#666">
+Figure 2: When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use n=16 and max_tokens=32768 for mathematical tasks and n=2 for others)
+</figcaption>
 ## 4. How to Run Locally
@@ -181,7 +185,7 @@ We recommend adhering to the following configurations when utilizing the Zhi-wri
 * Set the temperature within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs.
-* When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` for mathematical tasks and `n=2` for others)
+* When evaluating model performance, it is recommended to conduct multiple tests and average the results. (We use `n=16` and `max_tokens=32768` for mathematical tasks and `n=2` for others)
 * To ensure that the model engages in thorough reasoning like DeepSeek-R1 series models, we recommend enforcing the model to initiate its response with "\<think\>\n" at the beginning of every output.
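
The recommendations in the changed bullets can be sketched in plain Python as follows. This is only an illustration: the helper names (`sampling_config`, `with_think_prefix`, `average_score`) and the `"math"` task label are hypothetical and do not appear in the README, and the sketch assumes the prompt has already been rendered with the model's chat template. Passing these settings to an actual inference engine is left out.

```python
from statistics import mean

# Prefix the README recommends forcing at the start of every response.
THINK_PREFIX = "<think>\n"


def sampling_config(task: str) -> dict:
    """Return evaluation sampling settings following the README's recommendations.

    The "math" label is a hypothetical way to mark mathematical tasks.
    """
    is_math = task == "math"
    # 0.6 is the recommended temperature within the 0.5-0.7 range.
    config = {"temperature": 0.6, "n": 16 if is_math else 2}
    if is_math:
        # The README states max_tokens=32768 only for mathematical tasks.
        config["max_tokens"] = 32768
    return config


def with_think_prefix(prompt: str) -> str:
    """Append the think prefix so generation starts inside the reasoning block."""
    if prompt.endswith(THINK_PREFIX):
        return prompt
    return prompt + THINK_PREFIX


def average_score(scores: list[float]) -> float:
    """Average the scores of the n sampled responses for one task."""
    return mean(scores)
```

For a mathematical task, `sampling_config("math")` yields 16 samples per prompt with a 32768-token limit, and the per-sample scores are then combined with `average_score`.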