Mungert committed on
Commit 203cf2c · verified · 1 Parent(s): c7af1a9

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -79,7 +79,7 @@ All tests conducted on **Llama-3-8B-Instruct** using:
  - 2048-token context window
  - Same prompt set across all quantizations
 
- ### **Key Improvements**
+ ### **Method**
  - **Dynamic Precision Allocation**:
  - First/Last 25% of layers → IQ4_XS (selected layers)
  - Middle 50% → IQ2_XXS/IQ3_S (increase efficiency)
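
Note on the renamed section: the allocation rule listed under **Method** can be illustrated with a minimal Python sketch. This is not the author's quantization code. The function name `quant_type_for_layer` and the `middle_type` parameter are hypothetical, the choice of a single middle type (IQ3_S by default) is an assumption because the README lists IQ2_XXS/IQ3_S together, and the "(selected layers)" qualifier is ignored, so every layer in the outer bands is mapped to IQ4_XS.

```python
def quant_type_for_layer(index: int, n_layers: int, middle_type: str = "IQ3_S") -> str:
    """Return an illustrative quantization type for a 0-based layer index.

    Hypothetical helper: the first and last 25% of layers map to IQ4_XS,
    the middle 50% map to `middle_type` (assumed to be IQ2_XXS or IQ3_S).
    """
    if not 0 <= index < n_layers:
        raise ValueError("layer index out of range")
    edge = n_layers // 4  # size of each outer 25% band, rounded down
    if index < edge or index >= n_layers - edge:
        return "IQ4_XS"
    return middle_type


# Llama-3-8B-Instruct has 32 transformer layers.
plan = [quant_type_for_layer(i, 32) for i in range(32)]
for i, qtype in enumerate(plan):
    print(f"layer {i:2d}: {qtype}")
```

With 32 layers, the sketch assigns layers 0 to 7 and 24 to 31 to IQ4_XS and layers 8 to 23 to the middle type. Passing `middle_type="IQ2_XXS"` switches the middle band to the lower-precision option.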