Update README.md
Browse files
README.md
CHANGED
@@ -79,7 +79,7 @@ All tests conducted on **Llama-3-8B-Instruct** using:
|
|
79 |
- 2048-token context window
|
80 |
- Same prompt set across all quantizations
|
81 |
|
82 |
-
### **
|
83 |
- **Dynamic Precision Allocation**:
|
84 |
- First/Last 25% of layers → IQ4_XS (selected layers)
|
85 |
- Middle 50% → IQ2_XXS/IQ3_S (increase efficiency)
|
|
|
79 |
- 2048-token context window
|
80 |
- Same prompt set across all quantizations
|
81 |
|
82 |
+
### **Method**
|
83 |
- **Dynamic Precision Allocation**:
|
84 |
- First/Last 25% of layers → IQ4_XS (selected layers)
|
85 |
- Middle 50% → IQ2_XXS/IQ3_S (increase efficiency)
|