Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ datasets:
 - **Model Developers:** Red Hat (Neural Magic)
 
 This model is a quantized version of [RedHatAI/Sparse-Llama-3.1-8B-tldr-2of4](https://huggingface.co/RedHatAI/Sparse-Llama-3.1-8B-tldr-2of4), which is fine-tuned on the [trl-lib/tldr](https://huggingface.co/datasets/trl-lib/tldr) dataset.
-This sparse-quantized model recovers 100% of the BERTScore (0.366) obtained by the dense model [RedHatAI/Llama-3.1-8B-tldr](https://huggingface.co/RedHatAI/Llama-3.1-8B-tldr).
+This sparse-quantized model recovers 100% of the BERTScore (0.366) obtained by the dense model [RedHatAI/Llama-3.1-8B-tldr](https://huggingface.co/RedHatAI/Llama-3.1-8B-tldr) while providing up to 1.6x speedup.
 
 
 ## Deployment