Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
Sparse-Llama-3.1-8B-tldr-2of4-FP8-dynamic
like
0
Follow
Red Hat AI
1.08k
Text Generation
Transformers
Safetensors
trl-lib/tldr
llama
text-generation-inference
compressed-tensors
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Sparse-Llama-3.1-8B-tldr-2of4-FP8-dynamic
/
inference_performance
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
alexmarques
Rename chart (3).png to inference_performance/latency.png
9c18b32
verified
5 days ago
latency.png
32.6 kB
Rename chart (3).png to inference_performance/latency.png
5 days ago