
Red Hat AI
company
Verified
AI & ML interests
OpenSource and AI
Recent Activity
View all activity
Organization Card
Red Hat AI
Build AI for your world
The Red Hat AI repository on Hugging Face is an open-source initiative backed by deep collaboration between IBM and Red Hat’s research, engineering, and business units. We’re committed to making AI more accessible, efficient, and community-driven from research to production.
We believe the future of AI is open. That’s why we’re sharing our latest models and research on Hugging Face, which are freely available to help researchers, developers, and organizations deploy high-performance AI at scale.
🔧 With Red Hat AI, you can:
- Use or build optimized foundation models, including Llama, Mistral, Qwen, Gemma, DeepSeek, and others, tailored for performance and accuracy in real-world deployments.
- Customize and fine-tune models for your workflows, from experimentation to production, with tools and frameworks built to support reproducible research and enterprise AI pipelines.
- Maximize inference efficiency across hardware using production-grade compression and optimization techniques like quantization (FP8, INT8, INT4), structured/unstructured sparsity, distillation, and more, ready for cost-efficient deployments with vLLM.
- Validated models by Red Hat AI offer confidence, predictability, and flexibility when deploying third-party generative AI models across the Red Hat AI platform. Red Hat AI validates models by running a series of capacity planning scenarios with GuideLLM for benchmarking, Language Model Evaluation Harness for accuracy evaluations, and vLLM for inference serving across a wide variety of AI acclerators.
🔗 Explore relevant open-source tools:
- vLLM – Serve large language models efficiently across GPUs and environments.
- LLM Compressor – Compress and optimize your own models with SOTA quantization and sparsity techniques.
- InstructLab – Fine-tune open models with your data using scalable, community-backed workflows.
- GuideLLM – Benchmark, evaluate, and guide your deployments with structured performance and latency insights.
Or learn more about our full product suite at https://www.redhat.com/en/products/ai
Collections
10
v1.0 Collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.
-
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic
Image-Text-to-Text • Updated • 9.67k • 20 -
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
Image-Text-to-Text • Updated • 6.41k • 6 -
RedHatAI/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text • Updated • 429 -
RedHatAI/Llama-4-Maverick-17B-128E-Instruct
Image-Text-to-Text • Updated • 86 • 1
models
480

RedHatAI/gemma-3-27b-it-FP8-dynamic
Image-Text-to-Text
•
Updated
•
577

RedHatAI/Sparse-Llama-3.1-8B-tldr-2of4-FP8-dynamic
Text Generation
•
Updated
•
66

RedHatAI/Sparse-Llama-3.1-8B-tldr-2of4
Text Generation
•
Updated
•
129

RedHatAI/Llama-3.1-8B-tldr-FP8-dynamic
Text Generation
•
Updated
•
8
•
1

RedHatAI/gemma-3-4b-it-FP8-dynamic
Image-Text-to-Text
•
Updated
•
69

RedHatAI/Llama-3.1-8B-tldr
Text Generation
•
Updated
•
110
•
1

RedHatAI/gemma-3-12b-it-FP8-dynamic
Image-Text-to-Text
•
Updated
•
71

RedHatAI/gemma-3-27b-it-quantized.w4a16
Updated

RedHatAI/gemma-3-27b-it-quantized.w8a8
Image-Text-to-Text
•
Updated
•
6
•
1

RedHatAI/gemma-3-12b-it-quantized.w4a16
Image-Text-to-Text
•
Updated
•
20
datasets
0
None public yet