Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 19 items • Updated 5 days ago • 24
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 255
Whisper Collection OpenAI Whisper speech recognition models in MLX format • 48 items • Updated Oct 1, 2024 • 43