MLX
Safetensors
mixtral
trl
orpo
Generated from Trainer
muhtasham's picture
Upload folder using huggingface_hub
e4c692d verified
metadata
license: apache-2.0
tags:
  - trl
  - orpo
  - generated_from_trainer
  - mlx
base_model: mistral-community/Mixtral-8x22B-v0.1
datasets:
  - argilla/distilabel-capybara-dpo-7k-binarized
model-index:
  - name: zephyr-orpo-141b-A35b-v0.1
    results: []

mlx-community/zephyr-orpo-141b-A35b-v0.1-4bit

This model was converted to MLX format from HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1. Refer to the original model card for more details on the model.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/zephyr-orpo-141b-A35b-v0.1-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)