Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
Llama-3.1-8B-Magpie-Align-v0.2 / tokenizer_config.json

Commit History

Training in progress, step 100
f92c0a2
verified

Zhangchen Xu commited on