Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
Llama-3.1-8B-Magpie-Align-v0.2 / eval_results.json

Commit History

End of training
f98f101
verified

Zhangchen Xu commited on