Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
Llama-3.1-8B-Magpie-Align-v0.2 / trainer_state.json
Zhangchen Xu
Model save
d8eec6f verified
raw
history
403 kB
File too large to display, you can check the raw version instead.