Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer

Commit History

Update README.md
30e6682
verified

Zhangchen Xu commited on

Update README.md
9ebe5f7
verified

Zhangchen Xu commited on

Update README.md
88d17cd
verified

Zhangchen Xu commited on

End of training
f98f101
verified

Zhangchen Xu commited on

Model save
d8eec6f
verified

Zhangchen Xu commited on