Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.2
Safetensors
Magpie-Align/Llama-3.1-70B-PO-100K-armorm
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
arxiv:2406.08464
arxiv:2406.12845
License: llama3.1
main / Llama-3.1-8B-Magpie-Align-v0.2 / README.md
Commit History
30e6682  Update README.md  (verified)  Zhangchen Xu committed on Aug 19, 2024
9ebe5f7  Update README.md  (verified)  Zhangchen Xu committed on Aug 19, 2024
88d17cd  Update README.md  (verified)  Zhangchen Xu committed on Aug 19, 2024
f98f101  End of training  (verified)  Zhangchen Xu committed on Aug 3, 2024
d8eec6f  Model save  (verified)  Zhangchen Xu committed on Aug 3, 2024