Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.2
Safetensors
Magpie-Align/Llama-3.1-70B-PO-100K-armorm
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
arxiv: 2406.08464
arxiv: 2406.12845
License: llama3.1
88d17cd
Llama-3.1-8B-Magpie-Align-v0.2/model-00001-of-00004.safetensors
Commit History
Training in progress, step 765
cf8884e
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 700
037bbde
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 600
0fa5b18
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 500
0a183b3
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 400
b8c4d60
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 300
fa08f18
verified
Zhangchen Xu
committed on
Aug 3, 2024
Training in progress, step 200
bf37a10
verified
Zhangchen Xu
committed on
Aug 2, 2024
Training in progress, step 100
f92c0a2
verified
Zhangchen Xu
committed on
Aug 2, 2024