Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
MagpieLM-8B-Chat-v0.1 / model-00004-of-00004.safetensors

Commit History

Training in progress, step 1531
e0f4dd7
verified

Zhangchen Xu commited on

Training in progress, step 1500
72f60d1
verified

Zhangchen Xu commited on

Training in progress, step 1000
c53d548
verified

Zhangchen Xu commited on

Training in progress, step 500
8656d23
verified

Zhangchen Xu commited on