alexredna
/
TinyLlama-1.1B-Chat-v1.0-reasoning-v2-dpo

Model card Files Files and versions Metrics Training metrics Community
1