Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
yjh00
/
Qwen2.5-1.5B-Open-R1-Distill
like
0
Text Generation
Transformers
Safetensors
HuggingFaceH4/numina-deepseek-r1-qwen-7b
qwen2
Generated from Trainer
open-r1
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
0fb8cef
Qwen2.5-1.5B-Open-R1-Distill
/
all_results.json
yjh00
Model save
0fb8cef
verified
3 months ago
raw
Copy download link
history
blame
Safe
211 Bytes
{
"total_flos"
:
28487712768000.0
,
"train_loss"
:
0.920774197101593
,
"train_runtime"
:
1759.2244
,
"train_samples"
:
16610
,
"train_samples_per_second"
:
18.19
,
"train_steps_per_second"
:
0.568
}