Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
chloeli
/
qwen-2.5-0.5B-instruct-sft-lora-countdown-o3-1k
like
0
Text Generation
Transformers
Safetensors
MelinaLaimon/stream-of-search
qwen2
Generated from Trainer
alignment-handbook
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
ca87287
qwen-2.5-0.5B-instruct-sft-lora-countdown-o3-1k
Commit History
Model save
ca87287
verified
chloeli
commited on
Mar 25
Training in progress, step 125
292d05c
verified
chloeli
commited on
Mar 25
Training in progress, step 100
2391533
verified
chloeli
commited on
Mar 25
initial commit
829dc05
verified
chloeli
commited on
Mar 25