Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RSPO
/
Qwen2.5-7B-Instruct-GRPO
like
0
Follow
RainbowSamplingPO
3
Safetensors
qwen2
License:
mit
Model card
Files
Files and versions
Community
README.md exists but content is empty.
Downloads last month
0
Safetensors
Model size
7.62B params
Tensor type
BF16
ยท
Chat template
Files info
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Collection including
RSPO/Qwen2.5-7B-Instruct-GRPO
Qwen2.5-7B-Instruct
Collection
2 items
โข
Updated
about 20 hours ago