Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SpectralPO
's Collections
DeepSeek-R1-Distill-Llama-8B
Qwen2.5-32B-Instruct
Qwen2.5-14B-Instruct
DeepSeek-R1-Distill-Qwen-7B
Qwen2.5-7B-Instruct
Offline RL with Neg Samples
Qwen2.5-7B-Instruct
updated
Apr 27
Upvote
-
SpectralPO/Qwen2.5-7B-Instruct-GRPO
Updated
Apr 27
•
3
SpectralPO/Qwen2.5-7B-Instruct-SPO
Updated
Apr 27
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections