liumy2010
/

Llama-3.2-1B-countdown-R3

+---
+library_name: transformers
+pipeline_tag: text-generation
+base_model:
+- meta-llama/Llama-3.2-1B
+---
+## UFT
+This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).
+Code: https://github.com/liumy2010/UFT
+    ## References
     * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)