liumy2010 commited on
Commit
9e0e792
·
verified ·
1 Parent(s): a6d6d9f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -1,4 +1,17 @@
1
- ## References
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
  * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
4
 
 
1
+ ---
2
+ library_name: transformers
3
+ pipeline_tag: text-generation
4
+ base_model:
5
+ - meta-llama/Llama-3.2-1B
6
+ ---
7
+
8
+ ## UFT
9
+
10
+ This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).
11
+
12
+ Code: https://github.com/liumy2010/UFT
13
+
14
+ ## References
15
 
16
  * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
17