[Fine-tuning] ๐SFT/DPO/GRPO support!
#20
by
study-hjt
- opened
currently only the training of the thinker part is supported... (text/audio/image/video -> text)
study-hjt
changed discussion title from
๐SFT/DPO/GRPO support!
to [Fine-tuning] ๐SFT/DPO/GRPO support!