[Fine-tuning] ๐Ÿš€SFT/DPO/GRPO support!

#20
by study-hjt - opened

currently only the training of the thinker part is supported... (text/audio/image/video -> text)

here ~ ๐Ÿ˜Š
https://github.com/modelscope/ms-swift/pull/3613

study-hjt changed discussion title from ๐Ÿš€SFT/DPO/GRPO support! to [Fine-tuning] ๐Ÿš€SFT/DPO/GRPO support!
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment