Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging Paper • 2410.01610 • Published Oct 2, 2024
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published Dec 15, 2024 • 29
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published Dec 15, 2024 • 29