Training-Free Efficient Video Generation via Dynamic Token Carving (Paper 2505.16864, published 17 days ago)
Emerging Properties in Unified Multimodal Pretraining (Paper 2505.14683, published 19 days ago)
ByteDance is absolutely cooking lately 🔥

BAGEL 🥯 is a 7B-active-parameter open multimodal foundation model by the ByteDance Seed team.

ByteDance-Seed/BAGEL-7B-MoT

✨ Apache 2.0
✨ Outperforms top VLMs (Qwen2.5-VL & InternVL-2.5)
✨ Mixture-of-Transformer-Experts + dual encoders
✨ Trained on trillions of interleaved tokens