SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 16 days ago • 169
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 226
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 385
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 130
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 62