Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated 4 days ago • 22
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 3 days ago • 40
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published 5 days ago • 53
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 21 days ago • 125
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 13 days ago • 245
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 26 days ago • 82
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 3 days ago • 17
Orpheus Multilingual Research Release Collection Beta release of multilingual models. • 12 items • Updated 17 days ago • 76
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 20 days ago • 176