Aioli: A Unified Optimization Framework for Language Model Data Mixing Paper • 2411.05735 • Published Nov 8, 2024 • 1
Cell2Sentence Models Collection Cell2Sentence models trained for single-cell tasks • 5 items • Updated 7 days ago • 6
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published Mar 10 • 60
🏜️MIRAGE-Bench [NAACL'25] Collection Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated 23 days ago • 2
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning Paper • 2504.11456 • Published 8 days ago • 11
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 7 days ago • 12
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 2 items • Updated 9 days ago • 7
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 12 items • Updated about 1 hour ago • 16
ALEA Mid- and Post-Train Resources Collection Various Q&A, abstractive/extractive summarization, classification, drafting, prediction, and conversational tasks • 9 items • Updated 13 days ago • 2
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 11 days ago • 61
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 20 days ago • 53