AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Paper • 2504.08942 • Published 10 days ago • 26
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models Paper • 2504.05303 • Published 14 days ago • 4
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Paper • 2504.11343 • Published 6 days ago • 13
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 7 days ago • 232
BitNet Collection 🔥 BitNet family of large language models (1-bit LLMs). • 6 items • Updated 4 days ago • 28
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Paper • 2504.10449 • Published 7 days ago • 10
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation Paper • 2504.08736 • Published 10 days ago • 47
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11 • 76
Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖 8 days ago • 36
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications Paper • 2408.11878 • Published Aug 20, 2024 • 60
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published 10 days ago • 25
HIGGS Collection Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run. • 18 items • Updated Feb 28 • 15
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 12 items • Updated 7 days ago • 16
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 11 days ago • 76
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 21 days ago • 251