Unicorn: Text-Only Data Synthesis for Vision Language Model Training Paper • 2503.22655 • Published 29 days ago • 38
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction Paper • 2412.06782 • Published Dec 9, 2024 • 7