Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark • Paper • 2504.13143 • Published 3 days ago • 7
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling • Paper • 2504.13169 • Published 3 days ago • 36
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding • Paper • 2504.13180 • Published 3 days ago • 13
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float • Paper • 2504.11651 • Published 5 days ago • 13
Perception Encoder: The best visual embeddings are not at the output of the network • Paper • 2504.13181 • Published 3 days ago • 20
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework • Paper • 2504.12395 • Published 4 days ago • 13
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training • Paper • 2504.13161 • Published 3 days ago • 85
WORLDMEM: Long-term Consistent World Simulation with Memory • Paper • 2504.12369 • Published 4 days ago • 28
ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering • Paper • 2504.05506 • Published 13 days ago • 19
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers • Paper • 2504.10483 • Published 6 days ago • 19
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs • Paper • 2504.11536 • Published 5 days ago • 53
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness • Paper • 2504.10514 • Published 10 days ago • 45
Cobra: Efficient Line Art COlorization with BRoAder References • Paper • 2504.12240 • Published 4 days ago • 25
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning • Paper • 2504.08672 • Published 9 days ago • 52
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding • Paper • 2504.10465 • Published 6 days ago • 27
Heimdall: test-time scaling on the generative verification • Paper • 2504.10337 • Published 6 days ago • 30