Overflow Prevention Enhances Long-Context Recurrent LLMs Paper • 2505.07793 • Published 7 days ago • 3
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation Paper • 2505.07233 • Published 7 days ago • 7
MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills Paper • 2505.06176 • Published 10 days ago • 10
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent Paper • 2505.07596 • Published 7 days ago • 10
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published 7 days ago • 74
WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch Paper • 2505.03733 • Published 13 days ago • 16
Learning Dynamics in Continual Pre-Training for Large Language Models Paper • 2505.07796 • Published 7 days ago • 18
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published 7 days ago • 29
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection Paper • 2505.07293 • Published 7 days ago • 24
REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback Paper • 2505.06548 • Published 9 days ago • 27
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 12 days ago • 60
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published 11 days ago • 69
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 14 days ago • 70
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published 20 days ago • 88