-
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 78 -
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper • 2502.06329 • Published • 131 -
Competitive Programming with Large Reasoning Models
Paper • 2502.06807 • Published • 70
Julian Wergieluk
jwergieluk
·
AI & ML interests
machine learning, mathematics, optimization
Organizations
Collections
1
models
0
None public yet
datasets
0
None public yet