view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 136
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • Mar 10 • 142
Running 563 563 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 251
openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 6.34M • • 2.37k