Lewis Tunstall PRO

lewtun

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space 24 minutes ago

open-r1/open-r1-eval-leaderboard

updated a Space 34 minutes ago

open-r1/open-r1-eval-leaderboard

updated a Space about 2 hours ago

open-r1/open-r1-eval-leaderboard

lewtun's activity

upvoted an article 4 days ago

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

Article • Jun 13, 2024 • 54

upvoted a collection 12 days ago

Cogito v1 Preview

5 items • Updated 13 days ago • 101

upvoted a paper 12 days ago

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Paper • 2504.06122 • Published 13 days ago • 6

upvoted a paper 13 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 13 days ago • 163

upvoted a paper 20 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 20 days ago • 61

upvoted a paper 21 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published 25 days ago • 43

upvoted a paper 26 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 27 days ago • 30

upvoted a paper 29 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 46

upvoted a paper about 1 month ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 41

upvoted an article about 1 month ago

Open R1: How to use OlympicCoder locally for coding?

Article • Mar 20 • 56

upvoted a paper about 1 month ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 119

upvoted a paper about 2 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 34

upvoted a collection 2 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 21 items • Updated 5 days ago • 126