Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

liked a model 13 days ago

ds4sd/SmolDocling-256M-preview

liked a model 13 days ago

rasbt/llama-3.2-from-scratch

authored a paper 15 days ago

SmolVLM: Redefining small and efficient multimodal models

View all activity

Organizations

lvwerra's activity

upvoted a paper 15 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 16 days ago • 169

upvoted a collection 18 days ago

Llama 4

Llama 4 release • 10 items • Updated 18 days ago • 442

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 226

upvoted 3 articles 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

• 74

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 479

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 845

upvoted a paper 3 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 61

upvoted a collection 5 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 151

upvoted a paper 6 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 25

upvoted an article 7 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

• 31

upvoted a paper 7 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 148

upvoted a paper 8 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 130

upvoted 2 articles 8 months ago

Article

Tool Use, Unified

Aug 12, 2024

• 99

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 62