Leandro von Werra

lvwerra

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

authored a paper 11 days ago

SmolVLM: Redefining small and efficient multimodal models

lvwerra's activity

liked 2 models 9 days ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated 27 days ago • 88.2k • 1.24k

rasbt/llama-3.2-from-scratch

Updated 3 days ago • 258

liked a Space 16 days ago

Try YourBench!

Generate a custom benchmark from any document

liked a Space about 1 month ago

QwQ 32B Demo

Send text and get detailed responses

liked 2 Spaces about 2 months ago

Open LLM Progress Tracker

Visualize Open vs. Proprietary LLM Progress

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a Space 2 months ago

DABstep Leaderboard

DABstep Reasoning Benchmark Leaderboard

liked a model 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 23 days ago • 1.72M • 11.9k

liked 2 Spaces 4 months ago

Jupyter Agent

Create and run Jupyter notebooks interactively

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

liked a dataset 4 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 119 • 33

liked a dataset 5 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 37.3k • 168

liked a Space 5 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks

liked a model 6 months ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated Mar 6 • 78.8k • 595

liked 2 Spaces 6 months ago

CinePileLeaderboard

Video-LLM evaluations on CinePile's evaluation split.

TxT360: Trillion Extracted Text

Create a large, deduplicated dataset for LLM pre-training

liked a dataset 7 months ago

HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 8.63k • 308

liked a model 8 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.21M • 3.86k

liked a model 9 months ago

google/gemma-2-2b

Text Generation • Updated Aug 7, 2024 • 528k • 539

liked a Space 10 months ago

BigCodeBench Leaderboard

Explore and analyze code evaluation data