Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, AI4Science

Recent Activity

liked a model 3 days ago

allenai/ACE2-ERA5

liked a model 7 days ago

microsoft/aurora

liked a Space about 2 months ago

Leiyre/memory-viz

View all activity

Organizations

None yet

tbouvier's activity

liked a model 3 days ago

allenai/ACE2-ERA5

Updated Nov 21, 2024 • 3

liked a model 7 days ago

microsoft/aurora

Updated Jan 15 • 20

liked a Space about 2 months ago

Memory Viz

🧠

Memory Viz

liked 2 Spaces 2 months ago

Predict Memory

🧮

Calculate memory usage from model configurations

2.53k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 2 months ago

PleIAs/common_corpus

Viewer • Updated Feb 11 • 470M • 37.6k • 254

liked a dataset 3 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jan 31 • 3.3B • 257k • 666

liked 2 models 3 months ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 13k • 244

meta-llama/Llama-3.2-3B

Text Generation • Updated Oct 24, 2024 • 611k • • 555

liked a model 4 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated Mar 27 • 738k • • 3.82k

upvoted a collection 4 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

liked a model 4 months ago

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 818k • 835

liked 2 Spaces 4 months ago

TheWell

🌍

Visualization of data from the Well

924

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model 4 months ago

deepseek-ai/DeepSeek-V3-Base

Updated Mar 27 • 6.3k • 1.63k

liked a dataset 9 months ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 813k • 217

liked a model 9 months ago

apple/DCLM-7B

Updated Jul 26, 2024 • 451 • 833