7 31 80

neuralink

AI & ML interests

nanotron @ hf

Recent Activity

upvoted an article 14 days ago

You could have designed state of the art positional encoding

upvoted an article 14 days ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

liked a dataset 17 days ago

nanotron/ultrascale-playbook-data

View all activity

Organizations

neuralink's activity

liked a dataset 17 days ago

nanotron/ultrascale-playbook-data

Updated Mar 12 • 11.7k • 5

liked a Space about 2 months ago

Predict Memory

🧮

Calculate memory usage from model configurations

liked 2 Spaces 2 months ago

634

Open Deep-Research

🏆

OpenAI's Deep Research, but open

2.52k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 Spaces 4 months ago

Scaling With Vocab Demo

📊

Predict optimal vocabulary size based on model parameters

Harm Space

⚡

liked a model 5 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 153 • 587

liked a model 7 months ago

meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • Updated Sep 27, 2024 • 37.9k • 510

liked a model 9 months ago

nanotron/llama3-8b-infini-attention

Updated Aug 5, 2024 • 3 • 3

liked a dataset 9 months ago

huggingface/documentation-images

Viewer • Updated about 4 hours ago • 52 • 3.05M • 60

liked a dataset 10 months ago

nanotron/minipile_100_samples

Viewer • Updated Jul 10, 2024 • 100 • 33 • 1

liked 2 Spaces 10 months ago

Train LLMs

⚡

Calculate training cost and model efficiency

Lighteval Tasks Explorer

😻

liked a model 10 months ago

nanotron/old_bench

Updated Jul 6, 2024 • 3

liked a dataset 10 months ago

rokset3/slim_pajama_chunk_1

Viewer • Updated Nov 15, 2023 • 59M • 267 • 2

liked a model 10 months ago

meta-llama/Llama-2-7b-hf

Text Generation • Updated Apr 17, 2024 • 910k • 2.04k

liked a model 11 months ago

Snowflake/snowflake-arctic-embed-m

liked a dataset 11 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jan 31 • 3.3B • 254k • 665

liked a Space 11 months ago

922

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a dataset about 1 year ago

teknium/openhermes

Viewer • Updated Sep 7, 2023 • 243k • 424 • 207