Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset about 3 hours ago

data-is-better-together/fineweb-c-progress

updated a dataset about 4 hours ago

librarian-bots/model_cards_with_metadata

updated a dataset about 9 hours ago

librarian-bots/dataset-columns

View all activity

Organizations

davanstrien's activity

New activity in NousResearch/Minos-v1 3 days ago

add library tag

#1 opened 3 days ago by

commented a paper 5 days ago

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Paper • 2411.05735 • Published Nov 8, 2024 • 1 •

New activity in newfacade/LeetCodeDataset 5 days ago

add citation info

#2 opened 5 days ago by

New activity in davanstrien/ModernBERT-based-Reasoning-Required 11 days ago

training code

#2 opened 12 days ago by

commented a paper 11 days ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 12 days ago • 11 •

New activity in zwhe99/DeepMath-103K 11 days ago

add full citation

#3 opened 11 days ago by

New activity in reasoning-datasets-competition/README 12 days ago

Competition Lobby

#1 opened 17 days ago by

New activity in davanstrien/fine-reasoning-questions 12 days ago

Update README.md

#1 opened 12 days ago by

New activity in davanstrien/reasoning-required 17 days ago

add paper link

#9 opened 17 days ago by

New activity in davanstrien/reasoning-required 19 days ago

add background

#8 opened 19 days ago by

New activity in davanstrien/ModernBERT-based-Reasoning-Required 19 days ago

Update README.md

#1 opened 19 days ago by

New activity in davanstrien/reasoning-required 19 days ago

remove dead code

#7 opened 19 days ago by

[bot] Conversion to Parquet

#1 opened 22 days ago by

parquet-converter

add code link

#6 opened 19 days ago by

Upload gen_data.py

#5 opened 19 days ago by

Upload gen_data.py

#4 opened 19 days ago by

Update README.md

#3 opened 19 days ago by

Update README.md

#2 opened 19 days ago by

commented a paper 25 days ago

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Paper • 2502.13124 • Published Feb 18 • 6 •

commented a paper 27 days ago

BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction

Paper • 2503.19658 • Published Mar 25 • 2 •