Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset less than a minute ago

davanstrien/testarxiv

updated a dataset 3 minutes ago

davanstrien/testarxiv-out

updated a dataset 4 minutes ago

davanstrien/dataset-creation-scripts

View all activity

Organizations

davanstrien's activity

upvoted a paper about 21 hours ago

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Paper • 2411.05735 • Published Nov 8, 2024 • 1

upvoted a collection 5 days ago

Cell2Sentence Models

Cell2Sentence models trained for single-cell tasks • 5 items • Updated 7 days ago • 6

upvoted a collection 6 days ago

blt

4 items • Updated 6 days ago • 17

upvoted a paper 6 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 60

upvoted a collection 7 days ago

🏜️MIRAGE-Bench [NAACL'25]

Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated 23 days ago • 2

upvoted a paper 7 days ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 8 days ago • 11

upvoted a collection 8 days ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 7 days ago • 12

upvoted a collection 9 days ago

Apriel

ServiceNow Language Modeling Lab's first model family series • 2 items • Updated 9 days ago • 7

upvoted 6 collections 12 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 12 items • Updated about 1 hour ago • 16

kl3m

KL3M models and tokenizers • 13 items • Updated Feb 1 • 2

kl3m-data

25 items • Updated 12 days ago • 3

kl3m-index

KL3M Dataset Indices • 7 items • Updated 28 days ago • 1

KL3M Embeddings

7 items • Updated Mar 17 • 1

ALEA Mid- and Post-Train Resources

Various Q&A, abstractive/extractive summarization, classification, drafting, prediction, and conversational tasks • 9 items • Updated 13 days ago • 2

upvoted 2 collections 14 days ago

Reasoning Required?

4 items • Updated 8 days ago • 4

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 11 days ago • 61

upvoted 2 papers 14 days ago

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 18 days ago • 76

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 20 days ago • 53