Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 3 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

liked a model 4 days ago

microsoft/MAI-DS-R1

upvoted a paper 5 days ago

BitNet b1.58 2B4T Technical Report

taufiqdp's activity

upvoted a paper 3 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 4 days ago • 85

upvoted a paper 5 days ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 5 days ago • 59

upvoted 4 papers 6 days ago

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

Paper • 2504.10462 • Published 7 days ago • 14

upvoted a paper 7 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 14 days ago • 117

upvoted 2 papers 8 days ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published 10 days ago • 38

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 10 days ago • 119

upvoted 2 papers 10 days ago

Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 11 days ago • 26

Kimi-VL Technical Report

Paper • 2504.07491 • Published 12 days ago • 115

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 14 days ago • 72

upvoted a collection 12 days ago

Cogito v1 Preview

Collection

5 items • Updated 14 days ago • 101

upvoted a paper 13 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 14 days ago • 164

upvoted a paper 14 days ago

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Paper • 2504.04823 • Published 15 days ago • 29

upvoted a paper 15 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 18 days ago • 52

upvoted a collection 16 days ago

Llama 4

Collection

Llama 4 release • 10 items • Updated 16 days ago • 439

upvoted a paper 19 days ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published 20 days ago • 39

upvoted a paper 20 days ago

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?

Paper • 2504.00509 • Published 21 days ago • 21

upvoted a paper 22 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 25 days ago • 43