Mahdi Pourmirzaei

Mahdip72

AI & ML interests

None yet

Organizations

None yet

Mahdip72's activity

updated a model 3 days ago

Mahdip72/prot2token

Updated 3 days ago

upvoted a paper 5 days ago

D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published 8 days ago • 34

upvoted a paper 8 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 59

commented a paper 8 days ago

Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

Paper • 2505.20589 • Published 10 days ago • 6

authored a paper 8 days ago

Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

Paper • 2505.20589 • Published 10 days ago • 6

upvoted a paper 8 days ago

Thinking with Generated Images

Paper • 2505.22525 • Published 9 days ago • 13

upvoted a paper 9 days ago

Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

Paper • 2505.20589 • Published 10 days ago • 6

upvoted a paper 10 days ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

upvoted a paper 11 days ago

A decoder-only foundation model for time-series forecasting

Paper • 2310.10688 • Published Oct 14, 2023 • 6

upvoted 2 papers about 1 month ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5 • 82

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 110

upvoted 3 papers 3 months ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 42

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 165

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted 3 papers 4 months ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 61

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Paper • 2501.12375 • Published Jan 21 • 22

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 400

upvoted a paper 5 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

upvoted 2 papers 6 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 103

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 46