- Time Blindness: Why Video-Language Models Can't See What Humans Can? — arXiv:2505.24867, published 7 days ago
- ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models — arXiv:2505.24864, published 7 days ago
- To Trust Or Not To Trust Your Vision-Language Model's Prediction — arXiv:2505.23745, published 8 days ago
- Diffusion Classifiers Understand Compositionality, but Conditions Apply — arXiv:2505.17955, published 14 days ago
- Alchemist: Turning Public Text-to-Image Data into Generative Gold — arXiv:2505.19297, published 12 days ago
- Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis — arXiv:2505.10046, published 23 days ago
- Flow-GRPO: Training Flow Matching Models via Online RL — arXiv:2505.05470, published 29 days ago
- You Do Not Fully Utilize Transformer's Representation Capacity — arXiv:2502.09245, published Feb 13
- SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference — arXiv:2502.18137, published Feb 25
- REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers — arXiv:2504.10483, published Apr 14
- InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models — arXiv:2504.10479, published Apr 14
- Article: You could have designed state of the art positional encoding — by FL33TW00D-HF, Nov 25, 2024