Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 7 days ago • 46
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 16 days ago • 61
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation Paper • 2503.22194 • Published 27 days ago • 24
LEGION: Learning to Ground and Explain for Synthetic Image Detection Paper • 2503.15264 • Published Mar 19 • 21
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster Paper • 2503.09662 • Published Mar 12 • 34
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published Mar 7 • 35
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation Paper • 2502.18302 • Published Feb 25 • 5
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Paper • 2502.17157 • Published Feb 24 • 53
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published Feb 24 • 79
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 143
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13 • 28
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper • 2502.07701 • Published Feb 11 • 36
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Paper • 2502.04320 • Published Feb 6 • 37
Generating Multi-Image Synthetic Data for Text-to-Image Customization Paper • 2502.01720 • Published Feb 3 • 8