Wenhao Chai

wchai

http://rese1f.github.io

AI & ML interests

computer vision, artificial intelligence

wchai's activity

upvoted a paper about 5 hours ago

It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Paper • 2504.13173 • Published 4 days ago • 9

upvoted a paper 4 days ago

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published 5 days ago • 29

upvoted a paper 8 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 10 days ago • 47

upvoted 2 papers 11 days ago

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published 11 days ago • 27

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 20 days ago • 80

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 14 days ago • 72

upvoted a collection 12 days ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 9 days ago • 61

upvoted 2 papers 12 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 13 days ago • 146

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published 19 days ago • 33

upvoted a paper 13 days ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published 13 days ago • 59

upvoted a collection 17 days ago

Science-T2I

Collection

Addressing Scientific Illusions in Image Synthesis • 10 items • Updated 4 days ago • 4

upvoted a paper 19 days ago

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published 20 days ago • 26

upvoted a paper 21 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 23 days ago • 125

upvoted a collection 26 days ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 26 days ago • 89

upvoted a paper 29 days ago

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published Mar 21 • 36

upvoted 5 papers about 1 month ago

Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption

Paper • 2503.09279 • Published Mar 12 • 5

Autoregressive Image Generation with Randomized Parallel Decoding

Paper • 2503.10568 • Published Mar 13 • 8