Han-Bit Kang

hbkang

AI & ML interests

Recent Activity

upvoted a paper 4 days ago

Cobra: Efficient Line Art COlorization with BRoAder References

upvoted a paper 6 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

updated a collection 11 days ago

cool-papers

View all activity

Organizations

None yet

hbkang's activity

upvoted a paper 4 days ago

Cobra: Efficient Line Art COlorization with BRoAder References

Paper • 2504.12240 • Published 4 days ago • 25

upvoted a paper 6 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 9 days ago • 119

upvoted a paper 11 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 13 days ago • 71

upvoted 3 papers 13 days ago

upvoted 3 papers 14 days ago

Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization

Paper • 2504.03011 • Published 17 days ago • 9

HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

Paper • 2504.03536 • Published 17 days ago • 11

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published 18 days ago • 41

upvoted a paper 17 days ago

Multi-Token Attention

Paper • 2504.00927 • Published 19 days ago • 44

upvoted a paper 18 days ago

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published 19 days ago • 26

upvoted a paper 20 days ago

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Paper • 2503.21732 • Published 24 days ago • 8

upvoted a paper 21 days ago

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Paper • 2503.21144 • Published 25 days ago • 25

upvoted 2 papers 24 days ago

DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Paper • 2503.15667 • Published Mar 19 • 8

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published 26 days ago • 40

upvoted 3 papers 25 days ago

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published 25 days ago • 14

Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models

Paper • 2503.20240 • Published 26 days ago • 22

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 98

upvoted a paper 27 days ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 73

upvoted a paper 28 days ago

Tokenize Image as a Set

Paper • 2503.16425 • Published Mar 20 • 15