JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization Paper • 2503.23377 • Published 27 days ago • 52
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Paper • 2410.06244 • Published Oct 8, 2024 • 19
openai/clip-vit-large-patch14 Zero-Shot Image Classification • Updated Sep 15, 2023 • 45M • 1.71k