Papers
arxiv:2503.07598

VACE: All-in-One Video Creation and Editing

Published on Mar 10
ยท Submitted by BestWishYsh on Mar 11
Authors:
,
,

Abstract

VACE, an all-in-one framework for video creation and editing, integrates multiple tasks within a unified model using a Video Condition Unit and Context Adapter for flexible and consistent video synthesis.

AI-generated summary

Diffusion Transformer has demonstrated powerful capability and scalability in generating high-quality images and videos. Further pursuing the unification of generation and editing tasks has yielded significant progress in the domain of image content creation. However, due to the intrinsic demands for consistency across both temporal and spatial dynamics, achieving a unified approach for video synthesis remains challenging. We introduce VACE, which enables users to perform Video tasks within an All-in-one framework for Creation and Editing. These tasks include reference-to-video generation, video-to-video editing, and masked video-to-video editing. Specifically, we effectively integrate the requirements of various tasks by organizing video task inputs, such as editing, reference, and masking, into a unified interface referred to as the Video Condition Unit (VCU). Furthermore, by utilizing a Context Adapter structure, we inject different task concepts into the model using formalized representations of temporal and spatial dimensions, allowing it to handle arbitrary video synthesis tasks flexibly. Extensive experiments demonstrate that the unified model of VACE achieves performance on par with task-specific models across various subtasks. Simultaneously, it enables diverse applications through versatile task combinations. Project page: https://ali-vilab.github.io/VACE-Page/.

Community

Paper submitter
Paper author

deleted
This comment has been hidden (marked as Off-Topic)
Paper author

We are delighted to announce that the VACE-Preview version has been officially released (for more details, please refer to Huggingface and Modelscope). You can now download and enjoy this preview version. If you have any questions or feedback, please feel free to let us know to help us further improve.

IMG_5555.png

6d0b2dd5-3ea8-42b4-8e39-dc8502aa1889.png

make the child moving

make the child and ball moving
lsxZyd9CZwlH8E2m5ZKWq.png

Sign up or log in to comment

Models citing this paper 5

Browse 5 models citing this paper

Datasets citing this paper 1

Spaces citing this paper 8

Collections including this paper 11