Spaces:
Configuration error
Configuration error
# VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing (ICLR 2025) | |
## [<a href="https://knightyxp.github.io/VideoGrain_project_page/" target="_blank">Project Page</a>] | |
[](https://arxiv.org/abs/) | |
[](https://huggingface.co/spaces/) | |
[](https://mc-e.github.io/project/ReVideo/) | |
## βΆοΈ Setup Environment | |
Our method is tested using cuda12.1, fp16 of accelerator and xformers on a single L40. | |
```bash | |
# Step 1: Create and activate Conda environment | |
conda create -n st-modulator python==3.10 | |
conda activate st-modulator | |
# Step 2: Install PyTorch, CUDA and Xformers | |
conda install pytorch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 pytorch-cuda=12.1 -c pytorch -c nvidia | |
pip install --pre -U xformers==0.0.27 | |
# Step 3: Install additional dependencies with pip | |
pip install -r requirements.txt | |
``` | |
`xformers` is recommended to save memory and running time. | |
</details> | |
You may download all data and checkpoints using the following bash command | |
```bash | |
bash download_all.sh | |
``` | |
## π Prepare all the data | |
``` | |
gdown https://drive.google.com/file/d/1dzdvLnXWeMFR3CE2Ew0Bs06vyFSvnGXA/view?usp=drive_link | |
tar -zxvf videograin_data.tar.gz | |
``` | |
## π₯ ST-Modulator Editing | |
You could reproduce multi-grained editing results in our teaser by running: | |
```bash | |
bash test.sh | |
#or accelerate launch test.py --config config/run_two_man.yaml | |
``` | |
<details><summary>The result is saved at `./result` . (Click for directory structure) </summary> | |
``` | |
result | |
βββ run_two_man | |
β βββ infer_samples | |
β βββ sample | |
β βββ step_0 # result image folder | |
β βββ step_0.mp4 # result video | |
β βββ source_video.mp4 # the input video | |
``` | |
</details> |