arXiv link: https://arxiv.org/abs/2403.11568
| Original Videos & Edited Videos | Instruction |
|---|---|
| (video demo) | Turn the rabbit into a fox. |
| (video demo) | make it Van Gogh style |
| (video demo) | make it a white fox in the desert trail |
| (video demo) | make it snowy |
| (video demo) | add a flock of flowers flying. |
- 2024.6.5: Released the inference code.
- Planned: release the training dataset and code.
This repository is based on I2VGen-XL.
It is recommended to install Anaconda.
Windows Installation: https://docs.anaconda.com/anaconda/install/windows/
Linux Installation: https://docs.anaconda.com/anaconda/install/linux/
conda create -n animation python=3.10
conda activate animation
pip install -r requirements.txt
Please download the pretrained model to checkpoints, then set test_model to the name of the downloaded model. Add your test videos and editing instructions to data/test_list.txt, following the format of the provided examples. Then run the following command:
python inference.py --cfg configs/effived_infer.yaml
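To double-check which checkpoint a run will load, a minimal sketch like the one below can print the test_model entry. It assumes configs/effived_infer.yaml is plain YAML with a top-level test_model key; adjust the key if the config is nested differently.

```python
# Hypothetical helper (not part of the repository): print the checkpoint name
# that the inference config points to. Assumes a top-level test_model field.
import yaml

with open("configs/effived_infer.yaml") as f:
    cfg = yaml.safe_load(f)

print("test_model:", cfg.get("test_model"))
```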
You can run the following command to generate the video editing pairs:
python scripts/img2seq_augmenter.py
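This script turns image editing pairs into pseudo video editing pairs. As an illustration only (not the repository's implementation), the sketch below applies one shared zoom trajectory to a source image and its edited counterpart, so the two generated clips stay aligned frame by frame; the frame count, zoom range, and center-crop trajectory are assumptions.

```python
# Illustrative sketch: build an aligned pseudo video pair from one image
# editing pair by applying the same zoom trajectory to both images.
import random
from PIL import Image

def image_pair_to_video_pair(src_path, edited_path, num_frames=16, max_zoom=1.2):
    src, edited = Image.open(src_path), Image.open(edited_path)
    w, h = src.size
    src_frames, edited_frames = [], []
    zoom_end = random.uniform(1.0, max_zoom)  # one random trajectory shared by both clips
    for t in range(num_frames):
        zoom = 1.0 + (zoom_end - 1.0) * t / max(num_frames - 1, 1)
        crop_w, crop_h = int(w / zoom), int(h / zoom)
        left, top = (w - crop_w) // 2, (h - crop_h) // 2
        box = (left, top, left + crop_w, top + crop_h)
        src_frames.append(src.crop(box).resize((w, h)))
        edited_frames.append(edited.crop(box).resize((w, h)))
    return src_frames, edited_frames
```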
Here we provide a demo that generates data from MagicBrush. You can download this dataset by following the instructions in the MagicBrush repository.
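As a rough starting point, MagicBrush can usually be fetched from the Hugging Face Hub; the dataset id and split name below are assumptions, so verify them against the official MagicBrush page.

```python
# Hedged example: download MagicBrush via Hugging Face datasets.
# The dataset id "osunlp/MagicBrush" and the "train" split are assumptions.
from datasets import load_dataset

ds = load_dataset("osunlp/MagicBrush", split="train")
print(len(ds))
print(ds[0].keys())  # source image, instruction, and edited image fields
```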
You can automatically caption the videos using the Video-BLIP2-Preprocessor script, then set dataset_types and json_path like this:
dataset_types:
  - video_blip
train_data:
  json_path: 'blip_generated.json'
Then generate the editing instructions using the code provided in InstructPix2Pix, and generate the edited videos using CoDeF.
Please cite this paper if you find the code useful for your research:
@misc{zhang2024effived,
title={EffiVED: Efficient Video Editing via Text-instruction Diffusion Models},
author={Zhenghao Zhang and Zuozhuo Dai and Long Qin and Weizhi Wang},
year={2024},
eprint={2403.11568},
archivePrefix={arXiv},
primaryClass={cs.CV}
}