
Memory-Guided Diffusion for Expressive Talking Video Generation


jax-explorer/ComfyUI-IF_MemoAvatar

 
 


Migration from the original GitHub repo

  1. Fix runtime error
  2. Automatic downloads are being fixed (WIP)
  3. Some parameter tuning issues remain and are under active development (WIP)
  4. Optimize compatibility with different images

Run online: https://www.comfyonline.app/explore/791e2d71-eaaf-48da-8f99-9b4f7d657ff2

====================================================================

ComfyUI-IF_MemoAvatar

Memory-Guided Diffusion for Expressive Talking Video Generation

Original repo

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Longtao Zheng*, Yifan Zhang*, Hanzhong Guo, Jiachun Pan, Zhenxiong Tan, Jiahao Lu, Chuanxin Tang, Bo An, Shuicheng Yan
Project Page | arXiv | Model

This repository contains the example inference script for the MEMO-preview model. The gif demo below is compressed. See our project page for full videos.

Demo GIF

Overview

This is a ComfyUI implementation of MEMO (Memory-Guided Diffusion for Expressive Talking Video Generation), which enables the creation of expressive talking avatar videos from a single image and audio input.

Features

  • Generate expressive talking head videos from a single image
  • Audio-driven facial animation
  • Emotional expression transfer
  • High-quality video output

Installation

Clone the repo into your ComfyUI custom_nodes folder, then install the requirements:

cd ComfyUI-IF_MemoAvatar
pip install -r requirements.txt

I removed xformers from requirements.txt because on Windows it only works with a particular combination of PyTorch versions.

If you are on Linux, you can just run

pip install xformers 

On Windows, first check whether xformers is already in your environment:

pip show xformers 

If no version is shown, follow this free guide to set up a good ComfyUI environment and install the latest version:

https://ko-fi.com/post/Installing-Triton-and-Sage-Attention-Flash-Attenti-P5P8175434
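The `pip show xformers` check can also be done from Python, which is handy when you are not sure which interpreter ComfyUI is using. This is a minimal sketch using the standard-library `importlib.metadata`; the helper name `installed_version` is hypothetical and not part of this repo.

```python
from importlib import metadata


def installed_version(package: str):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Run this with the same Python interpreter that launches ComfyUI.
    version = installed_version("xformers")
    if version is None:
        print("xformers is not installed in this environment")
    else:
        print(f"xformers {version} is installed")
```

Like `pip show`, this only reports whether the package is installed. It does not confirm that the installed build matches your PyTorch version.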

Model Files

The models will automatically download to the following locations in your ComfyUI installation:

models/checkpoints/memo/
├── audio_proj/
├── diffusion_net/
├── image_proj/
├── misc/
│ ├── audio_emotion_classifier/
│ ├── face_analysis/
│ └── vocal_separator/
└── reference_net/
models/wav2vec/
models/vae/sd-vae-ft-mse/
models/emotion2vec/emotion2vec_plus_large/
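Since automatic downloads are still being worked on, it can help to confirm which of these folders actually exist. The sketch below only checks for the directories listed in the tree above; the function name `missing_model_dirs` and the example root path are assumptions for illustration, not part of this repo.

```python
from pathlib import Path

# Relative paths copied from the directory tree above.
EXPECTED_DIRS = [
    "models/checkpoints/memo/audio_proj",
    "models/checkpoints/memo/diffusion_net",
    "models/checkpoints/memo/image_proj",
    "models/checkpoints/memo/misc/audio_emotion_classifier",
    "models/checkpoints/memo/misc/face_analysis",
    "models/checkpoints/memo/misc/vocal_separator",
    "models/checkpoints/memo/reference_net",
    "models/wav2vec",
    "models/vae/sd-vae-ft-mse",
    "models/emotion2vec/emotion2vec_plus_large",
]


def missing_model_dirs(comfyui_root):
    """Return the expected model directories that do not exist under comfyui_root."""
    root = Path(comfyui_root)
    return [rel for rel in EXPECTED_DIRS if not (root / rel).is_dir()]


if __name__ == "__main__":
    # Hypothetical location; point this at your own ComfyUI installation.
    for rel in missing_model_dirs("ComfyUI"):
        print(f"missing: {rel}")
```

This checks for directories only. An empty folder still counts as present, so it will not catch a download that created the folder but never finished.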

For now, copy the models from the face_analysis/models folder directly into face_analysis. Duplicate them rather than moving them, because HF will detect the empty folder and download them again every time.

If you don't see a models.json, or it errors out, create one yourself with this content:

{
  "detection": [
    "scrfd_10g_bnkps"
  ],
  "recognition": [
    "glintr100"
  ],
  "analysis": [
    "genderage",
    "2d106det",
    "1k3d68"
  ]
}

Also create a version.txt file containing 0.7.3
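If you would rather not create these two files by hand, they can be written with a short script. This is a sketch under assumptions: the function name `write_face_analysis_metadata` is hypothetical, and you must pass the path of the face analysis folder in your own installation. Existing files are left untouched, so it only fills in what is missing.

```python
import json
from pathlib import Path

# Content of models.json as given above.
MODELS_JSON = {
    "detection": ["scrfd_10g_bnkps"],
    "recognition": ["glintr100"],
    "analysis": ["genderage", "2d106det", "1k3d68"],
}

# Content of version.txt as given above.
VERSION = "0.7.3"


def write_face_analysis_metadata(face_analysis_dir):
    """Create models.json and version.txt in face_analysis_dir if they are missing."""
    target = Path(face_analysis_dir)
    target.mkdir(parents=True, exist_ok=True)

    models_path = target / "models.json"
    if not models_path.exists():
        models_path.write_text(json.dumps(MODELS_JSON, indent=2) + "\n", encoding="utf-8")

    version_path = target / "version.txt"
    if not version_path.exists():
        version_path.write_text(VERSION, encoding="utf-8")

    return models_path, version_path
```

If an existing models.json is the file that errors out, delete it first and then run the function so that a fresh copy is written.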
