Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

EMO is an audio-driven, portrait-to-video generation framework. Given a single reference image and a vocal audio clip, e.g. talking or singing, it generates a vocal avatar video with expressive facial expressions and varied head poses. The output video can be of any duration, matching the length of the input audio.

You can make a still image talk or sing, or perform cross-actor transfer, in which a still image delivers another actor's performance or speaks in a different language.
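EMO's code had not been released at the time of writing, so here is a minimal, purely illustrative Python sketch of the input/output contract described above: one reference image plus vocal audio in, a video of matching duration out. Every name in it (generate_frames, FPS, the stub logic) is a hypothetical placeholder and not the project's actual API.

```python
# Hypothetical sketch only -- EMO has not published an inference API;
# these names and signatures are illustrative assumptions.
import numpy as np

FPS = 25  # assumed output frame rate

def generate_frames(reference_image: np.ndarray,
                    audio_waveform: np.ndarray,
                    sample_rate: int) -> list[np.ndarray]:
    """Produce one frame per 1/FPS seconds of audio (stub, not the real model)."""
    duration_s = len(audio_waveform) / sample_rate
    n_frames = int(duration_s * FPS)
    # The real model would condition each frame on audio features and the
    # reference identity; here we just return copies as a stand-in so the
    # duration relationship (video length == audio length) is visible.
    return [reference_image.copy() for _ in range(n_frames)]

if __name__ == "__main__":
    ref = np.zeros((512, 512, 3), dtype=np.uint8)   # reference portrait
    audio = np.zeros(16000 * 4, dtype=np.float32)   # 4 s of audio @ 16 kHz
    frames = generate_frames(ref, audio, sample_rate=16000)
    print(f"{len(frames)} frames -> {len(frames) / FPS:.1f} s of video")
```

The only point the stub captures is that frame count is derived from audio duration, which is why the framework can generate videos of arbitrary length.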

It reminds me of other AI facial animation software like MyHeritage’s Deep Nostalgia.

Project: https://humanaigc.github.io/emote-portrait-alive/

Github: https://github.com/HumanAIGC/EMO
