Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion (2411.16726v2)

Published 23 Nov 2024 in cs.CV and cs.AI

Abstract: Diffusion models have revolutionized the field of talking head generation, yet still face challenges in expressiveness, controllability, and stability in long-time generation. In this research, we propose an EmotiveTalk framework to address these issues. Firstly, to realize better control over the generation of lip movement and facial expression, a Vision-guided Audio Information Decoupling (V-AID) approach is designed to generate audio-based decoupled representations aligned with lip movements and expression. Specifically, to achieve alignment between audio and facial expression representation spaces, we present a Diffusion-based Co-speech Temporal Expansion (Di-CTE) module within V-AID to generate expression-related representations under multi-source emotion condition constraints. Then we propose a well-designed Emotional Talking Head Diffusion (ETHD) backbone to efficiently generate highly expressive talking head videos, which contains an Expression Decoupling Injection (EDI) module to automatically decouple the expressions from reference portraits while integrating the target expression information, achieving more expressive generation performance. Experimental results show that EmotiveTalk can generate expressive talking head videos, ensuring the promised controllability of emotions and stability during long-time generation, yielding state-of-the-art performance compared to existing methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (13)
  1. Haotian Wang (61 papers)
  2. Yuzhe Weng (3 papers)
  3. Yueyan Li (5 papers)
  4. Zilu Guo (9 papers)
  5. Jun Du (130 papers)
  6. Shutong Niu (13 papers)
  7. Jiefeng Ma (21 papers)
  8. Shan He (23 papers)
  9. Xiaoyan Wu (6 papers)
  10. Qiming Hu (11 papers)
  11. Bing Yin (56 papers)
  12. Cong Liu (169 papers)
  13. Qingfeng Liu (14 papers)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets