Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Face Animation with an Attribute-Guided Diffusion Model (2304.03199v1)

Published 6 Apr 2023 in cs.CV

Abstract: Face animation has achieved much progress in computer vision. However, prevailing GAN-based methods suffer from unnatural distortions and artifacts due to sophisticated motion deformation. In this paper, we propose a Face Animation framework with an attribute-guided Diffusion Model (FADM), which is the first work to exploit the superior modeling capacity of diffusion models for photo-realistic talking-head generation. To mitigate the uncontrollable synthesis effect of the diffusion model, we design an Attribute-Guided Conditioning Network (AGCN) to adaptively combine the coarse animation features and 3D face reconstruction results, which can incorporate appearance and motion conditions into the diffusion process. These specific designs help FADM rectify unnatural artifacts and distortions, and also enrich high-fidelity facial details through iterative diffusion refinements with accurate animation attributes. FADM can flexibly and effectively improve existing animation videos. Extensive experiments on widely used talking-head benchmarks validate the effectiveness of FADM over prior arts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Bohan Zeng (19 papers)
  2. Xuhui Liu (17 papers)
  3. Sicheng Gao (5 papers)
  4. Boyu Liu (10 papers)
  5. Hong Li (216 papers)
  6. Jianzhuang Liu (91 papers)
  7. Baochang Zhang (113 papers)
Citations (19)

Summary

We haven't generated a summary for this paper yet.