Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Make a Face: Towards Arbitrary High Fidelity Face Manipulation (1908.07191v1)

Published 20 Aug 2019 in cs.CV

Abstract: Recent studies have shown remarkable success in face manipulation task with the advance of GANs and VAEs paradigms, but the outputs are sometimes limited to low-resolution and lack of diversity. In this work, we propose Additive Focal Variational Auto-encoder (AF-VAE), a novel approach that can arbitrarily manipulate high-resolution face images using a simple yet effective model and only weak supervision of reconstruction and KL divergence losses. First, a novel additive Gaussian Mixture assumption is introduced with an unsupervised clustering mechanism in the structural latent space, which endows better disentanglement and boosts multi-modal representation with external memory. Second, to improve the perceptual quality of synthesized results, two simple strategies in architecture design are further tailored and discussed on the behavior of Human Visual System (HVS) for the first time, allowing for fine control over the model complexity and sample quality. Human opinion studies and new state-of-the-art Inception Score (IS) / Frechet Inception Distance (FID) demonstrate the superiority of our approach over existing algorithms, advancing both the fidelity and extremity of face manipulation task.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Shengju Qian (16 papers)
  2. Kwan-Yee Lin (23 papers)
  3. Wayne Wu (60 papers)
  4. Yangxiaokang Liu (2 papers)
  5. Quan Wang (130 papers)
  6. Fumin Shen (50 papers)
  7. Chen Qian (226 papers)
  8. Ran He (173 papers)
Citations (69)

Summary

We haven't generated a summary for this paper yet.