
Local Convergence of Gradient Descent-Ascent for Training Generative Adversarial Networks (2305.08277v2)

Published 14 May 2023 in cs.LG and stat.ML

Abstract: Generative Adversarial Networks (GANs) are a popular formulation for training generative models of complex, high-dimensional data. The standard method for training GANs involves a gradient descent-ascent (GDA) procedure on a minimax optimization problem. This procedure is hard to analyze in general due to the nonlinear nature of the dynamics. We study the local dynamics of GDA for training a GAN with a kernel-based discriminator. This convergence analysis is based on a linearization of a nonlinear dynamical system that describes the GDA iterations, under an isolated points model assumption from [Becker et al. 2022]. Our analysis brings out the effect of the learning rates, the regularization, and the bandwidth of the kernel discriminator on the local convergence rate of GDA. Importantly, we show phase transitions that indicate when the system converges, oscillates, or diverges. We also provide numerical simulations that verify our claims.
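
To make the GDA procedure concrete, below is a minimal sketch of simultaneous gradient descent-ascent on a toy regularized bilinear minimax objective. The objective, step sizes, and the regularization parameter `lam` are illustrative assumptions, not the paper's kernel-discriminator setup, but they exhibit the same converge/diverge behavior governed by learning rates and regularization that the abstract describes.

```python
import numpy as np

# Minimal sketch of simultaneous gradient descent-ascent (GDA) on the toy
# objective f(x, y) = x*y + (lam/2)*(x**2 - y**2).
# The bilinear term mimics the generator/discriminator coupling; `lam` plays
# the role of the regularization discussed in the abstract. The objective and
# parameter names are illustrative assumptions, not the paper's exact setup.

def gda(x0, y0, eta_x=0.1, eta_y=0.1, lam=0.0, steps=200):
    """Run simultaneous GDA: x descends f, y ascends f."""
    x, y = x0, y0
    trajectory = [(x, y)]
    for _ in range(steps):
        grad_x = y + lam * x      # partial derivative of f w.r.t. x
        grad_y = x - lam * y      # partial derivative of f w.r.t. y
        x, y = x - eta_x * grad_x, y + eta_y * grad_y
        trajectory.append((x, y))
    return np.array(trajectory)

# With lam = 0 the iterates spiral slowly outward (divergence); a small
# positive lam damps the rotation and the iterates converge to (0, 0).
for lam in (0.0, 0.2):
    traj = gda(x0=1.0, y0=1.0, lam=lam)
    print(f"lam={lam}: final distance to equilibrium = {np.linalg.norm(traj[-1]):.3f}")
```

In this sketch the linearized update is a 2x2 linear map whose spectral radius crosses 1 as the step sizes and regularization vary, which is the kind of phase transition between convergence, oscillation, and divergence that the paper characterizes for the kernel-discriminator GAN.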

Authors (4)
  1. Evan Becker (5 papers)
  2. Parthe Pandit (25 papers)
  3. Sundeep Rangan (129 papers)
  4. Alyson K. Fletcher (30 papers)