Convergence Rates for Gaussian Mixtures of Experts (1907.04377v2)

Published 9 Jul 2019 in math.ST, cs.LG, stat.ML, and stat.TH

Abstract: We provide a theoretical treatment of over-specified Gaussian mixtures of experts with covariate-free gating networks. We establish the convergence rates of the maximum likelihood estimation (MLE) for these models. Our proof technique is based on a novel notion of \emph{algebraic independence} of the expert functions. Drawing on optimal transport theory, we establish a connection between the algebraic independence and a certain class of partial differential equations (PDEs). Exploiting this connection allows us to derive convergence rates and minimax lower bounds for parameter estimation.

Citations (37)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Convergence Rates for Gaussian Mixtures of Experts (1907.04377v2)

Summary

Related Papers