2000 character limit reached
A Coefficient of Determination for Probabilistic Topic Models (1911.11061v2)
Published 20 Nov 2019 in cs.IR, cs.LG, and stat.ML
Abstract: This research proposes a new (old) metric for evaluating goodness of fit in topic models, the coefficient of determination, or $R2$. Within the context of topic modeling, $R2$ has the same interpretation that it does when used in a broader class of statistical models. Reporting $R2$ with topic models addresses two current problems in topic modeling: a lack of standard cross-contextual evaluation metrics for topic modeling and ease of communication with lay audiences. The author proposes that $R2$ should be reported as a standard metric when constructing topic models.
Collections
Sign up for free to add this paper to one or more collections.