Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Automatic Approach for Document-level Topic Model Evaluation (1706.05140v1)

Published 16 Jun 2017 in cs.CL

Abstract: Topic models jointly learn topics and document-level topic distribution. Extrinsic evaluation of topic models tends to focus exclusively on topic-level evaluation, e.g. by assessing the coherence of topics. We demonstrate that there can be large discrepancies between topic- and document-level model quality, and that basing model evaluation on topic-level analysis can be highly misleading. We propose a method for automatically predicting topic model quality based on analysis of document-level topic allocations, and provide empirical evidence for its robustness.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Shraey Bhatia (5 papers)
  2. Jey Han Lau (67 papers)
  3. Timothy Baldwin (125 papers)
Citations (33)

Summary

We haven't generated a summary for this paper yet.