
Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation (1804.10184v1)

Published 26 Apr 2018 in cs.CL

Abstract: Multilingual topic models enable document analysis across languages through coherent multilingual summaries of the data. However, there is no standard and effective metric to evaluate the quality of multilingual topics. We introduce a new intrinsic evaluation of multilingual topic models that correlates well with human judgments of multilingual topic coherence as well as performance in downstream applications. Importantly, we also study evaluation for low-resource languages. Because standard metrics fail to accurately measure topic quality when robust external resources are unavailable, we propose an adaptation model that improves the accuracy and reliability of these metrics in low-resource settings.

Authors (3)
  1. Shudong Hao (3 papers)
  2. Jordan Boyd-Graber (68 papers)
  3. Michael J. Paul (9 papers)
Citations (12)
