Dirichlet Process with Mixed Random Measures: A Nonparametric Topic Model for Labeled Data (1206.4658v1)

Published 18 Jun 2012 in cs.LG and stat.ML

Abstract: We describe a nonparametric topic model for labeled data. The model uses a mixture of random measures (MRM) as a base distribution of the Dirichlet process (DP) of the HDP framework, so we call it the DP-MRM. To model labeled data, we define a DP distributed random measure for each label, and the resulting model generates an unbounded number of topics for each label. We apply DP-MRM on single-labeled and multi-labeled corpora of documents and compare the performance on label prediction with MedLDA, LDA-SVM, and Labeled-LDA. We further enhance the model by incorporating ddCRP and modeling multi-labeled images for image segmentation and object labeling, comparing the performance with nCuts and rddCRP.
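The construction in the abstract (one DP-distributed random measure per label, with a mixture of those measures serving as the base distribution of a document-level DP) can be illustrated with a truncated stick-breaking simulation. This is only a minimal sketch under stated assumptions, not the authors' implementation: the abstract gives no hyperparameters or inference details, so the truncation level, the symmetric Dirichlet prior on topics, the Dirichlet-distributed mixing weights over a document's labels, and every name below (`stick_breaking`, `draw_label_measure`, `generate_document`, `gamma`, `alpha`, `eta`) are hypothetical choices made for illustration.

```python
import numpy as np


def stick_breaking(concentration, truncation, rng):
    """Truncated stick-breaking weights for a DP; the weights sum to 1."""
    betas = rng.beta(1.0, concentration, size=truncation)
    betas[-1] = 1.0  # close the stick so the truncated weights sum to 1
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - betas[:-1])))
    return betas * remaining


def draw_label_measure(gamma, vocab_size, eta, truncation, rng):
    """Approximate one DP-distributed random measure for a single label.

    Assumption: topic atoms come from a symmetric Dirichlet(eta) prior.
    """
    weights = stick_breaking(gamma, truncation, rng)
    topics = rng.dirichlet(np.full(vocab_size, eta), size=truncation)
    return weights, topics


def generate_document(label_measures, doc_labels, alpha, n_words, truncation, rng):
    """Generate word ids from a document-level DP whose base distribution is
    a mixture of the random measures of the document's labels.

    Assumption: mixing weights over labels are drawn from a flat Dirichlet.
    """
    mix = rng.dirichlet(np.ones(len(doc_labels)))
    doc_weights = stick_breaking(alpha, truncation, rng)

    # Each document-level atom is drawn from the mixed base measure:
    # first pick a label, then pick a topic from that label's measure.
    atom_label_idx = rng.choice(len(doc_labels), size=truncation, p=mix)
    atom_topic_idx = np.array([
        rng.choice(truncation, p=label_measures[doc_labels[j]][0])
        for j in atom_label_idx
    ])

    words = np.empty(n_words, dtype=int)
    for i in range(n_words):
        atom = rng.choice(truncation, p=doc_weights)
        label = doc_labels[atom_label_idx[atom]]
        topic = label_measures[label][1][atom_topic_idx[atom]]
        words[i] = rng.choice(topic.shape[0], p=topic)
    return words


rng = np.random.default_rng(0)
labels = ["sports", "politics", "science"]  # hypothetical label set
label_measures = {
    name: draw_label_measure(gamma=1.0, vocab_size=50, eta=0.1, truncation=20, rng=rng)
    for name in labels
}
doc_words = generate_document(
    label_measures, ["sports", "science"], alpha=1.0, n_words=30, truncation=20, rng=rng
)
```

Because each label owns its own random measure, every topic a document uses can be traced back to exactly one of that document's labels, and the truncation level only stands in for the unbounded number of topics per label that the nonparametric model allows.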

Authors (3)
  1. Dongwoo Kim (63 papers)
  2. Suin Kim (5 papers)
  3. Alice Oh (82 papers)
Citations (23)
