Variational Bayesian Methods for a Tree-Structured Stick-Breaking Process Mixture of Gaussians by Application of the Bayes Codes for Context Tree Models (2405.00385v2)
Abstract: The tree-structured stick-breaking process (TS-SBP) mixture model is a non-parametric Bayesian model that can represent tree-like hierarchical structures among the mixture components. For TS-SBP mixture models, only a Markov chain Monte Carlo (MCMC) method has been proposed and any variational Bayesian (VB) methods has not been proposed. In general, MCMC methods are computationally more expensive than VB methods. Therefore, we require a large computational cost to learn the TS-SBP mixture model. In this paper, we propose a learning algorithm with less computational cost for the TS-SBP mixture of Gaussians by using the VB method under an assumption of finite tree width and depth. When constructing such VB method, the main challenge is efficient calculation of a sum over all possible trees. To solve this challenge, we utilizes a subroutine in the Bayes coding algorithm for context tree models. We confirm the computational efficiency of our VB method through an experiments on a benchmark dataset.
- T. Matsushima and S. Hirasawa, “Reducing the space complexity of a Bayes coding algorithm using an expanded context tree,” in 2009 IEEE International Symposium on Information Theory, June 2009, pp. 719–723.
- Y. Nakahara, S. Saito, A. Kamatsuka, and T. Matsushima, “Probability distribution on full rooted trees,” Entropy, vol. 24, no. 3, 2022. [Online]. Available: https://www.mdpi.com/1099-4300/24/3/328
- J. H. Ward, “Hierarchical grouping to optimize an objective function,” Journal of the American Statistical Association, vol. 58, no. 301, pp. 236–244, 1963. [Online]. Available: http://www.jstor.org/stable/2282967
- Z. Ghahramani, M. Jordan, and R. P. Adams, “Tree-structured stick breaking for hierarchical data,” in Advances in Neural Information Processing Systems, J. Lafferty, C. Williams, J. Shawe-Taylor, R. Zemel, and A. Culotta, Eds., vol. 23. Curran Associates, Inc., 2010. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2010/file/a5e00132373a7031000fd987a3c9f87b-Paper.pdf
- Y. Nakahara, “Tree-structured gaussian mixture models and their variational inference,” in 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2023, pp. 1129–1135.
- T. Minka, “The dirichlet-tree distribution,” https://tminka.github.io/papers/dirichlet/minka-dirtree.pdf, 1999, (Accessed on 05/01/2024).