Papers
Topics
Authors
Recent
Search
2000 character limit reached

Majorization-minimization for Sparse Nonnegative Matrix Factorization with the $β$-divergence

Published 13 Jul 2022 in cs.LG and math.OC | (2207.06316v4)

Abstract: This article introduces new multiplicative updates for nonnegative matrix factorization with the $\beta$-divergence and sparse regularization of one of the two factors (say, the activation matrix). It is well known that the norm of the other factor (the dictionary matrix) needs to be controlled in order to avoid an ill-posed formulation. Standard practice consists in constraining the columns of the dictionary to have unit norm, which leads to a nontrivial optimization problem. Our approach leverages a reparametrization of the original problem into the optimization of an equivalent scale-invariant objective function. From there, we derive block-descent majorization-minimization algorithms that result in simple multiplicative updates for either $\ell_{1}$-regularization or the more "aggressive" log-regularization. In contrast with other state-of-the-art methods, our algorithms are universal in the sense that they can be applied to any $\beta$-divergence (i.e., any value of $\beta$) and that they come with convergence guarantees. We report numerical comparisons with existing heuristic and Lagrangian methods using various datasets: face images, an audio spectrogram, hyperspectral data, and song play counts. We show that our methods obtain solutions of similar quality at convergence (similar objective values) but with significantly reduced CPU times.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. P. Paatero and U. Tapper, “Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values,” Environmetrics, vol. 5, no. 2, pp. 111–126, Jun. 1994.
  2. D. D. Lee and H. S. Seung, “Learning the parts of objects by non-negative matrix factorization,” Nature, vol. 401, no. 6755, pp. 788–791, Oct. 1999.
  3. P. Smaragdis, C. Févotte, G. J. Mysore, N. Mohammadiha, and M. Hoffman, “Static and dynamic source separation using nonnegative factorizations: A unified view,” IEEE Signal Process. Mag., vol. 31, no. 3, pp. 66–75, May 2014.
  4. M. W. Berry, M. Browne, A. N. Langville, V. P. Pauca, and R. J. Plemmons, “Algorithms and applications for approximate nonnegative matrix factorization,” Comput. Stat. Data Anal., vol. 52, no. 1, pp. 155–173, Sep. 2007.
  5. J. M. Bioucas-Dias, A. Plaza, N. Dobigeon, M. Parente, Q. Du, P. Gader, and J. Chanussot, “Hyperspectral unmixing overview: Geometrical, statistical, and sparse regression-based approaches,” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., vol. 5, no. 2, pp. 354–379, Apr. 2012.
  6. Y. Hu, Y. Koren, and C. Volinsky, “Collaborative filtering for implicit feedback datasets,” in Proc. IEEE Int. Conf. Data Mining.   IEEE, Dec. 2008.
  7. X. Fu, K. Huang, N. D. Sidiropoulos, and W.-K. Ma, “Nonnegative matrix factorization for signal and data analytics: identifiability, algorithms, and applications,” IEEE Signal Process. Mag., vol. 36, no. 2, pp. 59–80, Mar. 2019.
  8. A. Cichocki, S. Cruces, and S. ichi Amari, “Generalized Alpha-Beta divergences and their application to robust nonnegative matrix factorization,” Entropy, vol. 13, no. 1, pp. 134–170, jan 2011.
  9. C. Févotte and J. Idier, “Algorithms for nonnegative matrix factorization with the β𝛽\betaitalic_β-divergence,” Neural Comput., vol. 23, no. 9, pp. 2421–2456, Sep. 2011.
  10. P. O. Hoyer, “Non-negative sparse coding,” in Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.   IEEE, 2002.
  11. ——, “Non-negative matrix factorization with sparseness constraints,” J. Mach. Learn. Res., vol. 5, p. 1457–1469, Dec. 2004.
  12. J. Eggert and E. Körner, “Sparse coding and NMF,” in Proc. Int. Joint Conf. Neur. Netw.   IEEE, Jul. 2004, pp. 2529–2533.
  13. H. Kim and H. Park, “Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis,” Bioinformatics, vol. 23, no. 12, pp. 1495–1502, May 2007.
  14. A. Cichoki and A.-H. Phan, “Fast local algorithms for large scale nonnegative matrix and tensor factorizations,” IEICE Trans. Fund. Electron. Comm. Comput. Sci., vol. E92-A, no. 3, pp. 708–721, 2009.
  15. J. Mairal, F. Bach, J. Ponce, and G. Sapiro, “Online learning for matrix factorization and sparse coding,” J. Mach. Learn. Res., vol. 11, pp. 10–60, 2010.
  16. N. Guan, D. Tao, Z. Luo, and B. Yuan, “NeNMF: An optimal gradient method for nonnegative matrix factorization,” IEEE Trans. Signal Process., vol. 60, no. 6, pp. 2882–2898, Jun. 2012.
  17. R. Zhao and V. Y. F. Tan, “A unified convergence analysis of the multiplicative update algorithm for regularized nonnegative matrix factorization,” IEEE Trans. Signal Process., vol. 66, no. 1, pp. 129–138, Jan. 2018.
  18. Y. Qian, S. Jia, J. Zhou, and A. Robles-Kelly, “L1/2 sparsity constrained nonnegative matrix factorization for hyperspectral unmixing,” in Proc. International Conference on Digital Image Computing: Techniques and Applications.   IEEE, Dec. 2010.
  19. J. Sigurdsson, M. O. Ulfarsson, and J. R. Sveinsson, “Hyperspectral unmixing with ℓqsubscriptℓ𝑞\ell_{q}roman_ℓ start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT regularization,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 11, pp. 6793–6806, Nov. 2014.
  20. A. Lefèvre, F. Bach, and C. Févotte, “Itakura-Saito nonnegative matrix factorization with group sparsity,” in Proc. Int. Conf. Acoust. Speech Signal Process.   IEEE, May 2011.
  21. V. Y. F. Tan and C. Févotte, “Automatic relevance determination in nonnegative matrix factorization with the β𝛽\betaitalic_β-divergence,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 7, pp. 1592–1605, Jul. 2013.
  22. C. Peng, Y. Zhang, Y. Chen, Z. Kang, C. Chen, and Q. Cheng, “Log-based sparse nonnegative matrix factorization for data representation,” Knowledge-Based Syst., vol. 251, p. 109127, Sep. 2022.
  23. M. Shashanka, B. Raj, and P. Smaragdis, “Sparse overcomplete latent variable decomposition of counts data,” in Proc. Ann. Conf. Neur. Inform. Proc. Syst., J. Platt, D. Koller, Y. Singer, and S. Roweis, Eds., vol. 20.   Curran Associates, Inc., 2007, pp. 1313–1320.
  24. C. Joder, F. Weninger, D. Virette, and B. Schuller, “A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms,” in Proc. Int. Conf. Acoust. Speech Signal Process.   IEEE, May 2013.
  25. R. Peharz and F. Pernkopf, “Sparse nonnegative matrix factorization with ℓ0subscriptℓ0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-constraints,” Neurocomputing, vol. 80, pp. 38–46, Mar. 2012.
  26. J. Bolte, S. Sabach, and M. Teboulle, “Proximal alternating linearized minimization for nonconvex and nonsmooth problems,” Math. Programm., vol. 146, no. 1-2, pp. 459–494, Jul. 2013.
  27. J. Kim, R. D. C. Monteiro, and H. Park, “Group sparsity in nonnegative matrix factorization,” in Proc. SIAM Int. Conf. Data Mining.   Society for Industrial and Applied Mathematics, apr 2012.
  28. J. Le Roux and F. W. J. R. Hershey, “Sparse NMF – half-baked or well done?” Mitsubishi Electric Research Laboratories, Tech. Rep., Mar. 2015.
  29. V. Leplat, N. Gillis, and J. Idier, “Multiplicative updates for NMF with β𝛽\betaitalic_β-divergences under disjoint equality constraints,” SIAM J. Matrix Anal. Appl., vol. 42, no. 2, pp. 730–752, Jan. 2021.
  30. L. Filstroff, O. Gouvert, C. Févotte, and O. Cappé, “A comparative study of Gamma Markov chains for temporal non-negative matrix factorization,” IEEE Trans. Signal Process., vol. 69, pp. 1614–1626, 2021.
  31. S. Essid and C. Févotte, “Smooth nonnegative matrix factorization for unsupervised audiovisual document structuring,” IEEE Trans. Multimedia, vol. 15, no. 2, pp. 415–425, Feb. 2013.
  32. C. Févotte, N. Bertin, and J.-L. Durrieu, “Nonnegative matrix factorization with the Itakura-Saito Divergence: With application to music analysis,” Neural Comput., vol. 21, no. 3, pp. 793–830, Mar. 2009.
  33. E. Vincent, N. Bertin, and R. Badeau, “Adaptive harmonic spectral decomposition for multiple pitch estimation,” IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 3, pp. 528–537, Mar. 2010.
  34. C. Févotte and N. Dobigeon, “Nonlinear hyperspectral unmixing with robust nonnegative matrix factorization,” IEEE Trans. Image Process., vol. 24, no. 12, pp. 4810–4819, Dec. 2015.
  35. X. Fu, K. Huang, and N. D. Sidiropoulos, “On identifiability of nonnegative matrix factorization,” IEEE Signal Process. Lett., vol. 25, no. 3, pp. 328–332, mar 2018.
  36. Z. Yang and E. Oja, “Unified development of multiplicative algorithms for linear and quadratic nonnegative matrix factorization,” IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1878–1891, Dec. 2011.
  37. A. Cichocki, R. Zdunek, and S. ichi Amari, “Csiszár’s divergences for non-negative matrix factorization: Family of new algorithms,” in Independent Component Analysis and Blind Signal Separation.   Springer Berlin Heidelberg, 2006, pp. 32–39.
  38. Y. Sun, P. Babu, and D. P. Palomar, “Majorization-minimization algorithms in signal processing, communications, and machine learning,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 794–816, Feb. 2017.
  39. M. Nakano, H. Kameoka, J. L. Roux, Y. Kitano, N. Ono, and S. Sagayama, “Convergence-guaranteed multiplicative algorithms for nonnegative matrix factorization with β𝛽\betaitalic_β-divergence,” in IEEE Int. Workshop Mach. Learn. Signal Process.   IEEE, Sep. 2010.
  40. E. J. Candès, M. B. Wakin, and S. P. Boyd, “Enhancing sparsity by reweighted ℓ1subscriptℓ1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT minimization,” J. Fourier Anal. Appl., vol. 14, no. 5-6, pp. 877–905, Oct. 2008.
  41. F. S. Samaria and A. Harter, “Parameterisation of a stochastic model for human face identification,” in Proc. Workshop on Applications of Computer Vision.   IEEE Comput. Soc. Press, 1994.
  42. T. Bertin-Mahieux, D. P. Ellis, B. Whitman, and P. Lamere, “The million song dataset,” in Proc. Int. Conf. Music Inform. Retrieval (ISMIR), 2011.
  43. O. Gouvert, T. Oberlin, and C. Févotte, “Ordinal non-negative matrix factorization for recommendation,” in Proc. Int. Conf. Mach. Learn., 2020, pp. 3680–3689.
Citations (5)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.