Block Majorization Minimization with Extrapolation and Application to $β$-NMF (2401.06646v1)
Abstract: We propose a Block Majorization Minimization method with Extrapolation (BMMe) for solving a class of multi-convex optimization problems. The extrapolation parameters of BMMe are updated using a novel adaptive update rule. By showing that block majorization minimization can be reformulated as a block mirror descent method, with the Bregman divergence adaptively updated at each iteration, we establish subsequential convergence for BMMe. We use this method to design efficient algorithms to tackle nonnegative matrix factorization problems with the $\beta$-divergences ($\beta$-NMF) for $\beta\in [1,2]$. These algorithms, which are multiplicative updates with extrapolation, benefit from our novel results that offer convergence guarantees. We also empirically illustrate the significant acceleration of BMMe for $\beta$-NMF through extensive experiments.
- Accelerating nonnegative matrix factorization algorithms using extrapolation. Neural Computation, 31(2):417–439, 2019.
- Proximal alternating minimization and projection methods for nonconvex problems: An approach based on the Kurdyka-Łojasiewicz inequality. Mathematics of Operations Research, 35(2):438–457, 2010.
- On the convergence of block coordinate descent type methods. SIAM Journal on Optimization, 23:2037–2060, 2013.
- Automatic music transcription: An overview. IEEE Signal Processing Magazine, 36(1):20–30, 2019.
- Bertsekas, D. Nonlinear Programming. Athena Scientific, 2016.
- Hyperspectral unmixing overview: Geometrical, statistical, and sparse regression-based approaches. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 5(2):354–379, April 2012.
- Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Mathematical Programming, 146(1):459–494, Aug 2014.
- GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership. Genome Biology, 24(1):236, 2023.
- Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1–38, 1977.
- Nonlinear hyperspectral unmixing with robust nonnegative matrix factorization. arXiv preprint arXiv:1401.5649, 2014.
- Algorithms for nonnegative matrix factorization with the β𝛽\betaitalic_β-divergence. Neural Computation, 23(9):2421–2456, Sept 2011.
- Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis. Neural Computation, 21(3):793–830, 03 2009.
- Blind separation of quasi-stationary sources: Exploiting convex geometry in covariance domain. IEEE Transactions on Signal Processing, 63(9):2306–2320, 2015.
- On the convergence of the block nonlinear gauss–seidel method under convex constraints. Operations Research Letters, 26(3):127 – 136, 2000.
- Inertial alternating direction method of multipliers for non-convex non-smooth optimization. Computational Optimization and Applications, 83:247–285, 2022.
- Algorithms for nonnegative matrix factorization with the Kullback-Leibler divergence. Journal of Scientific Computing, (87):93, 2021.
- Inertial block proximal method for non-convex non-smooth optimization. In Thirty-seventh International Conference on Machine Learning (ICML), 2020.
- An inertial block majorization minimization framework for nonsmooth nonconvex optimization. Journal of Machine Learning Research, 24(18):1–41, 2023.
- Fast coordinate descent methods with variable selection for non-negative matrix factorization. In Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1064–1072, 2011.
- A tutorial on mm algorithms. The American Statistician, 58(1):30–37, 2004.
- Optimization transfer using surrogate objective functions. Journal of Computational and Graphical Statistics, 9(1):1–20, 2000.
- Learning the parts of objects by nonnegative matrix factorization. Nature, 401:788–791, 1999.
- Blind audio source separation with minimum-volume beta-divergence NMF. IEEE Transactions on Signal Processing, 68:3400–3410, 2020.
- Multiplicative updates for NMF with beta-divergences under disjoint equality constraints. SIAM Journal on Matrix Analysis and Applications, 42(2):730–752, 2021.
- Identifiability of the simplex volume minimization criterion for blind hyperspectral unmixing: The no-pure-pixel case. IEEE Transactions on Geoscience and Remote Sensing, 53(10):5530–5546, 2015.
- A signal processing perspective on hyperspectral unmixing: Insights from remote sensing. IEEE Signal Processing Magazine, 31(1):67–81, 2014.
- Mairal, J. Optimization with first-order surrogate functions. In Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28, ICML’13, pp. 783–791, 2013.
- Joint majorization-minimization for nonnegative matrix factorization with the β𝛽\betaitalic_β-divergence. Signal Processing, 209:109048, 2023.
- Iteration-complexity of a Jacobi-type non-Euclidean ADMM for multi-block linearly constrained nonconvex programs, 2017.
- Endmember extraction from highly mixed data using minimum volume constrained nonnegative matrix factorization. IEEE Transactions on Geoscience and Remote Sensing, 45(3):765–777, 2007.
- A View of the EM Algorithm that Justifies Incremental, Sparse, and other Variants, pp. 355–368. Springer Netherlands, Dordrecht, 1998.
- Nesterov, Y. Lectures on Convex Optimization. Springer, 2018.
- Ochs, P. Unifying abstract inexact convergence theorems and block coordinate variable metric iPiano. SIAM Journal on Optimization, 29(1):541–570, 2019.
- iPiano: Inertial proximal algorithm for nonconvex optimization. SIAM Journal on Imaging Sciences, 7(2):1388–1419, 2014.
- A fast multiplicative updates algorithm for non-negative matrix factorization. arXiv preprint arXiv:2303.17992, 2023.
- Inertial proximal alternating linearized minimization (iPALM) for nonconvex and nonsmooth problems. SIAM Journal on Imaging Sciences, 9(4):1756–1787, 2016.
- A unified convergence analysis of block successive minimization methods for nonsmooth optimization. SIAM Journal on Optimization, 23(2):1126–1153, 2013.
- Richardson, W. H. Bayesian-based iterative method of image restoration. JoSA, 62(1):55–59, 1972.
- Non-negative matrix factorization for polyphonic music transcription. In 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 177–180. IEEE, 2003.
- Majorization-minimization algorithms in signal processing, communications, and machine learning. IEEE Transactions on Signal Processing, 65:794–816, 02 2017.
- Tseng, P. Convergence of a block coordinate descent method for nondifferentiable minimization. Journal of Optimization Theory and Applications, 109(3):475–494, Jun 2001.
- Tseng, P. On accelerated proximal gradient methods for convex-concave optimization. Technical report, 2008.
- A coordinate gradient descent method for nonsmooth separable minimization. Mathematical Programming, 117(1):387–423, Mar 2009.
- Inertial majorization-minimization algorithm for minimum-volume NMF. In 2021 29th European Signal Processing Conference (EUSIPCO), pp. 1065–1069, 2021.
- A block coordinate descent method for regularized multiconvex optimization with applications to nonnegative tensor factorization and completion. SIAM Journal on Imaging Sciences, 6(3):1758–1789, 2013.
- Generative model-based document clustering: a comparative study. Knowledge and Information Systems, 8:374–384, 2005.
- Zhu, F. Hyperspectral unmixing: ground truth labeling, datasets, benchmark performances and survey. arXiv preprint arXiv:1708.05125, 2017.