A Riemannian Proximal Newton Method (2304.04032v3)
Abstract: In recent years, the proximal gradient method and its variants have been generalized to Riemannian manifolds for solving optimization problems with an additively separable structure, i.e., $f + h$, where $f$ is continuously differentiable, and $h$ may be nonsmooth but convex with a computationally reasonable proximal mapping. In this paper, we generalize the proximal Newton method to embedded submanifolds for solving problems of this type with $h(x) = \mu \|x\|_1$. The generalization relies on the Weingarten map and semismooth analysis. It is shown that the Riemannian proximal Newton method has a local quadratic convergence rate under certain reasonable assumptions. Moreover, a hybrid version is given by concatenating a Riemannian proximal gradient method and the Riemannian proximal Newton method. It is shown that if the switch parameter is chosen appropriately, then the hybrid method converges globally and also has a local quadratic convergence rate. Numerical experiments on random and synthetic data are used to demonstrate the performance of the proposed methods.
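As a rough illustration of what a "computationally reasonable proximal mapping" means for $h(x) = \mu \|x\|_1$, the sketch below evaluates the proximal mapping of $\tau \|\cdot\|_1$ in the plain Euclidean setting, where it reduces to entrywise soft-thresholding. It is not the Riemannian subproblem of the paper, which also involves the manifold's tangent space. The function name `soft_threshold` and the parameter `tau` (standing for a step size times $\mu$) are assumptions made for this example.

```python
def soft_threshold(x, tau):
    """Euclidean proximal mapping of tau * ||.||_1 at x.

    Illustrative sketch only: it solves
        argmin_y  0.5 * ||y - x||^2 + tau * ||y||_1
    entrywise and ignores any manifold constraint.
    """
    if tau < 0:
        raise ValueError("tau must be nonnegative")
    result = []
    for value in x:
        if value > tau:
            # Large positive entries shrink toward zero by tau.
            result.append(value - tau)
        elif value < -tau:
            # Large negative entries shrink toward zero by tau.
            result.append(value + tau)
        else:
            # Entries with magnitude at most tau are set exactly to zero.
            result.append(0.0)
    return result


if __name__ == "__main__":
    print(soft_threshold([3.0, -0.5, 1.0, -2.0], 1.0))
```

Entries whose magnitude is at most `tau` are mapped exactly to zero, which is why $\ell_1$ regularization promotes sparse solutions.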