Minimax-optimal estimation for sparse multi-reference alignment with collision-free signals (2312.07839v1)
Abstract: The Multi-Reference Alignment (MRA) problem aims at the recovery of an unknown signal from repeated observations under the latent action of a group of cyclic isometries, in the presence of additive noise of high intensity $\sigma$. It is a more tractable version of the celebrated cryo-EM model. In the crucial high-noise regime, it is known that its sample complexity scales as $\sigma^6$. Recent investigations have shown that for the practically significant setting of sparse signals, the sample complexity of the maximum likelihood estimator asymptotically scales with the noise level as $\sigma^4$. In this work, we investigate minimax optimality for signal estimation under the MRA model for so-called collision-free signals. In particular, this signal class covers the setting of generic signals of dilute sparsity (wherein the support size $s=O(L^{1/3})$, where $L$ is the ambient dimension). We demonstrate that the minimax optimal rate of estimation for the sparse MRA problem in this setting is $\sigma^2/\sqrt{n}$, where $n$ is the sample size. In particular, this widely generalizes the sample complexity asymptotics for the restricted MLE in this setting, establishing it as the statistically optimal estimator. Finally, we demonstrate a concentration inequality for the restricted MLE on its deviations from the ground truth.
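To make the observation model concrete, the following is a minimal Python sketch of how MRA samples might be generated: each observation is a cyclic shift of the unknown signal plus noise scaled by $\sigma$. The abstract does not specify the shift or noise distributions, so the uniform shifts and i.i.d. standard Gaussian noise used here are assumptions, and the names `cyclic_shift`, `sample_mra`, and `theta` are hypothetical.

```python
import random


def cyclic_shift(signal, shift):
    """Cyclically shift a list so that entry j moves to position (j + shift) mod L."""
    L = len(signal)
    return [signal[(j - shift) % L] for j in range(L)]


def sample_mra(signal, sigma, n, seed=0):
    """Draw n observations of the form shift_i(signal) + sigma * noise_i.

    Assumptions (not stated in the abstract): shifts are uniform on
    {0, ..., L-1} and the noise entries are i.i.d. standard Gaussian.
    """
    rng = random.Random(seed)
    L = len(signal)
    observations = []
    for _ in range(n):
        shift = rng.randrange(L)  # latent cyclic shift, never observed
        shifted = cyclic_shift(signal, shift)
        observations.append([x + sigma * rng.gauss(0.0, 1.0) for x in shifted])
    return observations


# Hypothetical sparse signal: ambient dimension L = 12, support size s = 2.
theta = [0.0] * 12
theta[1] = 1.0
theta[4] = -2.0

samples = sample_mra(theta, sigma=3.0, n=5)
```

Because the shifts are latent, the signal can only be recovered up to a global cyclic shift, which is why estimation error in this setting is usually measured modulo the group action.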