$\ell_p$-Regression in the Arbitrary Partition Model of Communication (2307.05117v1)
Abstract: We consider the randomized communication complexity of the distributed $\ell_p$-regression problem in the coordinator model, for $p\in (0,2]$. In this problem, there is a coordinator and $s$ servers. The $i$-th server receives $A^i\in\{-M, -M+1, \ldots, M\}^{n\times d}$ and $b^i\in\{-M, -M+1, \ldots, M\}^n$, and the coordinator would like to find a $(1+\epsilon)$-approximate solution to $\min_{x\in\mathbb{R}^d} \|(\sum_i A^i)x - (\sum_i b^i)\|_p$. Here $M \leq \mathrm{poly}(nd)$ for convenience. This model, where the data is additively shared across servers, is commonly referred to as the arbitrary partition model. We obtain significantly improved bounds for this problem. For $p = 2$, i.e., least squares regression, we give the first optimal bound of $\tilde{\Theta}(sd^2 + sd/\epsilon)$ bits. For $p \in (1,2)$, we obtain an $\tilde{O}(sd^2/\epsilon + sd/\mathrm{poly}(\epsilon))$ upper bound. Notably, for $d$ sufficiently large, our leading order term only depends linearly on $1/\epsilon$ rather than quadratically. We also show communication lower bounds of $\Omega(sd^2 + sd/\epsilon^2)$ for $p\in (0,1]$ and $\Omega(sd^2 + sd/\epsilon)$ for $p\in (1,2]$. Our bounds considerably improve previous bounds due to (Woodruff et al. COLT, 2013) and (Vempala et al., SODA, 2020).
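Because the data is additively shared, any linear sketch $S$ commutes with the partition: $S\sum_i A^i = \sum_i S A^i$. The following is a minimal sketch of the textbook sketch-and-solve baseline for $p = 2$ built on that fact. It is not the protocol of this paper and makes no claim about the bounds above. All function names are hypothetical, the CountSketch-style map is an illustrative choice, and the servers are assumed to share the randomness that defines $S$.

```python
import random
from fractions import Fraction

def make_sketch(n, m, rng):
    """CountSketch-style map S (assumed shared by all servers):
    input row r is added to output row buckets[r] with sign signs[r]."""
    buckets = [rng.randrange(m) for _ in range(n)]
    signs = [rng.choice((-1, 1)) for _ in range(n)]
    return buckets, signs

def apply_sketch(sketch, rows, m):
    """Compute S @ rows for a list of n equal-length integer rows."""
    buckets, signs = sketch
    out = [[0] * len(rows[0]) for _ in range(m)]
    for r, row in enumerate(rows):
        for c, value in enumerate(row):
            out[buckets[r]][c] += signs[r] * value
    return out

def add_matrices(mats):
    """Entrywise sum of equally shaped matrices."""
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*mats)]

def server_message(sketch, A_i, b_i, m):
    """Server i sketches its own share [A^i | b^i]; message size is m x (d+1)."""
    augmented = [list(row) + [b] for row, b in zip(A_i, b_i)]
    return apply_sketch(sketch, augmented, m)

def solve_least_squares(aug):
    """Exactly minimise ||A x - b||_2 for aug = [A | b] via the normal equations."""
    d = len(aug[0]) - 1
    # Build the d x (d+1) system [A^T A | A^T b] with exact rational arithmetic.
    system = [[Fraction(sum(row[i] * row[j] for row in aug)) for j in range(d + 1)]
              for i in range(d)]
    for col in range(d):  # Gauss-Jordan elimination
        pivot = next((r for r in range(col, d) if system[r][col] != 0), None)
        if pivot is None:
            raise ValueError("sketched matrix is rank deficient")
        system[col], system[pivot] = system[pivot], system[col]
        pivot_value = system[col][col]
        system[col] = [v / pivot_value for v in system[col]]
        for r in range(d):
            if r != col:
                factor = system[r][col]
                system[r] = [v - factor * p for v, p in zip(system[r], system[col])]
    return [system[i][d] for i in range(d)]

def coordinator_solve(messages):
    """Sum the sketched shares (equal to S @ [A | b] by linearity), then solve."""
    return solve_least_squares(add_matrices(messages))
```

Each message has $m(d+1)$ entries regardless of $n$, which is the reason for sketching in this setting. How large $m$ must be, and how many bits each entry needs, is exactly what the bounds in the abstract quantify. This sketch does not attempt to match them.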
- Information lower bounds via self-reducibility. Theory Comput. Syst., 59(2):377–396, 2016.
- Nearly-optimal bounds for sparse recovery in generic norms, with applications to $k$-median sketching. CoRR, abs/1504.01076, 2015.
- An information statistics approach to data stream and communication complexity. J. Comput. Syst. Sci., 68(4):702–732, 2004.
- Toward a unified theory of sparse dimensionality reduction in Euclidean space. CoRR, abs/1311.2542, 2013.
- Optimal principal component analysis in distributed and streaming models. In Daniel Wichs and Yishay Mansour, editors, Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016, Cambridge, MA, USA, June 18-21, 2016, pages 236–249. ACM, 2016.
- Online active regression. arXiv:2207.05945 [cs.LG], 2022.
- Michael B. Cohen. Nearly tight oblivious subspace embeddings by trace inequalities. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’16, pages 278–287, USA, 2016. Society for Industrial and Applied Mathematics.
- $l_p$ row sampling by Lewis weights. In Rocco A. Servedio and Ronitt Rubinfeld, editors, Proceedings of the Forty-Seventh Annual ACM on Symposium on Theory of Computing, STOC 2015, Portland, OR, USA, June 14-17, 2015, pages 183–192. ACM, 2015.
- Informational complexity and the direct sum problem for simultaneous message complexity. In 42nd Annual Symposium on Foundations of Computer Science, FOCS 2001, 14-17 October 2001, Las Vegas, Nevada, USA, pages 270–278. IEEE Computer Society, 2001.
- Numerical linear algebra in the streaming model. In Michael Mitzenmacher, editor, Proceedings of the 41st Annual ACM Symposium on Theory of Computing, STOC 2009, Bethesda, MD, USA, May 31 - June 2, 2009, pages 205–214. ACM, 2009.
- Low-rank approximation and regression in input sparsity time. J. ACM, 63(6), January 2017.
- Dimensionality reduction for Tukey regression. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, volume 97 of Proceedings of Machine Learning Research, pages 1262–1271. PMLR, 2019.
- Random embedding of $\ell_p^n$ into $\ell_r^n$. Mathematische Annalen, 350(4):953–972, 2011.
- André Gronemeier. Asymptotically optimal lower bounds on the NIH-multi-party information complexity of the AND-function and disjointness. In Susanne Albers and Jean-Yves Marion, editors, 26th International Symposium on Theoretical Aspects of Computer Science, STACS 2009, February 26-28, 2009, Freiburg, Germany, Proceedings, volume 3 of LIPIcs, pages 505–516. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, Germany, 2009.
- T. S. Jayram. Hellinger strikes back: A note on the multi-party information complexity of AND. In Irit Dinur, Klaus Jansen, Joseph Naor, and José D. P. Rolim, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 12th International Workshop, APPROX 2009, and 13th International Workshop, RANDOM 2009, Berkeley, CA, USA, August 21-23, 2009. Proceedings, volume 5687 of Lecture Notes in Computer Science, pages 562–573. Springer, 2009.
- Principal component analysis and higher correlations for distributed data. In Maria-Florina Balcan, Vitaly Feldman, and Csaba Szepesvári, editors, Proceedings of The 27th Conference on Learning Theory, COLT 2014, Barcelona, Spain, June 13-15, 2014, volume 35 of JMLR Workshop and Conference Proceedings, pages 1040–1057. JMLR.org, 2014.
- The $\ell_p$-Subspace Sketch Problem in Small Dimensions with Applications to Support Vector Machines, pages 850–877. SIAM, 2023.
- Exponentially improved dimensionality reduction for $l_1$: Subspace embeddings and independence testing. In Mikhail Belkin and Samory Kpotufe, editors, Conference on Learning Theory, COLT 2021, 15-19 August 2021, Boulder, Colorado, USA, volume 134 of Proceedings of Machine Learning Research, pages 3111–3195. PMLR, 2021.
- Active sampling for linear regression beyond the $l_2$ norm. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, Oct 31–Nov 3, 2022, pages 744–753. IEEE, 2022.
- S. Muthukrishnan. Data streams: Algorithms and applications. Found. Trends Theor. Comput. Sci., 1(2), 2005.
- Near-linear time and fixed-parameter tractable algorithms for tensor decompositions. arXiv:2207.07417 [cs.DS], 2022.
- J. Nelson and H. L. Nguyên. Osnap: Faster numerical linear algebra algorithms via sparser subspace embeddings. In 2013 IEEE 54th Annual Symposium on Foundations of Computer Science, pages 117–126, 2013.
- Gilles Pisier. On the dimension of the $\ell_p^n$-subspaces of Banach spaces, for $1\leq p<2$. Trans. of AMS, 276:201–211, 1983.
- Lower bounds for number-in-hand multiparty communication complexity, made easy. SIAM J. Comput., 45(1):174–196, 2016.
- Subspace embeddings for the $l_1$-norm with applications. In Lance Fortnow and Salil P. Vadhan, editors, Proceedings of the 43rd ACM Symposium on Theory of Computing, STOC 2011, San Jose, CA, USA, 6-8 June 2011, pages 755–764. ACM, 2011.
- Konstantin Tikhomirov. Singularity of random Bernoulli matrices. Annals of Mathematics, 191(2):593–634, 2020.
- Bengt von Bahr and Carl-Gustav Esseen. Inequalities for the rth absolute moment of a sum of random variables, $1\leq r\leq 2$. The Annals of Mathematical Statistics, pages 299–303, 1965.
- The communication complexity of optimization. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 1733–1752. SIAM, 2020.
- David P. Woodruff. Sketching as a tool for numerical linear algebra. Found. Trends Theor. Comput. Sci., 10(1-2):1–157, 2014.
- Tight bounds for $\ell_p$ oblivious subspace embeddings. In Timothy M. Chan, editor, Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pages 1825–1843. SIAM, 2019.
- Subspace embeddings and $l_p$ regression using exponential random variables. In Shai Shalev-Shwartz and Ingo Steinwart, editors, COLT 2013 - The 26th Annual Conference on Learning Theory, June 12-14, 2013, Princeton University, NJ, USA, volume 30 of JMLR Workshop and Conference Proceedings, pages 546–567. JMLR.org, 2013.
- Vladimir M Zolotarev. One-dimensional stable distributions, volume 65. American Mathematical Soc., 1986.