Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Near-Linear Sample Complexity for $L_p$ Polynomial Regression (2211.06790v1)

Published 13 Nov 2022 in cs.DS

Abstract: We study $L_p$ polynomial regression. Given query access to a function $f:[-1,1] \rightarrow \mathbb{R}$, the goal is to find a degree $d$ polynomial $\hat{q}$ such that, for a given parameter $\varepsilon > 0$, $$ |\hat{q}-f|p\le (1+\varepsilon) \cdot \min{q:\text{deg}(q)\le d}|q-f|p. $$ Here $|\cdot|_p$ is the $L_p$ norm, $|g|_p = (\int{-1}1 |g(t)|p dt){1/p}$. We show that querying $f$ at points randomly drawn from the Chebyshev measure on $[-1,1]$ is a near-optimal strategy for polynomial regression in all $L_p$ norms. In particular, to find $\hat q$, it suffices to sample $O(d\, \frac{\text{polylog}\,d}{\text{poly}\,\varepsilon})$ points from $[-1,1]$ with probabilities proportional to this measure. While the optimal sample complexity for polynomial regression was well understood for $L_2$ and $L_\infty$, our result is the first that achieves sample complexity linear in $d$ and error $(1+\varepsilon)$ for other values of $p$ without any assumptions. Our result requires two main technical contributions. The first concerns $p\leq 2$, for which we provide explicit bounds on the $L_p$ Lewis weight function of the infinite linear operator underlying polynomial regression. Using tools from the orthogonal polynomial literature, we show that this function is bounded by the Chebyshev density. Our second key contribution is to take advantage of the structure of polynomials to reduce the $p>2$ case to the $p\leq 2$ case. By doing so, we obtain a better sample complexity than what is possible for general $p$-norm linear regression problems, for which $\Omega(d{p/2})$ samples are required.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Raphael A. Meyer (8 papers)
  2. Cameron Musco (82 papers)
  3. Christopher Musco (66 papers)
  4. David P. Woodruff (206 papers)
  5. Samson Zhou (76 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.