Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The $\ell_p$-Subspace Sketch Problem in Small Dimensions with Applications to Support Vector Machines (2211.07132v2)

Published 14 Nov 2022 in cs.DS

Abstract: In the $\ell_p$-subspace sketch problem, we are given an $n\times d$ matrix $A$ with $n>d$, and asked to build a small memory data structure $Q(A,\epsilon)$ so that, for any query vector $x\in\mathbb{R}d$, we can output a number in $(1\pm\epsilon)|Ax|_pp$ given only $Q(A,\epsilon)$. This problem is known to require $\tilde{\Omega}(d\epsilon{-2})$ bits of memory for $d=\Omega(\log(1/\epsilon))$. However, for $d=o(\log(1/\epsilon))$, no data structure lower bounds were known. We resolve the memory required to solve the $\ell_p$-subspace sketch problem for any constant $d$ and integer $p$, showing that it is $\Omega(\epsilon{-2(d-1)/(d+2p)})$ bits and $\tilde{O} (\epsilon{-2(d-1)/(d+2p)})$ words. This shows that one can beat the $\Omega(\epsilon{-2})$ lower bound, which holds for $d = \Omega(\log(1/\epsilon))$, for any constant $d$. We also show how to implement the upper bound in a single pass stream, with an additional multiplicative $\operatorname{poly}(\log \log n)$ factor and an additive $\operatorname{poly}(\log n)$ cost in the memory. Our bounds can be applied to point queries for SVMs with additive error, yielding an optimal bound of $\tilde{\Theta}(\epsilon{-2d/(d+3)})$ for every constant $d$. This is a near-quadratic improvement over the $\Omega(\epsilon{-(d+1)/(d+3)})$ lower bound of (Andoni et al. 2020). Our techniques rely on a novel connection to low dimensional techniques from geometric functional analysis.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Yi Li (483 papers)
  2. Honghao Lin (17 papers)
  3. David P. Woodruff (207 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.