Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees (1804.09893v2)

Published 26 Apr 2018 in cs.LG, cs.DS, cs.NA, math.NA, and stat.ML

Abstract: Random Fourier features is one of the most popular techniques for scaling up kernel methods, such as kernel ridge regression. However, despite impressive empirical results, the statistical properties of random Fourier features are still not well understood. In this paper we take steps toward filling this gap. Specifically, we approach random Fourier features from a spectral matrix approximation point of view, give tight bounds on the number of Fourier features required to achieve a spectral approximation, and show how spectral matrix approximation bounds imply statistical guarantees for kernel ridge regression. Qualitatively, our results are twofold: on the one hand, we show that random Fourier feature approximation can provably speed up kernel ridge regression under reasonable assumptions. At the same time, we show that the method is suboptimal, and sampling from a modified distribution in Fourier space, given by the leverage function of the kernel, yields provably better performance. We study this optimal sampling distribution for the Gaussian kernel, achieving a nearly complete characterization for the case of low-dimensional bounded datasets. Based on this characterization, we propose an efficient sampling scheme with guarantees superior to random Fourier features in this regime.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Haim Avron (51 papers)
  2. Michael Kapralov (55 papers)
  3. Cameron Musco (82 papers)
  4. Christopher Musco (66 papers)
  5. Ameya Velingker (24 papers)
  6. Amir Zandieh (23 papers)
Citations (153)

Summary

We haven't generated a summary for this paper yet.