Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
162 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Isotonic Regression in Multi-Dimensional Spaces and Graphs (1812.08944v2)

Published 21 Dec 2018 in math.ST and stat.TH

Abstract: In this paper we study minimax and adaptation rates in general isotonic regression. For uniform deterministic and random designs in $[0,1]d$ with $d\ge 2$ and $N(0,1)$ noise, the minimax rate for the $\ell_2$ risk is known to be bounded from below by $n{-1/d}$ when the unknown mean function $f$ is nondecreasing and its range is bounded by a constant, while the least squares estimator (LSE) is known to nearly achieve the minimax rate up to a factor $(\log n)\gamma$ where $n$ is sample size, $\gamma = 4$ in the lattice design and $\gamma = \max{9/2, (d2+d+1)/2 }$ in the random design. Moreover, the LSE is known to achieve the adaptation rate $(K/n){-2/d}{1\vee \log(n/K)}{2\gamma}$ when $f$ is piecewise constant on $K$ hyperrectangles in a partition of $[0,1]d$. Due to the minimax theorem, the LSE is identical on every design point to both the max-min and min-max estimators over all upper and lower sets containing the design point. This motivates our consideration of estimators which lie in-between the max-min and min-max estimators over possibly smaller classes of upper and lower sets, including a subclass of block estimators. Under a $q$-th moment condition on the noise, we develop $\ell_q$ risk bounds for such general estimators for isotonic regression on graphs. For uniform deterministic and random designs in $[0,1]d$ with $d\ge 3$, our $\ell_2$ risk bound for the block estimator matches the minimax rate $n{-1/d}$ when the range of $f$ is bounded and achieves the near parametric adaptation rate $(K/n){1\vee\log(n/K)}{d}$ when $f$ is $K$-piecewise constant. Furthermore, the block estimator possesses the following oracle property in variable selection: When $f$ depends on only a subset $S$ of variables, the $\ell_2$ risk of the block estimator automatically achieves up to a poly-logarithmic factor the minimax rate based on the oracular knowledge of $S$.

Summary

We haven't generated a summary for this paper yet.