
Asymptotic locations of bounded and unbounded eigenvalues of sample correlation matrices of certain factor models -- application to a components retention rule (2407.07282v1)

Published 10 Jul 2024 in math.ST and stat.TH

Abstract: Let the dimension $N$ of data and the sample size $T$ tend to $\infty$ with $N/T \to c > 0$. The spectral properties of a sample correlation matrix $\mathbf{C}$ and a sample covariance matrix $\mathbf{S}$ are asymptotically equal whenever the population correlation matrix $\mathbf{R}$ is bounded (El Karoui 2009). We demonstrate this also for general linear models with unbounded $\mathbf{R}$, by examining the behavior of the singular values of multiplicatively perturbed matrices. From this, we establish the following. Consider a factor model with idiosyncratic noise variance $\sigma^2$ and a rank-$r$ factor loading matrix $\mathbf{L}$ whose rows all have common Euclidean norm $L$. Then the $k$th largest eigenvalues $\lambda_k$ $(1\le k\le N)$ of $\mathbf{C}$ satisfy almost surely: (1) $\lambda_r$ diverges, (2) $\lambda_k/s_k^2\to1/(L^2 + \sigma^2)$ $(1 \le k \le r)$ for the $k$th largest singular value $s_k$ of $\mathbf{L}$, and (3) $\lambda_{r + 1}\to(1-\rho)(1+\sqrt{c})^2$ for $\rho := L^2/(L^2 + \sigma^2)$. Whenever $s_r$ is much larger than $\sqrt{\log N}$, the estimate of the broken-stick rule (Frontier 1976, Jackson 1993), which estimates $\mathrm{rank}\, \mathbf{L}$ by a random partition (Holst 1980) of $[0,\,1]$, tends to $r$ (a.s.). We also provide a natural factor model where the rule tends to the "essential rank" of $\mathbf{L}$ (a.s.), which is smaller than $\mathrm{rank}\, \mathbf{L}$.
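The limits stated in the abstract can be checked numerically on a small example. The following is a minimal sketch, not the authors' code. It assumes Gaussian factors and noise, a loading matrix built by rescaling random rows to a common norm `ell`, and the commonly used form of the broken-stick rule, which retains the leading components whose eigenvalue $\lambda_k$ exceeds $b_k = \sum_{i=k}^{N} 1/i$. The function names and the parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def broken_stick_thresholds(n):
    """Return b_k = sum_{i=k}^{n} 1/i for k = 1..n (expected ordered stick lengths times n)."""
    inv = 1.0 / np.arange(1, n + 1)
    return np.cumsum(inv[::-1])[::-1]


def broken_stick_rank(eigenvalues):
    """Count the leading eigenvalues (sorted descending) that exceed their threshold."""
    eig = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]
    exceeds = eig > broken_stick_thresholds(eig.size)
    if exceeds.all():
        return eig.size
    # Index of the first False equals the number of leading True entries.
    return int(np.argmin(exceeds))


def simulate_correlation_eigenvalues(N, T, r, ell, sigma, seed=0):
    """Simulate X = L F + sigma * E; return eigenvalues of C and singular values of L."""
    rng = np.random.default_rng(seed)
    # Assumption: random loadings, each row rescaled to Euclidean norm ell.
    L = rng.standard_normal((N, r))
    L *= ell / np.linalg.norm(L, axis=1, keepdims=True)
    F = rng.standard_normal((r, T))   # factors (assumed Gaussian)
    E = rng.standard_normal((N, T))   # idiosyncratic noise (assumed Gaussian)
    X = L @ F + sigma * E
    C = np.corrcoef(X)                # rows of X are the N variables
    eig = np.linalg.eigvalsh(C)[::-1] # descending order
    s = np.linalg.svd(L, compute_uv=False)
    return eig, s


# Illustrative parameters (hypothetical, chosen only to keep the run fast).
N, T, r, ell, sigma = 200, 400, 2, 1.0, 1.0
eig, s = simulate_correlation_eigenvalues(N, T, r, ell, sigma)

c = N / T
rho = ell**2 / (ell**2 + sigma**2)

print("lambda_k / s_k^2:", eig[:r] / s**2, "limit:", 1.0 / (ell**2 + sigma**2))
print("lambda_{r+1}:", eig[r], "limit:", (1.0 - rho) * (1.0 + np.sqrt(c)) ** 2)

estimated_rank = broken_stick_rank(eig)
print("broken-stick estimate of rank L:", estimated_rank)
```

At finite $N$ and $T$ the printed ratios only approximate the stated limits. The thresholds are compared directly with the eigenvalues because the eigenvalues of $\mathbf{C}$ sum to $N$.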
