Limiting distribution of the sample canonical correlation coefficients of high-dimensional random vectors

Fan Yang fyang75@wharton.upenn.edu

Probability Statistics Theory and Methods mathscidoc:2110.28004

2021.10
In this paper, we prove a CLT for the sample canonical correlation coefficients between two high-dimensional random vectors with finite rank correlations. More precisely, consider two random vectors $\wt{\bx}=\mathbf x + A \mathbf z $ and $\wt{\by}=\mathbf y + B \mathbf z $, where $\mathbf x \in \R^p$, $\mathbf y \in \R^q$ and $\mathbf z\in \R^r$ are independent random vectors with i.i.d.\;entries of mean zero and variance one, and $A \in \R^{p\times r}$ and $B\in \R^{q\times r}$ are two arbitrary deterministic matrices. Given $n$ samples of $\wt{\bx}$ and $\wt{\by}$, we stack them into two matrices $\cal X= X+AZ$ and $\cal Y= Y+BZ$, where $X\in \R^{p\times n}$, $Y\in \R^{q\times n}$ and $Z\in \R^{r\times n}$ are random matrices with i.i.d.\;entries of mean zero and variance one. Let $\wt\lambda_1 \ge \wt\lambda_2\ge \cdots \ge \wt\lambda_{r}$ be the largest $r$ eigenvalues of the sample canonical correlation (SCC) matrix $\cal C_{\cal X\cal Y}=(\cal X\cal X^\top)^{-1/2}\cal X\cal Y^\top (\cal Y\cal Y^\top)^{-1}\cal Y \cal X^\top (\cal X\cal X^\top)^{-1/2}$, and let $t_1\ge t_2 \ge \cdots\ge t_r$ be the squares of the population canonical correlation coefficients between $\wt{\bx}$ and $\wt{\by}$. Under certain moment assumptions, we show that there exists a threshold $t_c \in(0, 1)$ such that if $t_i>t_c$, then \smash{$\sqrt{n} (\wt\lambda_i-\theta_i)$} converges weakly to a centered normal distribution, where $\theta_i $ is a fixed outlier location determined by $t_i$. Our proof uses a self-adjoint linearization of the SCC matrix and a sharp local law on the inverse of the linearized matrix.
No keywords uploaded!
[ Download ] [ 2021-10-01 12:29:27 uploaded by yangf75 ] [ 1054 downloads ] [ 0 comments ]
@inproceedings{fan2021limiting,
  title={Limiting distribution of the sample canonical correlation coefficients of high-dimensional random vectors},
  author={Fan Yang},
  url={http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20211001122927232977875},
  year={2021},
}
Fan Yang. Limiting distribution of the sample canonical correlation coefficients of high-dimensional random vectors. 2021. http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20211001122927232977875.
Please log in for comment!
 
 
Contact us: office-iccm@tsinghua.edu.cn | Copyright Reserved