Endogeneity in ultrahigh dimension

Jianqing Fan Yuan Liao

Statistics Theory and Methods mathscidoc:1912.43398

Available at SSRN 2045864, 2012.4
Most papers on high-dimensional statistics assume that none of the regressors is correlated with the regression error, namely, that they are exogenous. Yet endogeneity arises easily in high-dimensional regression because of the large pool of regressors, and it causes inconsistency of penalized least-squares methods and possibly false scientific discoveries. A necessary condition for model selection consistency of a very general class of penalized regression methods is given, which allows us to prove the inconsistency claim formally. To cope with possible endogeneity, we construct a novel penalized focussed generalized method of moments (FGMM) criterion function and offer a new optimization algorithm. The FGMM criterion is not a smooth function. To establish its asymptotic properties, we first study model selection consistency and an oracle property for a general class of penalized regression methods. These results are then used to show that the FGMM possesses an oracle property even in the presence of endogenous predictors, and that the solution is also a near-global minimum under the over-identification assumption. Finally, we show how semi-parametric efficiency of estimation can be achieved via a two-step approach.
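The inconsistency claim can be illustrated with a small simulation. The sketch below is not the FGMM procedure: it is a two-regressor toy design, with no penalty, chosen only to show why a regressor that is correlated with the error misleads least squares while a moment condition built on instruments does not. The function name `simulate_estimates`, the instruments `w`, the noise distributions, and all coefficients are assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def simulate_estimates(n=50_000, seed=0):
    """Return (ols, iv) coefficient estimates for a toy endogenous design."""
    rng = np.random.default_rng(seed)

    # Hypothetical instruments, drawn independently of the regression error.
    w = rng.normal(size=(n, 2))
    eps = rng.normal(size=n)

    # x1 is exogenous; x2 is endogenous because it contains eps.
    x1 = w[:, 0] + rng.normal(size=n)
    x2 = w[:, 1] + eps + rng.normal(size=n)
    X = np.column_stack([x1, x2])

    # True model: only x1 matters; the coefficient on x2 is exactly 0.
    y = 2.0 * x1 + eps

    # Least squares solves the sample version of E[X (y - X b)] = 0,
    # a moment condition that does not hold at the true b for x2.
    ols = np.linalg.lstsq(X, y, rcond=None)[0]

    # Just-identified IV solves the sample version of E[W (y - X b)] = 0,
    # which does hold at the true b because W is independent of eps.
    iv = np.linalg.solve(w.T @ X, w.T @ y)
    return ols, iv


if __name__ == "__main__":
    ols, iv = simulate_estimates()
    print("OLS:", ols)
    print("IV: ", iv)
```

In this design the population least-squares coefficient on `x2` is cov(x2, eps) / var(x2) = 1/3 rather than 0, so a least-squares-based selector would tend to keep an irrelevant regressor, whereas the instrument-based estimate targets the true value (2, 0).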
Uploaded 2019-12-21 11:41:55 by Jianqing_Fan.
Jianqing Fan and Yuan Liao. Endogeneity in ultrahigh dimension. 2012. Available at SSRN 2045864. http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20191221114156000421958.