Journal article
PROJECTED PRINCIPAL COMPONENT ANALYSIS IN FACTOR MODELS
The Annals of statistics, Vol.44(1), pp.219-254
02/01/2016
DOI: 10.1214/15-AOS1364
PMCID: PMC4714810
PMID: 26783374
Abstract
This paper introduces a Projected Principal Component Analysis (Projected-PCA), which employs principal component analysis to the projected (smoothed) data matrix onto a given linear space spanned by covariates. When it applies to high-dimensional factor analysis, the projection removes noise components. We show that the unobserved latent factors can be more accurately estimated than the conventional PCA if the projection is genuine, or more precisely, when the factor loading matrices are related to the projected linear space. When the dimensionality is large, the factors can be estimated accurately even when the sample size is finite. We propose a flexible semiparametric factor model, which decomposes the factor loading matrix into the component that can be explained by subject-specific covariates and the orthogonal residual component. The covariates' effects on the factor loadings are further modeled by the additive model via sieve approximations. By using the newly proposed Projected-PCA, the rates of convergence of the smooth factor loading matrices are obtained, which are much faster than those of the conventional factor analysis. The convergence is achieved even when the sample size is finite and is particularly appealing in the high-dimension-low-sample-size situation. This leads us to developing nonparametric tests on whether observed covariates have explaining powers on the loadings and whether they fully explain the loadings. The proposed method is illustrated by both simulated data and the returns of the components of the S&P 500 index.
Details
- Title: Subtitle
- PROJECTED PRINCIPAL COMPONENT ANALYSIS IN FACTOR MODELS
- Creators
- Jianqing Fan - University of Maryland, College ParkYuan Liao - Princeton UniversityWeichen Wang - University of Maryland, College Park
- Resource Type
- Journal article
- Publication Details
- The Annals of statistics, Vol.44(1), pp.219-254
- DOI
- 10.1214/15-AOS1364
- PMID
- 26783374
- PMCID
- PMC4714810
- NLM abbreviation
- Ann Stat
- ISSN
- 0090-5364
- eISSN
- 2168-8966
- Publisher
- Inst Mathematical Statistics
- Number of pages
- 36
- Grant note
- DMS-12-06464; 2R01-GM072611-9 / NSF; National Science Foundation (NSF) R01GM072611 / NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES; United States Department of Health & Human Services; National Institutes of Health (NIH) - USA; NIH National Institute of General Medical Sciences (NIGMS) University of Maryland
- Language
- English
- Date published
- 02/01/2016
- Academic Unit
- Economics
- Record Identifier
- 9984936839102771
Metrics
6 Record Views