Journal article
Model selection for correlated data with diverging number of parameters
Statistica Sinica, Vol.23(2), pp.901-927
2013
DOI: 10.5705/ss.2011.058
Abstract
High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases. We propose the penalized quadratic inference function to perform model selection and estimation simultaneously in the framework of a diverging number of regression parameters. The penalized quadratic inference function can easily take correlation information from clustered data into account, yet it does not require specifying the likelihood function. This is advantageous compared to existing model selection methods for discrete data with large cluster size. In addition, the proposed approach enjoys the oracle property; it is able to identify non-zero components consistently with probability tending to 1, and any finite linear combination of the estimated non-zero components has an asymptotic normal distribution. We propose an efficient algorithm by selecting an effective tuning parameter to solve the penalized quadratic inference function. Monte Carlo simulation studies have the proposed method selecting the correct model with a high frequency and estimating covariate effects accurately even when the dimension of parameters is high. We illustrate the proposed approach by analyzing periodontal disease data.
Details
- Title: Subtitle
- Model selection for correlated data with diverging number of parameters
- Creators
- Hyunkeun ChoAnnie Qu
- Resource Type
- Journal article
- Publication Details
- Statistica Sinica, Vol.23(2), pp.901-927
- DOI
- 10.5705/ss.2011.058
- ISSN
- 1017-0405
- eISSN
- 1996-8507
- Language
- English
- Date published
- 2013
- Academic Unit
- Biostatistics
- Record Identifier
- 9984214664202771
Metrics
16 Record Views