Journal article
Integrative analysis and variable selection with multiple high-dimensional data sets
Biostatistics (Oxford, England), Vol.12(4), pp.763-775
10/2011
DOI: 10.1093/biostatistics/kxr004
PMCID: PMC3169668
PMID: 21415015
Abstract
In high-throughput -omics studies, markers identified from analysis of single data sets often suffer from a lack of reproducibility because of sample limitation. A cost-effective remedy is to pool data from multiple comparable studies and conduct integrative analysis. Integrative analysis of multiple -omics data sets is challenging because of the high dimensionality of data and heterogeneity among studies. In this article, for marker selection in integrative analysis of data from multiple heterogeneous studies, we propose a 2-norm group bridge penalization approach. This approach can effectively identify markers with consistent effects across multiple studies and accommodate the heterogeneity among studies. We propose an efficient computational algorithm and establish the asymptotic consistency property. Simulations and applications in cancer profiling studies show satisfactory performance of the proposed approach.
Details
- Title: Subtitle
- Integrative analysis and variable selection with multiple high-dimensional data sets
- Creators
- Shuangge Ma - School of Public Health, Yale University, 60 College Street, New Haven, CT 06520, USA. shuangge.ma@yale.eduJian HuangXiao Song
- Resource Type
- Journal article
- Publication Details
- Biostatistics (Oxford, England), Vol.12(4), pp.763-775
- Publisher
- England
- DOI
- 10.1093/biostatistics/kxr004
- PMID
- 21415015
- PMCID
- PMC3169668
- ISSN
- 1468-4357
- eISSN
- 1468-4357
- Grant note
- LM009828 / NLM NIH HHS CA142774 / NCI NIH HHS CA120988 / NCI NIH HHS
- Language
- English
- Date published
- 10/2011
- Academic Unit
- Statistics and Actuarial Science
- Record Identifier
- 9983985967702771
Metrics
24 Record Views