Multiple testing in high-throughput sequence data: experiences from Group 8 of Genetic Analysis Workshop 17

Inke R König; Jeremie Nsengimana; Charalampos Papachristou; Matthew A Simonson; Kai Wang; Jason A Weisburd

doi:10.1002/gepi.20651

Back

Journal article

Multiple testing in high-throughput sequence data: experiences from Group 8 of Genetic Analysis Workshop 17

Inke R König, Jeremie Nsengimana, Charalampos Papachristou, Matthew A Simonson, Kai Wang and Jason A Weisburd

Genetic epidemiology, Vol.35(S1), pp.S61-S66

2011

DOI: 10.1002/gepi.20651

PMCID: PMC3265920

PMID: 22128061

Files and links (1)

url

https://www.ncbi.nlm.nih.gov/pmc/articles/3265920View

Open Access

Abstract

The use of high-throughput sequence data in genetic epidemiology allows the investigation of common and rare variants in the entire genome, thus increasing the amount of information and the potential number of statistical tests performed within one study. As a consequence, the problem of multiple testing may become even more pressing than in previous studies. As an important challenge, the exact number of statistical tests depends on the actual statistical method used. Furthermore, many statistical approaches for the analysis of sequence data require permutation. Thus it may be difficult to also use permutation to estimate correct type I error levels as in genome-wide association studies. In view of this, a separate group at Genetic Analysis Workshop 17 was formed with a focus on multiple testing. Here, we present the approaches used for the workshop. Apart from tackling the multiple testing problem, the new group focused on different issues. Some contributors developed and investigated modifications of existing collapsing methods. Others aimed at improving the identification of functional variants through a reduction and analysis of the underlying data dimensions. Two research groups investigated the overall accumulation of rare variation across the genome and its value in predicting phenotypes. Finally, other investigators left the path of traditional statistical analyses by reversing null and alternative hypotheses and by proposing a novel resampling method. We describe and discuss all these approaches.

collapsing methods

next-generation sequencing

rare sequence variants

resampling

Details

Title: Subtitle: Multiple testing in high-throughput sequence data: experiences from Group 8 of Genetic Analysis Workshop 17
Creators: Inke R König - Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany
Jeremie Nsengimana - St James's University Hospital
Charalampos Papachristou - University of the Sciences
Matthew A Simonson - University of Colorado Boulder
Kai Wang - University of Iowa, Biostatistics
Jason A Weisburd - Stony Brook University
Resource Type: Journal article
Publication Details: Genetic epidemiology, Vol.35(S1), pp.S61-S66
DOI: 10.1002/gepi.20651
PMID: 22128061
PMCID: PMC3265920
NLM abbreviation: Genet Epidemiol
ISSN: 0741-0395
eISSN: 1098-2272
Publisher: Wiley Subscription Services, Inc., A Wiley Company
Number of pages: 6
Language: English
Date published: 2011
Academic Unit: Biostatistics
Record Identifier: 9984229855702771

Metrics

21 Record Views

1 Times Cited - Web of Science