Logo image
Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
Journal article   Open access   Peer reviewed

Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study

Sangkyu Lee, Xiaolin Liang, Meghan Woods, Anne S Reiner, Patrick Concannon, Leslie Bernstein, Charles F Lynch, John D Boice, Joseph O Deasy, Jonine L Bernstein, …
PloS one, Vol.15(2), pp.e0226157-e0226157
2020
DOI: 10.1371/journal.pone.0226157
PMCID: PMC7046218
PMID: 32106268
url
https://doi.org/10.1371/journal.pone.0226157View
Published (Version of record) Open Access

Abstract

The purpose of this study was to identify germline single nucleotide polymorphisms (SNPs) that optimally predict radiation-associated contralateral breast cancer (RCBC) and to provide new biological insights into the carcinogenic process. Fifty-two women with contralateral breast cancer and 153 women with unilateral breast cancer were identified within the Women's Environmental Cancer and Radiation Epidemiology (WECARE) Study who were at increased risk of RCBC because they were ≤ 40 years of age at first diagnosis of breast cancer and received a scatter radiation dose > 1 Gy to the contralateral breast. A previously reported algorithm, preconditioned random forest regression, was applied to predict the risk of developing RCBC. The resulting model produced an area under the curve (AUC) of 0.62 (p = 0.04) on hold-out validation data. The biological analysis identified the cyclic AMP-mediated signaling and Ephrin-A as significant biological correlates, which were previously shown to influence cell survival after radiation in an ATM-dependent manner. The key connected genes and proteins that are identified in this analysis were previously identified as relevant to breast cancer, radiation response, or both. In summary, machine learning/bioinformatics methods applied to genome-wide genotyping data have great potential to reveal plausible biological correlates associated with the risk of RCBC.
Adult Breast Neoplasms - epidemiology Breast Neoplasms - genetics Breast Neoplasms - radiotherapy Cancer Survivors Case-Control Studies Cohort Studies Female Genome-Wide Association Study - methods Genotype Germ Cells Humans Machine Learning Neoplasms, Radiation-Induced - epidemiology Neoplasms, Radiation-Induced - genetics Polymorphism, Single Nucleotide Risk Factors Young Adult

Details

Metrics

Logo image