Journal article
Deriving enhanced geographical representations via similarity-based spectral analysis: predicting colorectal cancer survival curves in Iowa
International Journal of Data Mining and Bioinformatics, Vol.21(3), pp.183-211
2018
DOI: 10.1504/IJDMB.2018.097677
Abstract
Neural networks are capable of learning rich, nonlinear feature representations shown to be beneficial in many predictive tasks. In this work, we use such models to explore different geographical feature representations in the context of predicting colorectal cancer survival curves for patients in the state of Iowa, spanning the years 1989 to 2013. Specifically, we compare model performance using area between the curves (ABC) to assess (a) whether survival curves can be reasonably predicted for colorectal cancer patients in the state of Iowa, (b) whether geographical features improve predictive performance, (c) whether a simple binary representation or a richer, spectral analysis-elicited representation performs better, and (d) whether spectral analysis-based representations can be improved upon by leveraging geographically descriptive features. In exploring (d), we devise a similarity-based spectral analysis procedure, which allows for the combination of geographically relational and geographically descriptive features. Our findings suggest that survival curves can be reasonably estimated on average, with predictive performance deviating at the five-year survival mark among all models. We also find that geographical features improve predictive performance, and that better performance is obtained using richer, spectral analysis-elicited features. Furthermore, we find that similarity-based spectral analysis-elicited representations improve upon the original spectral analysis results by approximately 40%.
Details
- Title: Subtitle
- Deriving enhanced geographical representations via similarity-based spectral analysis: predicting colorectal cancer survival curves in Iowa
- Creators
- Michael T Lash - Department of Computer Science, University of Iowa, Iowa City, IA, USA
- Min Zhang - Interdisciplinary Graduate Program in Informatics, University of Iowa, Iowa City, IA, USA
- Xun Zhou - Management Sciences Department, University of Iowa, Iowa City, IA, USA
- W. Nick Street - Management Sciences Department, University of Iowa, Iowa City, IA, USA
- Charles F Lynch - Department of Epidemiology, University of Iowa, Iowa City, IA, USA
- Resource Type
- Journal article
- Publication Details
- International Journal of Data Mining and Bioinformatics, Vol.21(3), pp.183-211
- DOI
- 10.1504/IJDMB.2018.097677
- ISSN
- 1748-5673
- eISSN
- 1748-5681
- Publisher
- Inderscience Publishers (IEL)
- Language
- English
- Date published
- 2018
- Academic Unit
- Bus Admin College; Epidemiology; Nursing; Computer Science; Business Analytics
- Record Identifier
- 9983996092602771