Representative random sampling for feature engineering of -Omics data: using machine learning to identify biomarkers for head and neck squamous cell carcinoma
Abstract
Details
- Title: Subtitle
- Representative random sampling for feature engineering of -Omics data: using machine learning to identify biomarkers for head and neck squamous cell carcinoma
- Creators
- Michael C. Rendleman
- Contributors
- Thomas L Casavant (Advisor)Terry A Braun (Committee Member)John M Buatti (Committee Member)Guadalupe Canahuate (Committee Member)Brian J Smith (Committee Member)
- Resource Type
- Dissertation
- Degree Awarded
- Doctor of Philosophy (PhD), University of Iowa
- Degree in
- Electrical and Computer Engineering
- Date degree season
- Autumn 2021
- DOI
- 10.17077/etd.006280
- Publisher
- University of Iowa
- Number of pages
- xv, 93 pages
- Copyright
- Copyright 2021 Michael C. Rendleman
- Language
- English
- Description illustrations
- illustrations (some color)
- Description bibliographic
- Includes bibliographical references (pages 85-88).
- Public Abstract (ETD)
Cancer is a class of diseases that 40% of people are diagnosed with at some point in their lives. In modern medicine, new technologies are changing how we study and understand these diseases. In this thesis, we focus on a class of cancers called Head and Neck squamous cell carcinoma (HNSCC). It ranks 6th in the world by prevalence and is associated with human papillomavirus (HPV) as well as the use of tobacco and alcohol.
Precision medicine is guiding the development of new tests and treatments. Cancer researchers are doing more data collection than ever before, including whole genome and tumor DNA sequencing. A challenge with this kind of data is that it can be voluminous, overwhelming, and difficult to interpret. To be able to make sense of this complex data, researchers use computers and statistics. In recent years, some have been able to apply newer tools such as machine learning to aid in their research.
We propose a new method that can produce more accurate results with limited data. By applying this new approach to publicly-available cancer data, we compare different ways of using machine learning to study HNSCC. Results show that transforming the data with a trained neural network was capable of improving prediction of treatment outcomes. Additionally, this kind of transformed data may prove useful in the diagnosis and treatment of HNSCC.
- Academic Unit
- Electrical and Computer Engineering
- Record Identifier
- 9984210527002771