Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data

Christopher W. Bartlett; Jamie Bossenbroek; Yukie Ueyama; Patricia McCallinhart; Olivia A. Peters; Donna A. Santillan; Mark K. Santillan; Aaron J. Trask; William C. Ray

doi:10.1007/s42979-022-01553-8

Back

Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data

Journal article

Open access

Peer reviewed

Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data

Christopher W. Bartlett, Jamie Bossenbroek, Yukie Ueyama, Patricia McCallinhart, Olivia A. Peters, Donna A. Santillan, Mark K. Santillan, Aaron J. Trask and William C. Ray

SN computer science, Vol.4(2), 161

01/12/2023

DOI: 10.1007/s42979-022-01553-8

PMCID: PMC9836982

PMID: 36647373

Files and links (1)

url

https://doi.org/10.1007/s42979-022-01553-8View

Published (Version of record) Open Access

Abstract

Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a challenge. In this paper, we suggest that for some biomedical applications, a natural dichotomy of invasive/non-invasive measurements, or more generally proximal vs distal measurements of a biological system can be exploited to provide objective advice on early stopping. We discuss the conditions where invasive measurements of a biological process should provide better predictions than non-invasive measurements, or at best offer parity. Hence, if data from an invasive measurement are available locally, or from the literature, that information can be leveraged to know with high certainty whether a model of non-invasive data is overfitted. We present paired invasive/non-invasive cardiac and coronary artery measurements from two mouse strains, one of which spontaneously develops type 2 diabetes, posed as a classification problem. Examination of the various stopping rules shows that generalization is reduced with more training epochs and commonly applied stopping rules give widely different generalization error estimates. The use of an empirically derived training ceiling is demonstrated to be helpful as added information to leverage early stopping in order to reduce overfitting.

Computer Science

Vision

Computer Imaging

Computer Systems Organization and Communication Networks

Data Structures and Information Theory

General

Information Systems and Communication Service

Original Research

Pattern Recognition and Graphics

Signal Processing and Multimedia Applications

Software Engineering/Programming and Operating Systems

Details

Title: Subtitle: Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
Creators: Christopher W. Bartlett - The Ohio State University
Jamie Bossenbroek - The Ohio State University
Yukie Ueyama - Nationwide Children's Hospital
Patricia McCallinhart - Nationwide Children's Hospital
Olivia A. Peters - Department of Obstetrics & Gynecology, University of Iowa Hospitals & Clinics
Donna A. Santillan - University of Iowa
Mark K. Santillan - University of Iowa
Aaron J. Trask - The Ohio State University
William C. Ray - The Ohio State University
Resource Type: Journal article
Publication Details: SN computer science, Vol.4(2), 161
DOI: 10.1007/s42979-022-01553-8
PMID: 36647373
PMCID: PMC9836982
NLM abbreviation: SN Comput Sci
ISSN: 2661-8907
eISSN: 2661-8907
Publisher: Springer Nature Singapore
Grant note: UL1TR002537; R00HL116769; R21EB026518; R01HD08994 / National Institutes of Health (http://dx.doi.org/10.13039/100000002) P50HD105556 / National Institutes of Health (http://dx.doi.org/10.13039/100000002)
Language: English
Date published: 01/12/2023
Academic Unit: Obstetrics and Gynecology
Record Identifier: 9984358155802771

Metrics

30 Record Views