Thompson sampling with smoothing splines for hyperparameter optimization: a tuning algorithm that doesn't need tuning
Abstract
Details
- Title: Subtitle
- Thompson sampling with smoothing splines for hyperparameter optimization: a tuning algorithm that doesn't need tuning
- Creators
- Matthew L. Davis
- Contributors
- Brian J. Smith (Advisor)Gideon Zamba (Committee Member)Patrick Breheny (Committee Member)Grant Brown (Committee Member)Sanvesh Srivastava (Committee Member)
- Resource Type
- Dissertation
- Degree Awarded
- Doctor of Philosophy (PhD), University of Iowa
- Degree in
- Biostatistics
- Date degree season
- Spring 2023
- Publisher
- University of Iowa
- DOI
- 10.25820/etd.007158
- Number of pages
- xiii, 225 pages
- Copyright
- Copyright 2023 Matthew L. Davis
- Language
- English
- Date submitted
- 04/22/2023
- Date approved
- 05/08/2023
- Description illustrations
- Illustrations, tables, graphs, charts
- Description bibliographic
- Includes bibliographical references (pages 201-225).
- Public Abstract (ETD)
Machine learning methods are increasing in popularity for biomedical research. The task of hyperparameter optimization - informally referred to as “tuning” - is often the most laborious and time-consuming part of training machine learning models from the biostatistician’s perspective, but is critical for ensuring appropriate performance. The problem of finding optimal hyperparameters rarely admits a tractable formula, so computationally demanding methods like cross-validation and split-sample approaches are usually used instead.
In this dissertation, we propose a novel algorithm for tuning, in which a cubic smoothing spline regression model is used to relate hyperparameters to simple univariate transformations of validated performance, and Thompson sampling is used to decide which grid points to evaluate next. The theoretical properties of the procedure are discussed, and in empirical analyses, its performance is observed to compare favorably to random grid search and other state-of-the-art procedures. Additionally, it is shown how the method can be used asynchronously in parallel, and a stopping rule is devised. Our results demonstrate the potential of Thompson sampling with smoothing splines as a useful tool for tuning, and importantly, does not itself require tuning.
All software used for producing the results and methods in this dissertation, as well as data sets used, results, and an implementation of the proposed method as an R package, are provided online at https://github.com/matthewlouisdavisBioStat/TSSSr.
- Academic Unit
- Biostatistics
- Record Identifier
- 9984425199202771