Logo image
High-dimensional intervals for penalized regression
Dissertation   Open access

High-dimensional intervals for penalized regression

Logan Harris
University of Iowa
Doctor of Philosophy (PhD), University of Iowa
Autumn 2025
DOI: 10.25820/etd.008201
pdf
harris-logan-2025-10-271.69 MBDownloadView
Open Access

Abstract

The lasso (least absolute shrinkage and selection operator) – and, more broadly, sparse penalized regression methods – are widely used for datasets with many predictors because they can be used for variable selection and coefficient estimation simultaneously. With that said, inference on sparse penalized regression is difficult which has led to a wide array of procedures, many of which focus on significance through the use of false discovery rate (FDR) control. Methods for constructing confidence intervals exist, but the bias introduced by the penalties poses a particular challenge here and debiasing is required to obtain traditional frequentist coverage (i.e., correct 1 − α coverage for each parameter individually). However, the drawback to these approaches is that the intervals are not constructed under the same assumptions as were used for obtaining point estimates. Alternatively, this dissertation develops a coherent framework for constructing intervals that are consistent with corresponding point estimates. Specifically, the intervals target average rather than individual coverage, allowing for bias in the construction just like the corresponding point estimates. We refer to intervals as high-dimensional intervals (HDIs). HDIs have properties that fall in between confidence and credible intervals. In particular, we show that the idea of average coverage aligns with Bayesian inference. Additionally, the interval construction methods proposed leverage the connection between the lasso penalty and a prior. Chapter 2 outlines the alternative framework and provides an initial approach for interval construction, Chapter 3 adds an additional criteria that improves alignment of HDIs with corresponding estimates and introduces two alternative approaches, and Chapter 4 extends the proposed methods to a broader range of penalties and outcome types.
confidence intervals high-dimensional inference lasso penalized regression

Details

Metrics

1 Record Views
Logo image