Secure computation with horizontally partitioned data using adaptive regression splines
Journal article   Peer reviewed

Joyee Ghosh, Jerome P Reiter and Alan F Karr
Computational statistics & data analysis, Vol.51(12), pp.5813-5820
2007
DOI: 10.1016/j.csda.2006.10.013

Abstract

When several data owners possess data on different records but the same variables, known as horizontally partitioned data, the owners can improve statistical inferences by sharing their data with each other. Often, however, the owners are unwilling or unable to share because the data are confidential or proprietary. Secure computation protocols enable the owners to compute parameter estimates for some statistical models, including linear regressions, without sharing individual records’ data. A drawback to these techniques is that the model must be specified in advance of initiating the protocol, and the usual exploratory strategies for determining good-fitting models have limited usefulness since the individual records are not shared. In this paper, we present a protocol for secure adaptive regression splines that allows for flexible, semi-automatic regression modeling. This reduces the risk of model mis-specification inherent in secure computation settings. We illustrate the protocol with air pollution data.
Keywords: Regression; Disclosure; Confidentiality; Secure computation; Spline
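
The abstract does not describe how the secure linear regression step works. As a rough illustration only, the sketch below assumes one approach: each owner computes the sufficient statistics X'X and X'y from its own records, and the owners combine them with a masked running sum so that no owner reveals its individual records. The function names (`local_stats`, `secure_sum`, `secure_ols`) and the toy data are hypothetical and are not taken from the article; the adaptive regression spline protocol itself is not shown.

```python
import random


def local_stats(rows):
    """Owner-side step: compute X'X and X'y from this owner's records only.
    rows is a list of (predictors, response); an intercept column is added."""
    p = len(rows[0][0]) + 1
    xtx = [[0.0] * p for _ in range(p)]
    xty = [0.0] * p
    for x, y in rows:
        xi = [1.0] + list(x)
        for a in range(p):
            xty[a] += xi[a] * y
            for b in range(p):
                xtx[a][b] += xi[a] * xi[b]
    return xtx, xty


def secure_sum(values, rng):
    """Masked running sum (simulated in one process, an assumption of this sketch).
    The first owner starts from a random mask, each owner adds its own value
    to the running total, and the first owner removes the mask at the end."""
    mask = rng.uniform(-1e6, 1e6)
    running = mask
    for v in values:
        running += v
    return running - mask


def solve(a, b):
    """Solve a @ beta = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    beta = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * beta[j] for j in range(i + 1, n))
        beta[i] = (m[i][n] - s) / m[i][i]
    return beta


def secure_ols(owners, seed=0):
    """Pool each entry of X'X and X'y across owners, then solve the normal equations."""
    rng = random.Random(seed)
    stats = [local_stats(rows) for rows in owners]
    p = len(stats[0][1])
    xtx = [[secure_sum([s[0][a][b] for s in stats], rng) for b in range(p)]
           for a in range(p)]
    xty = [secure_sum([s[1][a] for s in stats], rng) for a in range(p)]
    return solve(xtx, xty)


# Hypothetical data: three owners hold different records on the same variables.
# The responses were generated from y = 2 + 3x, so the pooled estimates
# should be close to an intercept of 2 and a slope of 3.
owners = [
    [((0.0,), 2.0), ((1.0,), 5.0), ((2.0,), 8.0)],
    [((3.0,), 11.0), ((4.0,), 14.0)],
    [((5.0,), 17.0), ((6.0,), 20.0), ((7.0,), 23.0)],
]
print(secure_ols(owners))
```

Only aggregated sums cross owner boundaries in this sketch. A practical protocol would typically use modular integer arithmetic for the masks rather than floating-point values, and would pass the running total between parties over a network.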
