Logo image
Scalable Bayes under Informative Sampling
Journal article   Peer reviewed

Scalable Bayes under Informative Sampling

Terrance D Savitsky and Sanvesh Srivastava
Scandinavian Journal of Statistics, Vol.45(3), pp.534-556
09/2018
DOI: 10.1111/sjos.12312
url
https://arxiv.org/pdf/1606.07488View
Open Access

Abstract

Bayesian hierarchical formulations are utilized by the U.S. Bureau of Labor Statistics (BLS) with respondent‐level data for missing item imputation because these formulations are readily parameterized to capture correlation structures. BLS collects survey data under informative sampling designs that assign probabilities of inclusion to be correlated with the response on which sampling‐weighted pseudo posterior distributions are estimated for asymptotically unbiased inference about population model parameters. Computation is expensive and does not support BLS production schedules. We propose a new method to scale the computation that divides the data into smaller subsets, estimates a sampling‐weighted pseudo posterior distribution, in parallel, for every subset and combines the pseudo posterior parameter samples from all the subsets through their mean in the Wasserstein space of order 2. We construct conditions on a class of sampling designs where posterior consistency of the proposed method is achieved. We demonstrate on both synthetic data and in application to the Current Employment Statistics survey that our method produces results of similar accuracy as the usual approach while offering substantially faster computation.
survey sampling Markov chain Monte Carlo posterior consistency barycentre distributed Bayesian computations Wasserstein distance current employment statistics survey pseudo posterior distribution

Details

Metrics

Logo image