Alleviating Linear Ecological Bias and Optimal Design with Subsample Data

Citation:

Glynn A, Wakefield J, Handcock M, Richardson T. Alleviating Linear Ecological Bias and Optimal Design with Subsample Data. Journal of the Royal Statistical Society: Series A. 2008;171 (1) :179-202.

Abstract:

In this paper, we illustrate that combining ecological data with subsample data in situations in which a linear model is appropriate provides two main benefits. First, by including the individual level subsample data, the biases associated with linear ecological inference can be eliminated. Second, we can use readily available ecological data to design optimal subsampling schemes, so as to maximize information about parameters. We present an application of this methodology to the classic problem of estimating the effect of a college degree on wages, showing that small, optimally chosen subsamples can be combined with ecological data to generate precise estimates relative to a simple random subsample.