Accounting for Unobservable Heterogeneity in Cross Section Using Spatial First Differences
We develop a simple cross-sectional research design to identify causal effects that is robust to unobservable heterogeneity. When many observational units are dense in physical space, it may be sufficient to regress the “spatial first differences” (SFD) of the outcome on the treatment and omit all covariates. This approach is conceptually similar to first differencing approaches in time-series or panel models, except the index for time is replaced with an index for locations in space. The SFD design identifies plausibly causal effects, even when no instruments are available, so long as local changes in the treatment and unobservable confounders are not systematically correlated between immediately adjacent neighbors. We demonstrate the SFD approach by recovering new cross-sectional estimates for the effects of time-invariant geographic factors, soil and climate, on long-run average crop productivities across US counties — relationships that are notoriously confounded by unobservables but crucial for guiding economic decisions, such as land management and climate policy.