Privacy Protection and Accuracy: What Do We Know? Do We Know Things?? Let's Find Out!
Statistical agencies have a dual mandate to provide accurate data and protect the privacy and confidentiality of data subjects. These mandates are fundamentally at odds and therefore must be balanced: more accurate data reduces privacy, while privacy protections introduce error that reduces accuracy. Balancing accuracy and privacy requires, among other things, that we can quantify accuracy and privacy. Quantifying privacy has become easier thanks to differential privacy. Quantifying accuracy may sound easy by comparison, but there are many challenges to doing this effectively. In this paper, we first discuss some challenges associated with quantifying data accuracy. We then focus on an often-ignored challenge, which is the existence of survey error in the data being protected. We provide an overview of how privacy protection error relates to total survey error. We also summarize recent work that uses validation data to quantify the impact of privacy protection error relative to and conditional on other sources of survey error. Finally, we discuss opportunities and challenges for future work on data privacy and survey error.
Published Versions
Forthcoming: Privacy Protection and Accuracy: What Do We Know? Do We Know Things?? Let's Find Out!, Evan S. Totty, Thor Watson. in Data Privacy Protection and the Conduct of Applied Research: Methods, Approaches and their Consequences, Gong, Hotz, and Schmutte. 2024