An Examination of the Alleged Privacy Threats of Confidence-Ranked Reconstruction of Census Microdata | SpringerLink
Skip to main content

An Examination of the Alleged Privacy Threats of Confidence-Ranked Reconstruction of Census Microdata

  • Conference paper
  • First Online:
Privacy in Statistical Databases (PSD 2024)

Abstract

The threat of reconstruction attacks has led the U.S. Census Bureau (USCB) to replace in the Decennial Census 2020 the traditional statistical disclosure limitation based on rank swapping with one based on differential privacy (DP), leading to substantial accuracy loss of released statistics. Yet, it has been argued that, if many different reconstructions are compatible with the released statistics, most of them do not correspond to actual original data, which protects against respondent reidentification. Recently, a new attack has been proposed, which incorporates the confidence that a reconstructed record was in the original data. The alleged risk of disclosure entailed by such confidence-ranked reconstruction has renewed the interest of the USCB to use DP-based solutions. To forestall a potential accuracy loss in future releases, we show that the proposed reconstruction is neither effective as a reconstruction method nor conducive to disclosure as claimed by its authors. Specifically, we report empirical results showing the proposed ranking cannot guide reidentification or attribute disclosure attacks, and hence fails to warrant the utility sacrifice entailed by the use of DP to release census statistical data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 8465
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 10581
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/terranceliu/rap-rank-reconstruction.

  2. 2.

    https://github.com/NajeebJebreel/CRR-analysis.

References

  1. Abowd, J.: Declaration of John M. Abowd. Case no. 3:21-CV-211-RAH-ECM-KCN (2021)

    Google Scholar 

  2. Abowd, J., Hawes, M.: Confidentiality protection in the 2020 U.S. Census of population and housing. Annu. Rev. Stat. Appl. 10, 119–144 (2023)

    Google Scholar 

  3. Blanco-Justicia, A., Sánchez, D., Domingo-Ferrer, J., Muralidhar, K.: A critical review on the use (and misuse) of differential privacy in machine learning. ACM Comput. Surv. 55(8), 1–16 (2023)

    Article  Google Scholar 

  4. Daily, D.: Disclosure avoidance protection for the American Community Survey (2022). https://www.census.gov/newsroom/blogs/random-samplings/2022/12/disclosure-avoidance-protections-acs.html. Accessed 3 May 2023

  5. Dick, T., et al.: Confidence-ranked reconstruction of census microdata from published statistics. Proc. Natl. Acad. Sci. U.S.A. 120(8), e2218605120 (2023)

    Google Scholar 

  6. Dick, T., et al.: Reply to Sánchez et al.: multiplicity does not protect privacy. Proc. Natl. Acad. Sci. U.S.A. 120(8), e2304263120 (2023)

    Google Scholar 

  7. Domingo-Ferrer, J., Sánchez, D., Blanco-Justicia, A.: The limits of differential privacy (and its misuse in data release and machine learning). Commun. ACM 64(7), 33–35 (2021)

    Article  Google Scholar 

  8. Dove, I.: Applying differential privacy protection to ONS mortality data. Pilot study (2021). https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/applyingdifferentialprivacyprotectiontoonsmortalitydatapilotstudy. Accessed 23 May 2023

  9. Gehrke, J., Hay, M., Lui, E., Pass, R.: Crowd-blending privacy. In: Safavi-Naini, R., Canetti, R. (eds.) Advances in Cryptology – CRYPTO 2012. CRYPTO 2012. LNCS, vol. 7417, pp. 479–496. Springer, Berlin, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32009-5_28

  10. Hawes, M.: Reconstruction and reidentification of the Demographic and Housing Characteristics file (DHC) (2022). https://www2.census.gov/about/partners/cac/sac/meetings/2022-09/presentation-reconstruction-and-re-dentification-of-dhc-file.pdf. Accessed 14 Mar 2023

  11. Hotz, V., et al.: Balancing data privacy and usability in the federal statistical system. Proc. Natl. Acad. Sci. U.S.A. 119(31), e2104906119 (2022)

    Google Scholar 

  12. Keller, S., Abowd, J.: Database reconstruction does compromise confidentiality. Proc. Natl. Acad. Sci. U.S.A. 120(12), e2300976120 (2023)

    Google Scholar 

  13. Kenny, C., Kuriwaki, S., McCartan, C., Rosenman, E., Simko, T., Imai, K.: The use of differential privacy for census data and its impact on redistricting: the case of the 2020 U.S. Census. Sci. Adv. 7(41) (2021)

    Google Scholar 

  14. Li, N., Li, T., Venkatasubramanian, S.: t-Closeness: privacy beyond k-anonymity and l-diversity. In: 23rd IEEE International Conference on Data Engineering (ICDE’07), pp. 106–115 (2007)

    Google Scholar 

  15. Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: L-diversity: privacy beyond k-anonymity. ACM Trans. Knowl. Discov. Data 1(1), 3–es (2007)

    Google Scholar 

  16. Menger, G.: Using 2020 Census data (2021). https://appliedgeographic.com/2021/09/using-2020-census-data/. Accessed 9 May 2023

  17. Muralidhar, K.: A Re-examination of the census bureau reconstruction and reidentification attack. In: Domingo-Ferrer, J., Laurent, M. (eds.) Privacy in Statistical Databases. PSD 2022. LNCS, vol. 13463, pp. 312–323. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-13945-1_22

  18. Muralidhar, K., Domingo-Ferrer, J.: Database reconstruction is not so easy and is different from reidentification. J. Off. Stat. 39(3), 381–398 (2023)

    Article  Google Scholar 

  19. Ruggles, S., Riper, D.V.: The role of chance in the Census Bureau database reconstruction experiment. Popul. Res. Policy Rev. 41, 781–788 (2022)

    Article  Google Scholar 

  20. Samarati, P.: Protecting respondents identities in microdata release. IEEE Trans. Knowl. Data Eng. 13(6), 1010–1027 (2001)

    Article  Google Scholar 

  21. Santos-Lozada, A., Howard, J., Verdery, A.M.: How differential privacy will affect our understanding of health disparities in the united states. Proc. Natl. Acad. Sci. U.S.A. 117(24), 13405–13412 (2020)

    Google Scholar 

  22. Schneider, M.: Researchers ask Census to stop controversial privacy method (2022). https://www.usnews.com/news/business/articles/2022-08-08/researchers-ask-census-to-stop-controversial-privacy-method. Accessed 15 May 2023

  23. U.S. Census Bureau: Developing the DAS: Demonstration data and progress metrics (2020). https://www.census.gov/programs-surveys/decennial-census/decade/2020/planningmanagement/process/disclosure-avoidance/2020-das-development.html

  24. U.S. Census Bureau: Disclosure avoidance for the 2020 Census: An introduction (2021). https://www2.census.gov/library/publications/decennial/2020/2020-census-disclosure-avoidance-handbook.pdf. Accessed 15 May 2023

  25. Zayatz, L., Lucero, J., Massell, P., Ramanayake, A.: Disclosure avoidance for Census 2010 and American Community Survey five-year tabular data products. Technical report. RRS2009-10, Census Bureau (2009). https://www.census.gov/content/dam/Census/library/working-papers/2009/adrm/rrs2009-10.pdf. Accessed 9 Mar 2023

  26. Zayatz, L., Lucero, J., Massell, P., Ramanayake, A.: Disclosure avoidance for Census 2010 and American Community Survey five-year tabular data products (2009), https://www.census.gov/content/dam/Census/library/working-papers/2009/adrm/rrs2009-10.pdf, accessed 9 May 2023

Download references

Acknowledgments

This research was funded by the European Commission (project H2020-871042 “SoBigData++”), the Government of Catalonia (ICREA Acadèmia Prizes to J. Domingo-Ferrer and to D. Sánchez and grant 2021SGR-00115), MCIN/AEI/ 10.13039/501100011033 and “ERDF A way of making Europe” under grants PID2021-123637NB-I00 “CURLING” and PRE2019-089210, and INCIBE and European Union NextGenerationEU/PRTR (project “HERMES” and INCIBE-URV Cybersecurity Chair).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to David Sánchez , Josep Domingo-Ferrer or Alberto Blanco-Justicia .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sánchez, D., Jebreel, N., Muralidhar, K., Domingo-Ferrer, J., Blanco-Justicia, A. (2024). An Examination of the Alleged Privacy Threats of Confidence-Ranked Reconstruction of Census Microdata. In: Domingo-Ferrer, J., Önen, M. (eds) Privacy in Statistical Databases. PSD 2024. Lecture Notes in Computer Science, vol 14915. Springer, Cham. https://doi.org/10.1007/978-3-031-69651-0_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-69651-0_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-69650-3

  • Online ISBN: 978-3-031-69651-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics