lynx   »   [go: up one dir, main page]

IDEAS home Printed from https://ideas.repec.org/a/eee/ecosta/v31y2024icp81-99.html
   My bibliography  Save this article

Differentially Private Goodness-of-Fit Tests for Continuous Variables

Author

Listed:
  • Kwak, Seung Woo
  • Ahn, Jeongyoun
  • Lee, Jaewoo
  • Park, Cheolwoo
Abstract
Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Statistical analysis in consideration of privacy is thus becoming an exciting area of research. Differential privacy can provide a means by which one can measure the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple query from a database and a hypothesis test. The main interest of the work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.

Suggested Citation

  • Kwak, Seung Woo & Ahn, Jeongyoun & Lee, Jaewoo & Park, Cheolwoo, 2024. "Differentially Private Goodness-of-Fit Tests for Continuous Variables," Econometrics and Statistics, Elsevier, vol. 31(C), pages 81-99.
  • Handle: RePEc:eee:ecosta:v:31:y:2024:i:c:p:81-99
    DOI: 10.1016/j.ecosta.2021.09.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S2452306221001143
    Download Restriction: Full text for ScienceDirect subscribers only. Contains open access articles

    File URL: https://libkey.io/10.1016/j.ecosta.2021.09.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wasserman, Larry & Zhou, Shuheng, 2010. "A Statistical Framework for Differential Privacy," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 375-389.
    2. Campano, Fred & Salvatore, Dominick, 2006. "Income Distribution," OUP Catalogue, Oxford University Press, number 9780195300918, Decembrie.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    2. Roberto Dell’Anno & Jorge Martinez-Vazquez, 2013. "A Behavioral Local Public Finance Perspective on the Renter’s Illusion Hypothesis," International Center for Public Policy Working Paper Series, at AYSPS, GSU paper1303, International Center for Public Policy, Andrew Young School of Policy Studies, Georgia State University.
    3. Ron S. Jarmin & John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Nathan Goldschlag & Michael B. Hawes & Sallie Ann Keller & Daniel Kifer & Philip Leclerc & Jerome P. Reiter & Rolando A. Rodrígue, 2023. "An in-depth examination of requirements for disclosure risk assessment," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 120(43), pages 2220558120-, October.
    4. Raj Chetty & John N. Friedman, 2019. "A Practical Method to Reduce Privacy Loss When Disclosing Statistics Based on Small Samples," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 414-420, May.
    5. John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Simson Garfinkel & Micah Heineck & Christine Heiss & Robert Johns & Daniel Kifer & Philip Leclerc & Ashwin Machanavajjhala & Brett Moran & William, 2022. "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Papers 2204.08986, arXiv.org.
    6. Ori Heffetz & Katrina Ligett, 2014. "Privacy and Data-Based Research," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 75-98, Spring.
    7. Dawid, H. & Harting, P. & Neugart, M., 2018. "Cohesion policy and inequality dynamics: Insights from a heterogeneous agents macroeconomic model," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 220-255.
    8. Toth Daniell, 2014. "Data Smearing: An Approach to Disclosure Limitation for Tabular Data," Journal of Official Statistics, Sciendo, vol. 30(4), pages 839-857, December.
    9. Soumya Mukherjee & Aratrika Mustafi & Aleksandra Slavkovi'c & Lars Vilhuber, 2023. "Assessing Utility of Differential Privacy for RCTs," Papers 2309.14581, arXiv.org.
    10. Katherine B. Coffman & Lucas C. Coffman & Keith M. Marzilli Ericson, 2017. "The Size of the LGBT Population and the Magnitude of Antigay Sentiment Are Substantially Underestimated," Management Science, INFORMS, vol. 63(10), pages 3168-3186, October.
    11. Chongliang Luo & Md. Nazmul Islam & Natalie E. Sheils & John Buresh & Jenna Reps & Martijn J. Schuemie & Patrick B. Ryan & Mackenzie Edmondson & Rui Duan & Jiayi Tong & Arielle Marks-Anglin & Jiang Bi, 2022. "DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    12. Shaughnessy, Timothy M. & White, Mary L. & Brendler, Michael D., 2010. "The Income Distribution Effect of Natural Disasters: An Analysis of Hurricane Katrina," Journal of Regional Analysis and Policy, Mid-Continent Regional Science Association, vol. 40(01), pages 1-12.
    13. Salvatore, Dominick, 2010. "Growth or stagnation after recession for the U.S. and other large advanced economies," Journal of Policy Modeling, Elsevier, vol. 32(5), pages 637-647, September.
    14. Lalanne, Clément & Gadat, Sébastien, 2024. "Privately Learning Smooth Distributions on the Hypercube by Projections," TSE Working Papers 24-1505, Toulouse School of Economics (TSE).
    15. Maria Denisa VASILESCU & Larisa STANILA & Amalia CRISTESCU, 2014. "The evolution of earnings inequality in Romania," Romanian Journal of Economics, Institute of National Economy, vol. 39(2(48)), pages 88-99, December.
    16. Rukmani Gounder & Zhongwei Xing, 2012. "The measurement of inequality in Fiji's household income distribution," International Journal of Social Economics, Emerald Group Publishing Limited, vol. 39(4), pages 264-280, March.
    17. Chang, Jinyuan & Hu, Qiao & Kolaczyk, Eric D. & Yao, Qiwei & Yi, Fengting, 2024. "Edge differentially private estimation in the β-model via jittering and method of moments," LSE Research Online Documents on Economics 122099, London School of Economics and Political Science, LSE Library.
    18. Walker, Douglas O., 2007. "Patterns of income distribution among world regions," Journal of Policy Modeling, Elsevier, vol. 29(4), pages 643-655.
    19. Claire McKay Bowen & Fang Liu & Bingyue Su, 2021. "Differentially private data release via statistical election to partition sequentially," METRON, Springer;Sapienza Università di Roma, vol. 79(1), pages 1-31, April.
    20. Salvatore, Dominick & Campano, Fred, 2022. "Regional differences in inequality and income distribution in the United States," Journal of Policy Modeling, Elsevier, vol. 44(4), pages 780-789.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecosta:v:31:y:2024:i:c:p:81-99. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/econometrics-and-statistics .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.
    Лучший частный хостинг