Multivariate Mean Comparison under Differential Privacy

10/15/2021
by   Martin Dunsche, et al.
0

The comparison of multivariate population means is a central task of statistical inference. While statistical theory provides a variety of analysis tools, they usually do not protect individuals' privacy. This knowledge can create incentives for participants in a study to conceal their true data (especially for outliers), which might result in a distorted analysis. In this paper we address this problem by developing a hypothesis test for multivariate mean comparisons that guarantees differential privacy to users. The test statistic is based on the popular Hotelling's t^2-statistic, which has a natural interpretation in terms of the Mahalanobis distance. In order to control the type-1-error, we present a bootstrap algorithm under differential privacy that provably yields a reliable test decision. In an empirical study we demonstrate the applicability of this approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2019

Automated Methods for Checking Differential Privacy

Differential privacy is a de facto standard for statistical computations...
research
08/11/2021

Statistical Inference in the Differential Privacy Model

In modern settings of data analysis, we may be running our algorithms on...
research
10/08/2020

Duff: A Dataset-Distance-Based Utility Function Family for the Exponential Mechanism

We propose and analyze a general-purpose dataset-distance-based utility ...
research
09/07/2022

Bayesian and Frequentist Semantics for Common Variations of Differential Privacy: Applications to the 2020 Census

The purpose of this paper is to guide interpretation of the semantic pri...
research
07/20/2022

Improved Generalization Guarantees in Restricted Data Models

Differential privacy is known to protect against threats to validity inc...
research
03/24/2018

Comparing Population Means under Local Differential Privacy: with Significance and Power

A statistical hypothesis test determines whether a hypothesis should be ...
research
03/09/2022

Census TopDown: The Impacts of Differential Privacy on Redistricting

The 2020 Decennial Census will be released with a new disclosure avoidan...

Please sign up or login with your details

Forgot password? Click here to reset