Comparing Population Means under Local Differential Privacy: with Significance and Power

03/24/2018
by Bolin Ding, et al.

A statistical hypothesis test determines whether a hypothesis should be rejected based on samples from populations. In particular, randomized controlled experiments (or A/B testing) that compare population means using, e.g., t-tests, have been widely deployed in technology companies to aid in making data-driven decisions. Samples used in these tests are collected from users and may contain sensitive information. Both the data collection and the testing process may compromise individuals' privacy. In this paper, we study how to conduct hypothesis tests to compare population means while preserving privacy. We use the notion of local differential privacy (LDP), which has recently emerged as the main tool to ensure each individual's privacy without requiring a trusted data collector. We propose LDP tests that inject noise into every user's data in the samples before collecting them (so users do not need to trust the data collector), and draw conclusions with bounded type-I (significance level) and type-II errors (1 - power). Our approaches can be extended to the scenario where some users require LDP while others are willing to provide exact data. We report experimental results on real-world datasets to verify the effectiveness of our approaches.
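To make the idea of injecting noise before collection concrete, here is a minimal sketch in Python. It is not the paper's test: it assumes each user's value lies in a known range [0, upper], perturbs it on the user's side with the standard Laplace mechanism (scale upper / epsilon), and then computes a Welch-style t statistic on the noisy values. The function names (`laplace_noise`, `privatize`, `welch_statistic`) and all parameter values are hypothetical choices made for illustration.

```python
import math
import random
import statistics


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as a scaled difference of two Exp(1) draws."""
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def privatize(value, upper, epsilon, rng):
    """Run on the user's side: clip to [0, upper], then add Laplace noise.

    Assumption: values are bounded in [0, upper], so the sensitivity is
    `upper` and a noise scale of upper / epsilon gives epsilon-LDP.
    """
    clipped = min(max(value, 0.0), upper)
    return clipped + laplace_noise(upper / epsilon, rng)


def welch_statistic(sample_a, sample_b):
    """Welch t statistic for the difference of two sample means."""
    var_a = statistics.variance(sample_a)
    var_b = statistics.variance(sample_b)
    std_err = math.sqrt(var_a / len(sample_a) + var_b / len(sample_b))
    return (statistics.fmean(sample_a) - statistics.fmean(sample_b)) / std_err


if __name__ == "__main__":
    rng = random.Random(7)
    upper, epsilon = 1.0, 1.0  # hypothetical range bound and privacy budget
    # Synthetic data: the two groups differ in their true means.
    control = [rng.uniform(0.0, 0.8) for _ in range(5000)]
    treatment = [rng.uniform(0.2, 1.0) for _ in range(5000)]
    # The collector only ever sees the perturbed values.
    noisy_control = [privatize(v, upper, epsilon, rng) for v in control]
    noisy_treatment = [privatize(v, upper, epsilon, rng) for v in treatment]
    print("t on exact data:", welch_statistic(treatment, control))
    print("t on noisy data:", welch_statistic(noisy_treatment, noisy_control))
```

The added noise has mean zero, so the difference of noisy sample means still estimates the difference of population means, but each observation's variance grows by 2 (upper / epsilon)^2. The statistic on noisy data is therefore typically smaller in magnitude, which is why larger samples are needed to reach the same power at a fixed significance level.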


research
07/02/2018

An Algorithmic Framework For Differentially Private Data Analysis on Trusted Processors

Differential privacy has emerged as the main definition for private data...
research
05/24/2019

Hypothesis Testing Interpretations and Renyi Differential Privacy

Differential privacy is the gold standard in data privacy, with applicat...
research
04/01/2022

LDP-IDS: Local Differential Privacy for Infinite Data Streams

Streaming data collection is essential to real-time data analytics in va...
research
08/09/2020

Local Differential Privacy and Its Applications: A Comprehensive Survey

With the fast development of Information Technology, a tremendous amount...
research
10/15/2021

Multivariate Mean Comparison under Differential Privacy

The comparison of multivariate population means is a central task of sta...
research
09/02/2023

A Survey of Local Differential Privacy and Its Variants

The introduction and advancements in Local Differential Privacy (LDP) va...
research
02/09/2018

Automatic Passenger Counting: Introducing the t-Test Induced Equivalence Test

Automatic passenger counting in public transport has been emerging rapid...
