How Much User Context Do We Need? Privacy by Design in Mental Health NLP Application

09/05/2022
by   Ramit Sawhney, et al.
0

Clinical NLP tasks such as mental health assessment from text, must take social constraints into account - the performance maximization must be constrained by the utmost importance of guaranteeing privacy of user data. Consumer protection regulations, such as GDPR, generally handle privacy by restricting data availability, such as requiring to limit user data to 'what is necessary' for a given purpose. In this work, we reason that providing stricter formal privacy guarantees, while increasing the volume of user data in the model, in most cases increases benefit for all parties involved, especially for the user. We demonstrate our arguments on two existing suicide risk assessment datasets of Twitter and Reddit posts. We present the first analysis juxtaposing user history length and differential privacy budgets and elaborate how modeling additional user context enables utility preservation while maintaining acceptable user privacy guarantees.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2023

Privacy Aware Question-Answering System for Online Mental Health Risk Assessment

Social media platforms have enabled individuals suffering from mental il...
research
12/05/2019

Element Level Differential Privacy: The Right Granularity of Privacy

Differential Privacy (DP) provides strong guarantees on the risk of comp...
research
11/01/2022

User-Entity Differential Privacy in Learning Natural Language Models

In this paper, we introduce a novel concept of user-entity differential ...
research
11/01/2017

Re-DPoctor: Real-time health data releasing with w-day differential privacy

Wearable devices enable users to collect health data and share them with...
research
06/26/2021

Benchmarking Differential Privacy and Federated Learning for BERT Models

Natural Language Processing (NLP) techniques can be applied to help with...
research
06/23/2022

ReuseKNN: Neighborhood Reuse for Privacy-Aware Recommendations

User-based KNN recommender systems (UserKNN) utilize the rating data of ...
research
02/23/2020

PrivGen: Preserving Privacy of Sequences Through Data Generation

Sequential data is everywhere, and it can serve as a basis for research ...

Please sign up or login with your details

Forgot password? Click here to reset