GenAIPABench: A Benchmark for Generative AI-based Privacy Assistants

09/10/2023
by   Aamir Hamid, et al.
0

Privacy policies inform users about the data management practices of organizations. Yet, their complexity often renders them largely incomprehensible to the average user, necessitating the development of privacy assistants. With the advent of generative AI (genAI) technologies, there is an untapped potential to enhance privacy assistants in answering user queries effectively. However, the reliability of genAI remains a concern due to its propensity for generating incorrect or misleading information. This study introduces GenAIPABench, a novel benchmarking framework designed to evaluate the performance of Generative AI-based Privacy Assistants (GenAIPAs). GenAIPABench comprises: 1) A comprehensive set of questions about an organization's privacy policy and a data protection regulation, along with annotated answers for several organizations and regulations; 2) A robust set of evaluation metrics for assessing the accuracy, relevance, and consistency of the generated responses; and 3) An evaluation tool that generates appropriate prompts to introduce the system to the privacy document and different variations of the privacy questions to evaluate its robustness. We use GenAIPABench to assess the potential of three leading genAI systems in becoming GenAIPAs: ChatGPT, Bard, and Bing AI. Our results demonstrate significant promise in genAI capabilities in the privacy domain while also highlighting challenges in managing complex queries, ensuring consistency, and verifying source accuracy.

READ FULL TEXT

page 8

page 10

page 11

page 16

page 17

page 18

page 19

page 20

research
10/11/2017

Understanding Organizational Approach towards End User Privacy

End user privacy is a critical concern for all organizations that collec...
research
06/16/2023

Data Protection for Data Privacy-A South African Problem?

This study proposes a comprehensive framework for enhancing data securit...
research
07/06/2023

VerifAI: Verified Generative AI

Generative AI has made significant strides, yet concerns about the accur...
research
01/25/2022

The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization

We present a novel benchmark and associated evaluation metrics for asses...
research
04/05/2023

The Saudi Privacy Policy Dataset

This paper introduces the Saudi Privacy Policy Dataset, a diverse compil...
research
03/15/2023

Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases

The development of privacy-enhancing technologies has made immense progr...
research
02/05/2019

PUTWorkbench: Analysing Privacy in AI-intensive Systems

AI intensive systems that operate upon user data face the challenge of b...

Please sign up or login with your details

Forgot password? Click here to reset