Supporting Human-AI Collaboration in Auditing LLMs with LLMs

04/19/2023
by   Charvi Rastogi, et al.
0

Large language models are becoming increasingly pervasive and ubiquitous in society via deployment in sociotechnical systems. Yet these language models, be it for classification or generation, have been shown to be biased and behave irresponsibly, causing harm to people at scale. It is crucial to audit these language models rigorously. Existing auditing tools leverage either or both humans and AI to find failures. In this work, we draw upon literature in human-AI collaboration and sensemaking, and conduct interviews with research experts in safe and fair AI, to build upon the auditing tool: AdaTest (Ribeiro and Lundberg, 2022), which is powered by a generative large language model (LLM). Through the design process we highlight the importance of sensemaking and human-AI communication to leverage complementary strengths of humans and generative models in collaborative auditing. To evaluate the effectiveness of the augmented tool, AdaTest++, we conduct user studies with participants auditing two commercial language models: OpenAI's GPT-3 and Azure's sentiment analysis model. Qualitative analysis shows that AdaTest++ effectively leverages human strengths such as schematization, hypothesis formation and testing. Further, with our tool, participants identified a variety of failures modes, covering 26 different topics over 2 tasks, that have been shown before in formal audits and also those previously under-reported.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2023

"It Felt Like Having a Second Mind": Investigating Human-AI Co-creativity in Prewriting with Large Language Models

Prewriting is the process of discovering and developing ideas before a f...
research
03/31/2023

Augmented Collective Intelligence in Collaborative Ideation: Agenda and Challenges

AI systems may be better thought of as peers than as tools. This paper e...
research
06/21/2023

LLM-based Smart Reply (LSR): Enhancing Collaborative Performance with ChatGPT-mediated Smart Reply System

CSCW studies have increasingly explored AI's role in enhancing communica...
research
06/23/2023

Exploring Qualitative Research Using LLMs

The advent of AI driven large language models (LLMs) have stirred discus...
research
11/04/2022

Measuring Progress on Scalable Oversight for Large Language Models

Developing safe and useful general-purpose AI systems will require us to...
research
03/06/2023

Choice Over Control: How Users Write with Large Language Models using Diegetic and Non-Diegetic Prompting

We propose a conceptual perspective on prompts for Large Language Models...
research
09/14/2022

Out of One, Many: Using Language Models to Simulate Human Samples

We propose and explore the possibility that language models can be studi...

Please sign up or login with your details

Forgot password? Click here to reset