KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application

05/28/2023
by   Hwaran Lee, et al.
0

Large language models (LLMs) learn not only natural text generation abilities but also social biases against different demographic groups from real-world data. This poses a critical risk when deploying LLM-based applications. Existing research and resources are not readily applicable in South Korea due to the differences in language and culture, both of which significantly affect the biases and targeted demographic groups. This limitation requires localized social bias datasets to ensure the safe and effective deployment of LLMs. To this end, we present KO SB I, a new social bias dataset of 34k pairs of contexts and sentences in Korean covering 72 demographic groups in 15 categories. We find that through filtering-based moderation, social biases in generated content can be reduced by 16.47 82B), and GPT-3.

READ FULL TEXT

page 6

page 15

research
05/01/2020

Towards Controllable Biases in Language Generation

We present a general approach towards controllable societal biases in na...
research
12/20/2022

Debiasing NLP Models Without Demographic Information

Models trained from real-world data tend to imitate and amplify social b...
research
11/10/2019

Social Bias Frames: Reasoning about Social and Power Implications of Language

Language has the power to reinforce stereotypes and project social biase...
research
10/17/2022

Prompting GPT-3 To Be Reliable

Large language models (LLMs) show impressive abilities via few-shot prom...
research
05/29/2023

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

To recognize and mitigate harms from large language models (LLMs), we ne...
research
10/13/2022

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models

A common limitation of diagnostic tests for detecting social biases in N...
research
05/06/2020

Fast Mapping onto Census Blocks

Pandemic measures such as social distancing and contact tracing can be e...

Please sign up or login with your details

Forgot password? Click here to reset