StereoKG: Data-Driven Knowledge Graph Construction for Cultural Knowledge and Stereotypes

05/27/2022
by   Awantee Deshpande, et al.
0

Analyzing ethnic or religious bias is important for improving fairness, accountability, and transparency of natural language processing models. However, many techniques rely on human-compiled lists of bias terms, which are expensive to create and are limited in coverage. In this study, we present a fully data-driven pipeline for generating a knowledge graph (KG) of cultural knowledge and stereotypes. Our resulting KG covers 5 religious groups and 5 nationalities and can easily be extended to include more entities. Our human evaluation shows that the majority (59.2 coherent and complete stereotypes. We further show that performing intermediate masked language model training on the verbalized KG leads to a higher level of cultural awareness in the model and has the potential to increase classification performance on knowledge-crucial samples on a related task, i.e., hate speech detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2019

ArCo: the Italian Cultural Heritage Knowledge Graph

ArCo is the Italian Cultural Heritage knowledge graph, consisting of a n...
research
09/08/2022

Geolocation of Cultural Heritage using Multi-View Knowledge Graph Embedding

Knowledge Graphs (KGs) have proven to be a reliable way of structuring d...
research
03/17/2022

Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists

Natural Language Processing (NLP) models risk overfitting to specific te...
research
11/03/2021

Marriage is a Peach and a Chalice: Modelling Cultural Symbolism on the SemanticWeb

In this work, we fill the gap in the Semantic Web in the context of Cult...
research
05/25/2021

GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph

The rise of digitization of cultural documents offers large-scale conten...
research
10/14/2022

Extracting Cultural Commonsense Knowledge at Scale

Structured knowledge is important for many AI applications. Commonsense ...
research
07/09/2019

Systematic quantitative analyses reveal the folk-zoological knowledge embedded in folktales

Cultural learning is a unique human capacity essential for a wide range ...

Please sign up or login with your details

Forgot password? Click here to reset