Extracting Cultural Commonsense Knowledge at Scale

10/14/2022
by   Tuan-Phong Nguyen, et al.
0

Structured knowledge is important for many AI applications. Commonsense knowledge, which is crucial for robust human-centric AI, is covered by a small number of structured knowledge projects. However, they lack knowledge about human traits and behaviors conditioned on socio-cultural contexts, which is crucial for situative AI. This paper presents CANDLE, an end-to-end methodology for extracting high-quality cultural commonsense knowledge (CCSK) at scale. CANDLE extracts CCSK assertions from a huge web corpus and organizes them into coherent clusters, for 3 domains of subjects (geography, religion, occupation) and several cultural facets (food, drinks, clothing, traditions, rituals, behaviors). CANDLE includes judicious techniques for classification-based filtering and scoring of interestingness. Experimental evaluations show the superiority of the CANDLE CCSK collection over prior works, and an extrinsic use case demonstrates the benefits of CCSK for the GPT-3 language model. Code and data can be accessed at https://cultural-csk.herokuapp.com/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2020

An Atlas of Cultural Commonsense for Machine Reasoning

Existing commonsense reasoning datasets for AI and NLP tasks fail to add...
research
05/27/2019

Commonsense Properties from Query Logs and Question Anwering Forums

Commonsense knowledge about object properties, human behavior and genera...
research
05/27/2019

Commonsense Properties from Query Logs and Question Answering Forums

Commonsense knowledge about object properties, human behavior and genera...
research
11/30/2021

Refined Commonsense Knowledge from Large-Scale Web Contents

Commonsense knowledge (CSK) about concepts and their properties is usefu...
research
05/27/2022

StereoKG: Data-Driven Knowledge Graph Construction for Cultural Knowledge and Stereotypes

Analyzing ethnic or religious bias is important for improving fairness, ...
research
11/02/2020

Advanced Semantics for Commonsense Knowledge Extraction

Commonsense knowledge (CSK) about concepts and their properties is usefu...
research
03/01/2023

That's All Folks: a KG of Values as Commonsense Social Norms and Behaviors

Values, as intended in ethics, determine the shape and validity of moral...

Please sign up or login with your details

Forgot password? Click here to reset