A Human-AI Loop Approach for Joint Keyword Discovery and Expectation Estimation in Micropost Event Detection

12/02/2019
by Akansha Bhardwaj, et al.

Microblogging platforms such as Twitter are increasingly being used for event detection. Existing approaches mainly use machine learning models and rely on event-related keywords to collect the data for model training. These approaches make strong assumptions about the distribution of the relevant microposts containing the keyword (referred to as the expectation of the distribution) and use it as a posterior regularization parameter during model training. Such approaches are, however, limited because they fail to reliably estimate the informativeness of a keyword and its expectation for model training. This paper introduces a Human-AI loop approach to jointly discover informative keywords for model training while estimating their expectation. Our approach iteratively leverages the crowd to estimate both the keyword-specific expectation and the disagreement between the crowd and the model, in order to discover new keywords that are most beneficial for model training. These keywords and their expectations not only improve the resulting performance but also make the model training process more transparent. We empirically demonstrate the merits of our approach, both in terms of accuracy and interpretability, on multiple real-world datasets and show that our approach improves the state of the art by 24.3
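
To make the two crowd-derived quantities in the abstract concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's method: the abstract does not say how expectation or disagreement is computed. The sketch assumes the expectation is the fraction of sampled posts that crowd workers mark as event-related, and that disagreement is the absolute gap between that fraction and the model's mean predicted probability on the same posts. The function names, the example keywords ("hacked", "breach"), and all numbers are hypothetical.

```python
from statistics import mean


def crowd_expectation(crowd_labels):
    """Fraction of sampled posts the crowd marked event-related (1) vs. not (0).

    Assumption: this fraction stands in for the keyword's 'expectation'.
    """
    return mean(crowd_labels)


def model_expectation(model_probs):
    """Mean positive-class probability the model assigns to the same posts."""
    return mean(model_probs)


def rank_keywords_by_disagreement(samples):
    """Rank candidate keywords by crowd-model disagreement, largest first.

    samples maps keyword -> (crowd_labels, model_probs) for posts sampled
    with that keyword. Disagreement is assumed to be the absolute difference
    between the crowd and model expectations.
    """
    scored = []
    for keyword, (labels, probs) in samples.items():
        expectation = crowd_expectation(labels)
        disagreement = abs(expectation - model_expectation(probs))
        scored.append((keyword, expectation, disagreement))
    return sorted(scored, key=lambda item: item[2], reverse=True)


# Hypothetical crowd labels and model probabilities for two candidate keywords.
samples = {
    "hacked": ([1, 1, 0, 1], [0.9, 0.8, 0.2, 0.7]),
    "breach": ([1, 0, 0, 0], [0.9, 0.8, 0.7, 0.6]),
}
ranking = rank_keywords_by_disagreement(samples)
for keyword, expectation, disagreement in ranking:
    print(f"{keyword}: expectation={expectation:.2f}, disagreement={disagreement:.2f}")
```

In this toy data the model is overconfident on posts containing "breach", so that keyword ranks first. In a loop like the one the abstract describes, a keyword where crowd and model diverge would be a natural candidate to add to training, with its crowd-estimated expectation used as the regularization target.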


