Multi-task dialog act and sentiment recognition on Mastodon

07/13/2018
by Christophe Cerisara, et al.

Because of license restrictions, it often becomes impossible to strictly reproduce research results on Twitter data only a few months after a corpus is created. The situation worsens as time passes and tweets become inaccessible. This is a critical issue for reproducible and accountable research on social media. We partly address this challenge by annotating a new Twitter-like corpus from an alternative large social medium whose licenses are compatible with reproducible experiments: Mastodon. We manually annotate both dialog acts and sentiments on this corpus, and train a multi-task hierarchical recurrent network for joint sentiment and dialog act recognition. We experimentally demonstrate that transfer learning can be achieved efficiently between the two tasks, and further analyze some specific correlations between sentiments and dialog acts on social media. Both the annotated corpus and the deep network are released under an open-source license.
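
The abstract does not give architectural details, so the following is only a minimal sketch of what a multi-task hierarchical recurrent network for this setting might look like, not the authors' implementation. It assumes a word-level recurrent encoder that turns each post into a vector, a dialogue-level recurrent layer over those post vectors, and two linear heads (dialog act and sentiment) that share the dialogue-level state. The plain Elman recurrence, all layer sizes, the label counts, the toy token ids, and the unweighted sum of the two cross-entropy losses are assumptions made for illustration; no training step is shown.

```python
import math
import random

random.seed(0)

# Hypothetical sizes, chosen only for illustration.
VOCAB, EMB, WORD_H, DIAL_H = 10, 4, 5, 6
N_ACTS, N_SENT = 3, 3


def rand_matrix(rows, cols):
    """Small random weights, so initial predictions are near uniform."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rnn(inputs, w_in, w_rec, hidden_size):
    """Elman recurrence h_t = tanh(W_in x_t + W_rec h_{t-1}); returns all states."""
    h = [0.0] * hidden_size
    states = []
    for x in inputs:
        h = [math.tanh(a + b) for a, b in zip(matvec(w_in, x), matvec(w_rec, h))]
        states.append(h)
    return states


def cross_entropy(logits, gold):
    """Negative log-probability of the gold class under a softmax."""
    top = max(logits)
    log_norm = top + math.log(sum(math.exp(l - top) for l in logits))
    return log_norm - logits[gold]


params = {
    "emb": rand_matrix(VOCAB, EMB),
    "word_in": rand_matrix(WORD_H, EMB),
    "word_rec": rand_matrix(WORD_H, WORD_H),
    "dial_in": rand_matrix(DIAL_H, WORD_H),
    "dial_rec": rand_matrix(DIAL_H, DIAL_H),
    "act_head": rand_matrix(N_ACTS, DIAL_H),    # task 1: dialog act
    "sent_head": rand_matrix(N_SENT, DIAL_H),   # task 2: sentiment
}


def forward(dialogue, p):
    """dialogue: list of posts, each a non-empty list of token ids."""
    # Level 1: encode each post as the last state of the word-level RNN.
    post_vectors = [
        rnn([p["emb"][tok] for tok in post], p["word_in"], p["word_rec"], WORD_H)[-1]
        for post in dialogue
    ]
    # Level 2: contextualize posts within the dialogue.
    contexts = rnn(post_vectors, p["dial_in"], p["dial_rec"], DIAL_H)
    # Both heads read the same shared state, one prediction pair per post.
    return [(matvec(p["act_head"], c), matvec(p["sent_head"], c)) for c in contexts]


def joint_loss(outputs, act_labels, sent_labels):
    """Unweighted sum of the two task losses, averaged over posts (an assumption)."""
    total = 0.0
    for (act_logits, sent_logits), act, sent in zip(outputs, act_labels, sent_labels):
        total += cross_entropy(act_logits, act) + cross_entropy(sent_logits, sent)
    return total / len(outputs)


# Toy dialogue of three posts with made-up token ids and labels.
dialogue = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
outputs = forward(dialogue, params)
loss = joint_loss(outputs, act_labels=[0, 1, 2], sent_labels=[1, 1, 0])
```

Because both heads are computed from the same dialogue-level state, gradients from either task would update the shared encoders during training, which is the usual mechanism by which one task can benefit the other in a multi-task setup.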

