Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts

09/01/2018
by   Samuel Carton, et al.
0

We introduce an adversarial method for producing high-recall explanations of neural text classifier decisions. Building on an existing architecture for extractive explanations via hard attention, we add an adversarial layer which scans the residual of the attention for remaining predictive signal. Motivated by the important domain of detecting personal attacks in social media comments, we additionally demonstrate the importance of manually setting a semantically appropriate `default' behavior for the model by explicitly manipulating its bias term. We develop a validation set of human-annotated personal attacks to evaluate the impact of these changes.

READ FULL TEXT
research
09/01/2016

Identifying Dogmatism in Social Media: Signals and Models

We explore linguistic and behavioral features of dogmatism in social med...
research
05/04/2020

Ten Questions in Lifelog Mining and Information Recall

With the advance of science and technology, people are used to record th...
research
05/30/2023

An Annotated Dataset for Explainable Interpersonal Risk Factors of Mental Disturbance in Social Media Posts

With a surge in identifying suicidal risk and its severity in social med...
research
12/16/2020

Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media

A large number of individuals are suffering from suicidal ideation in th...
research
09/18/2023

How to Generate Popular Post Headlines on Social Media?

Posts, as important containers of user-generated-content pieces on socia...
research
08/28/2019

Emotion Detection with Neural Personal Discrimination

There have been a recent line of works to automatically predict the emot...
research
11/18/2022

Towards Explaining Subjective Ground of Individuals on Social Media

Large-scale language models have been reducing the gap between machines ...

Please sign up or login with your details

Forgot password? Click here to reset