Anonymized BERT: An Augmentation Approach to the Gendered Pronoun Resolution Challenge

05/06/2019
by Bo Liu, et al.

We present our 7th place solution to the Gendered Pronoun Resolution challenge, which uses BERT without fine-tuning and a novel augmentation strategy designed for contextual embedding token-level tasks. Our method anonymizes the referent by replacing candidate names with a set of common placeholder names. Besides the usual benefit of effectively increasing the training data size, this approach diversifies the idiosyncratic information embedded in names. Using the same set of common first names can also help the model recognize names better, shorten token length, and remove gender and regional biases associated with names. The system scored 0.1947 log loss in stage 2, where the augmentation contributed an improvement of 0.04. Post-competition analysis shows that, when using different embedding layers, the system scores 0.1799, which would have placed third.
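The name-replacement idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the placeholder pool, the function names `anonymize` and `augment`, and the collision check are all assumptions made for illustration, and the abstract does not say which names were used or how they were paired. Because the task is token-level, a real pipeline would also need to recompute the character offsets of the pronoun and both candidates after substitution, which this sketch omits.

```python
import re

# Hypothetical placeholder pool; the actual names used in the paper are not
# given in the abstract.
PLACEHOLDER_PAIRS = [
    ("Alice", "Kate"),
    ("Mary", "Emma"),
    ("John", "Michael"),
    ("James", "Henry"),
]


def anonymize(text, name_a, name_b, placeholders):
    """Replace every whole-word mention of the two candidate names."""
    new_a, new_b = placeholders
    mapping = {name_a: new_a, name_b: new_b}
    # Try longer names first so a full name wins over a shorter name it contains.
    names = sorted(mapping, key=len, reverse=True)
    pattern = re.compile("|".join(r"\b" + re.escape(n) + r"\b" for n in names))
    return pattern.sub(lambda m: mapping[m.group(0)], text)


def augment(text, name_a, name_b, pairs=PLACEHOLDER_PAIRS):
    """Return one anonymized copy of the text per usable placeholder pair."""
    copies = []
    for pair in pairs:
        # Skip a pair if either placeholder already occurs in the text,
        # since the substitution would then create an ambiguous mention.
        if any(re.search(r"\b" + re.escape(p) + r"\b", text) for p in pair):
            continue
        copies.append(anonymize(text, name_a, name_b, pair))
    return copies
```

For example, calling `anonymize("Cheryl Cassidy met Pauline, and she thanked Cheryl Cassidy.", "Cheryl Cassidy", "Pauline", ("Alice", "Kate"))` yields `"Alice met Kate, and she thanked Alice."`. Each original example then produces several training copies that differ only in the names, which is what enlarges the training set and shortens multi-token names.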
