ABI Neural Ensemble Model for Gender Prediction Adapt Bar-Ilan Submission for the CLIN29 Shared Task on Gender Prediction

02/23/2019
by   Eva Vanmassenhove, et al.
0

We present our system for the CLIN29 shared task on cross-genre gender detection for Dutch. We experimented with a multitude of neural models (CNN, RNN, LSTM, etc.), more "traditional" models (SVM, RF, LogReg, etc.), different feature sets as well as data pre-processing. The final results suggested that using tokenized, non-lowercased data works best for most of the neural models, while a combination of word clusters, character trigrams and word lists showed to be most beneficial for the majority of the more "traditional" (that is, non-neural) models, beating features used in previous tasks such as n-grams, character n-grams, part-of-speech tags and combinations thereof. In contradiction with the results described in previous comparable shared tasks, our neural models performed better than our best traditional approaches with our best feature set-up. Our final model consisted of a weighted ensemble model combining the top 25 models. Our final model won both the in-domain gender prediction task and the cross-genre challenge, achieving an average accuracy of 64.93 gender prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2017

N-GrAM: New Groningen Author-profiling Model

We describe our participation in the PAN 2017 shared task on Author Prof...
research
06/19/2023

Grammatical gender in Swedish is predictable using recurrent neural networks

The grammatical gender of Swedish nouns is a mystery. While there are fe...
research
04/24/2017

Fast and Accurate Neural Word Segmentation for Chinese

Neural models with minimal feature engineering have achieved competitive...
research
04/06/2018

Neural models of factuality

We present two neural models for event factuality prediction, which yiel...
research
07/06/2019

Exploring difference in public perceptions on HPV vaccine between gender groups from Twitter using deep learning

In this study, we proposed a convolutional neural network model for gend...
research
09/13/2020

Combining Word and Character Vector Representation on Neural Machine Translation

This paper describes combinations of word vector representation and char...
research
05/09/2017

Phonetic Temporal Neural Model for Language Identification

Deep neural models, particularly the LSTM-RNN model, have shown great po...

Please sign up or login with your details

Forgot password? Click here to reset