Sentence-Level BERT and Multi-Task Learning of Age and Gender in Social Media

11/02/2019
by   Muhammad Abdul-Mageed, et al.

Social media currently provide a window on our lives, making it possible to learn how people from different places, with different backgrounds, ages, and genders use language. In this work we exploit a newly created Arabic dataset with ground-truth age and gender labels to learn these attributes both individually and in a multi-task setting at the sentence level. Our models are based on variations of deep bidirectional neural networks. More specifically, we build models with gated recurrent units and bidirectional encoder representations from transformers (BERT). We show the utility of multi-task learning (MTL) on the two tasks and identify task-specific attention as a superior choice in this context. We also find that a single-task BERT model outperforms our best MTL models on the two tasks. We report tweet-level accuracy of 51.43 both of which outperform our baselines by a large margin. Our models are language-agnostic, and so can be applied to other languages.
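
The idea of task-specific attention in a multi-task setting can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the encoder outputs are hard-coded toy vectors standing in for GRU or BERT states, and the dot-product attention form, the unweighted sum of the two losses, the class counts (three age groups, two genders), and all names (`attention_pool`, `mtl_loss`, `params`) are assumptions made for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(hidden_states, query):
    """Pool token vectors using weights softmax(dot(h, query))."""
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    pooled = [sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)]
    return pooled, weights

def linear(vec, weight_rows):
    """One logit per row of weight_rows (no bias, for brevity)."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weight_rows]

def cross_entropy(logits, gold):
    """Negative log-probability of the gold class."""
    return -math.log(softmax(logits)[gold])

def mtl_loss(hidden_states, params, age_label, gender_label):
    """Shared encoder states, one attention query and classifier per task."""
    age_vec, _ = attention_pool(hidden_states, params["age_query"])
    gender_vec, _ = attention_pool(hidden_states, params["gender_query"])
    age_loss = cross_entropy(linear(age_vec, params["age_W"]), age_label)
    gender_loss = cross_entropy(linear(gender_vec, params["gender_W"]), gender_label)
    # Assumption: the two task losses are simply added with equal weight.
    return age_loss + gender_loss

# Toy shared encoder output: 3 tokens, 2 hidden dimensions (hypothetical values).
hidden_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Hypothetical untrained parameters: 3 age classes, 2 gender classes.
params = {
    "age_query": [1.0, 0.0],
    "gender_query": [0.0, 1.0],
    "age_W": [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]],
    "gender_W": [[0.0, 0.0], [0.0, 0.0]],
}

_, age_weights = attention_pool(hidden_states, params["age_query"])
_, gender_weights = attention_pool(hidden_states, params["gender_query"])
loss = mtl_loss(hidden_states, params, age_label=0, gender_label=1)
print(age_weights, gender_weights, loss)
```

Because each task has its own attention query, the two tasks can weight the same tokens differently while still sharing the encoder; in a real model the queries and classifier weights would be learned jointly by minimizing the combined loss.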

research
09/09/2019

BERT-Based Arabic Social Media Author Profiling

We report our models for detecting age, language variety, and gender fro...
research
06/23/2021

BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification

Dialect and standard language identification are crucial tasks for many ...
research
05/30/2023

Multitask learning for recognizing stress and depression in social media

Stress and depression are prevalent nowadays across people of all ages d...
research
12/28/2020

A Paragraph-level Multi-task Learning Model for Scientific Fact-Verification

Even for domain experts, it is a non-trivial task to verify a scientific...
research
10/31/2019

DiaNet: BERT and Hierarchical Attention Multi-Task Learning of Fine-Grained Dialect

Prediction of language varieties and dialects is an important language p...
research
12/30/2019

AraNet: A Deep Learning Toolkit for Arabic Social Media

We describe AraNet, a collection of deep learning Arabic social media pr...