David Ifeoluwa Adelani

research

∙ 09/14/2023

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects

Despite the progress we have recorded in the last few years in multiling...

0 David Ifeoluwa Adelani, et al. ∙

research

∙ 08/18/2023

YORC: Yoruba Reading Comprehension dataset

In this paper, we create YORC: a new multi-choice Yoruba Reading Compreh...

0 Anuoluwapo Aremu, et al. ∙

research

∙ 07/29/2023

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus

We introduce the ÌròyìnSpeech corpus – a new dataset influenced by a des...

0 Tolulope Ogunremi, et al. ∙

research

∙ 07/03/2023

Improving Language Plasticity via Pretraining with Active Forgetting

Pretrained language models (PLMs) are today the primary model for natura...

0 Yihong Chen, et al. ∙

research

∙ 05/18/2023

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification

Africa has over 2000 indigenous languages but they are under-represented...

0 Iyanuoluwa Shode, et al. ∙

research

∙ 04/19/2023

MasakhaNEWS: News Topic Classification for African languages

African languages are severely under-represented in NLP research due to ...

6 David Ifeoluwa Adelani, et al. ∙

research

∙ 04/13/2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

We present the first Africentric SemEval Shared task, Sentiment Analysis...

5 Shamsuddeen Hassan Muhammad, et al. ∙

research

∙ 02/17/2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

Africa is home to over 2000 languages from over six language families an...

11 Shamsuddeen Hassan Muhammad, et al. ∙

research

∙ 12/19/2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

The BLOOM model is a large open-source multilingual language model capab...

16 Zheng Xin Yong, et al. ∙

research

∙ 07/07/2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

BibleTTS is a large, high-quality, open speech dataset for ten languages...

2 Josh Meyer, et al. ∙

research

∙ 06/15/2022

TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models

Transferring knowledge from one domain to another is of practical import...

0 Ali Davody, et al. ∙

research

∙ 06/03/2022

Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages

For high-resource languages like English, text classification is a well-...

0 Dawei Zhu, et al. ∙

research

∙ 04/22/2022

MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Learning semantically meaningful sentence embeddings is an open problem ...

0 Miaoran Zhang, et al. ∙

research

∙ 04/20/2022

yosm: A new yoruba sentiment corpus for movie reviews

A movie that is thoroughly enjoyed and recommended by an individual migh...

0 Iyanuoluwa Shode, et al. ∙

research

∙ 04/20/2022

Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

Incorrect labels in training data occur when human annotators make mista...

0 Dawei Zhu, et al. ∙

research

∙ 04/13/2022

Multilingual Language Model Adaptive Fine-Tuning: A Study on African Languages

Multilingual pre-trained language models (PLMs) have demonstrated impres...

0 Jesujoba O. Alabi, et al. ∙

research

∙ 03/16/2022

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

What can pre-trained multilingual sequence-to-sequence models like mBART...

0 En-Shiun Annie Lee, et al. ∙

research

∙ 01/20/2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis

Sentiment analysis is one of the most widely studied applications in NLP...

11 Shamsuddeen Hassan Muhammad, et al. ∙

research

∙ 09/19/2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

Documents as short as a single sentence may inadvertently reveal sensiti...

0 David Ifeoluwa Adelani, et al. ∙

research

∙ 03/22/2021

MasakhaNER: Named Entity Recognition for African Languages

We take a step towards addressing the under-representation of the Africa...

5 David Ifeoluwa Adelani, et al. ∙

research

∙ 08/07/2020

Privacy Guarantees for De-identifying Text Transformations

Machine Learning approaches to Natural Language Processing tasks benefit...

0 David Ifeoluwa Adelani, et al. ∙

research

∙ 06/19/2020

Robust Differentially Private Training of Deep Neural Networks

Differentially private stochastic gradient descent (DPSGD) is a variatio...

0 Ali Davody, et al. ∙

research

∙ 03/18/2020

Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá

The lack of labeled training data has limited the development of natural...

0 David Ifeoluwa Adelani, et al. ∙

research

∙ 03/18/2020

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training

West African Pidgin English is a language that is significantly spoken i...

0 Ernie Chang, et al. ∙

research

∙ 07/22/2019

Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection

Advanced neural language models (NLMs) are widely used in sequence gener...

0 David Ifeoluwa Adelani, et al. ∙

David Ifeoluwa Adelani

Featured Co-authors

Sign in with Google

Consider DeepAI Pro