Log In Sign Up

Neural Coreference Resolution for Arabic

by   Abdulrahman Aloraini, et al.

No neural coreference resolver for Arabic exists, in fact we are not aware of any learning-based coreference resolver for Arabic since (Bjorkelund and Kuhn, 2014). In this paper, we introduce a coreference resolution system for Arabic based on Lee et al's end to end architecture combined with the Arabic version of bert and an external mention detector. As far as we know, this is the first neural coreference resolution system aimed specifically to Arabic, and it substantially outperforms the existing state of the art on OntoNotes 5.0 with a gain of 15.2 points conll F1. We also discuss the current limitations of the task for Arabic and possible approaches that can tackle these challenges.


page 1

page 2

page 3

page 4


Transliterating Kurdish texts in Latin into Persian-Arabic script

Kurdish is written in different scripts. The two most popular scripts ar...

Joint Coreference Resolution for Zeros and non-Zeros in Arabic

Most existing proposals about anaphoric zero pronoun (AZP) resolution re...

EdgeNet: A novel approach for Arabic numeral classification

Despite the importance of handwritten numeral classification, a robust a...

End-to-end Neural Coreference Resolution

We introduce the first end-to-end coreference resolution model and show ...

Problem of Multiple Diacritics Design for Arabic Script

This study focuses on the design of multiple Arabic diacritical marks an...

Build Fast and Accurate Lemmatization for Arabic

In this paper we describe the complexity of building a lemmatizer for Ar...

Improving Arabic Diacritization by Learning to Diacritize and Translate

We propose a novel multitask learning method for diacritization which tr...

Code Repositories