Fixing the Infix: Unsupervised Discovery of Root-and-Pattern Morphology

02/07/2017
by Tarek Sakakini, et al.

We present an unsupervised, language-agnostic method for learning root-and-pattern morphology. This form of morphology, abundant in Semitic languages, has not been handled by prior unsupervised approaches. We harness the syntactico-semantic information in distributed word representations to address the long-standing problem of root-and-pattern discovery. Moreover, we construct an unsupervised root extractor based on the learned rules. We demonstrate the validity of the learned rules across Arabic, Hebrew, and Amharic, and show that our root extractor compares favorably with ISRI, a widely used, carefully engineered root extractor.
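
As a rough illustration of what root-and-pattern morphology means, the toy sketch below interleaves a consonantal root with a vocalic pattern and inverts the process to recover the root. It is not the method of the paper: the pattern notation (digits marking root slots), the function names `apply_pattern` and `extract_root`, and the use of transliterated Arabic forms are all assumptions made for this example, and the patterns here are written by hand rather than learned from word representations.

```python
from typing import Optional


def apply_pattern(root: str, pattern: str) -> str:
    """Fill the numbered slots of a pattern with the root's consonants.

    Hypothetical notation: the digit k stands for the k-th root consonant,
    and every other character is copied literally.
    """
    return "".join(root[int(ch) - 1] if ch.isdigit() else ch for ch in pattern)


def extract_root(word: str, pattern: str) -> Optional[str]:
    """Recover the root from a word if it matches the pattern, else None."""
    if len(word) != len(pattern):
        return None
    slots = {}
    for w, p in zip(word, pattern):
        if p.isdigit():
            idx = int(p)
            # A slot that appears twice must hold the same consonant.
            if slots.get(idx, w) != w:
                return None
            slots[idx] = w
        elif w != p:
            # Literal pattern characters must match the word exactly.
            return None
    return "".join(slots[i] for i in sorted(slots))


# Transliterated Arabic root k-t-b ("write") under three hand-written patterns.
forms = [apply_pattern("ktb", p) for p in ("1a2a3", "1aa2i3", "ma12uu3")]
print(forms)
print(extract_root("maktuub", "ma12uu3"))
```

Because the root consonants are interleaved with the pattern rather than attached at a word edge, a prefix or suffix stripper alone cannot recover them, which is the difficulty the abstract refers to.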


