Predicting cross-linguistic adjective order with information gain

12/30/2020
by   William Dyer, et al.
0

Languages vary in their placement of multiple adjectives before, after, or surrounding the noun, but they typically exhibit strong intra-language tendencies on the relative order of those adjectives (e.g., the preference for `big blue box' in English, `grande boîte bleue' in French, and `alsundūq al'azraq alkabīr' in Arabic). We advance a new quantitative account of adjective order across typologically-distinct languages based on maximizing information gain. Our model addresses the left-right asymmetry of French-type ANA sequences with the same approach as AAN and NAA orderings, without appeal to other mechanisms. We find that, across 32 languages, the preferred order of adjectives largely mirrors an efficient algorithm of maximizing information gain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2022

Offensive Language Detection in Under-resourced Algerian Dialectal Arabic Language

This paper addresses the problem of detecting the offensive and abusive ...
research
04/03/2023

PALI: A Language Identification Benchmark for Perso-Arabic Scripts

The Perso-Arabic scripts are a family of scripts that are widely adopted...
research
09/13/2021

On Language Models for Creoles

Creole languages such as Nigerian Pidgin English and Haitian Creole are ...
research
03/20/2015

On measuring linguistic intelligence

This work addresses the problem of measuring how many languages a person...
research
03/15/2019

Studying the Inductive Biases of RNNs with Synthetic Variations of Natural Languages

How do typological properties such as word order and morphological case ...
research
06/09/2023

Progress on Constructing Phylogenetic Networks for Languages

In 2006, Warnow, Evans, Ringe, and Nakhleh proposed a stochastic model (...

Please sign up or login with your details

Forgot password? Click here to reset