Are Automatic Methods for Cognate Detection Good Enough for Phylogenetic Reconstruction in Historical Linguistics?

04/15/2018
by   Taraka Rama, et al.
0

We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by comparing how useful automatically inferred cognates are for the task of phylogenetic inference compared to classical manually annotated cognate sets. Our findings suggest that phylogenies inferred from automated cognate sets come close to phylogenies inferred from expert-annotated ones, although on average, the latter are still superior. We conclude that future work on phylogenetic reconstruction can profit much from automatic cognate detection. Especially where scholars are merely interested in exploring the bigger picture of a language family's phylogeny, algorithms for automatic cognate detection are a useful complement for current research on language phylogenies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2022

Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain

Future work sentences (FWS) are the particular sentences in academic pap...
research
02/21/2017

Reinforcement Learning Based Argument Component Detection

Argument component detection (ACD) is an important sub-task in argumenta...
research
02/10/2016

Automatic Sarcasm Detection: A Survey

Automatic sarcasm detection is the task of predicting sarcasm in text. T...
research
05/21/2018

Computational Historical Linguistics

Computational approaches to historical linguistics have been proposed si...
research
03/31/2023

Trimming Phonetic Alignments Improves the Inference of Sound Correspondence Patterns from Multilingual Wordlists

Sound correspondence patterns form the basis of cognate detection and ph...
research
02/06/2017

Q-WordNet PPV: Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages

This paper presents a simple, robust and (almost) unsupervised dictionar...
research
06/02/2023

LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricS

In this paper, we present a new dataset and benchmark tailored to the ta...

Please sign up or login with your details

Forgot password? Click here to reset