Predicting Typological Features in WALS using Language Embeddings and Conditional Probabilities: ÚFAL Submission to the SIGTYP 2020 Shared Task

10/08/2020
by   Martin Vastl, et al.
0

We present our submission to the SIGTYP 2020 Shared Task on the prediction of typological features. We submit a constrained system, predicting typological features only based on the WALS database. We investigate two approaches. The simpler of the two is a system based on estimating correlation of feature values within languages by computing conditional probabilities and mutual information. The second approach is to train a neural predictor operating on precomputed language embeddings based on WALS features. Our submitted system combines the two approaches based on their self-estimated confidence scores. We reach the accuracy of 70.7

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2020

NEMO: Frequentist Inference Approach to Constrained Linguistic Typology Feature Prediction in SIGTYP 2020 Shared Task

This paper describes the NEMO submission to SIGTYP 2020 shared task whic...
research
10/16/2020

SIGTYP 2020 Shared Task: Prediction of Typological Features

Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 20...
research
06/12/2020

On Neural Estimators for Conditional Mutual Information Using Nearest Neighbors Sampling

The estimation of mutual information (MI) or conditional mutual informat...
research
01/10/2021

Cisco at AAAI-CAD21 shared task: Predicting Emphasis in Presentation Slides using Contextualised Embeddings

This paper describes our proposed system for the AAAI-CAD21 shared task:...
research
04/30/2020

Linguistic Typology Features from Text: Inferring the Sparse Features of World Atlas of Language Structures

The use of linguistic typological resources in natural language processi...
research
08/09/2014

Characterizing predictable classes of processes

The problem is sequence prediction in the following setting. A sequence ...
research
06/05/2020

UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings

We present our contribution to the EvaLatin shared task, which is the fi...

Please sign up or login with your details

Forgot password? Click here to reset