Multi-Dialectal Representation Learning of Sinitic Phonology

06/30/2023
by   Zhibai Jia, et al.
0

Machine learning techniques have shown their competence for representing and reasoning in symbolic systems such as language and phonology. In Sinitic Historical Phonology, notable tasks that could benefit from machine learning include the comparison of dialects and reconstruction of proto-languages systems. Motivated by this, this paper provides an approach for obtaining multi-dialectal representations of Sinitic syllables, by constructing a knowledge graph from structured phonological data, then applying the BoxE technique from knowledge base learning. We applied unsupervised clustering techniques to the obtained representations to observe that the representations capture phonemic contrast from the input dialects. Furthermore, we trained classifiers to perform inference of unobserved Middle Chinese labels, showing the representations' potential for indicating archaic, proto-language features. The representations can be used for performing completion of fragmented Sinitic phonological knowledge bases, estimating divergences between different characters, or aiding the exploration and reconstruction of archaic features.

READ FULL TEXT

page 7

page 8

research
09/14/2017

KBLRN : End-to-End Learning of Knowledge Base Representations with Latent, Relational, and Numerical Features

We present KBLRN, a novel framework for end-to-end learning of knowledge...
research
08/21/2023

Deciphering Raw Data in Neuro-Symbolic Learning with Provable Guarantees

Neuro-symbolic hybrid systems are promising for integrating machine lear...
research
09/22/2016

Image-embodied Knowledge Representation Learning

Entity images could provide significant visual information for knowledge...
research
12/13/2019

From Shallow to Deep Interactions Between Knowledge Representation, Reasoning and Machine Learning (Kay R. Amel group)

This paper proposes a tentative and original survey of meeting points be...
research
06/29/2018

On embeddings as alternative paradigm for relational learning

Many real-world domains can be expressed as graphs and, more generally, ...
research
02/03/2015

Incremental Knowledge Base Construction Using DeepDive

Populating a database with unstructured information is a long-standing p...
research
07/04/2020

Nested Subspace Arrangement for Representation of Relational Data

Studies on acquiring appropriate continuous representations of discrete ...

Please sign up or login with your details

Forgot password? Click here to reset