Language Models as an Alternative Evaluator of Word Order Hypotheses: A Case Study in Japanese

05/02/2020
by   Tatsuki Kuribayashi, et al.
0

We examine a methodology using neural language models (LMs) for analyzing the word order of language. This LM-based method has the potential to overcome the difficulties existing methods face, such as the propagation of preprocessor errors in count-based methods. In this study, we explore whether the LM-based method is valid for analyzing the word order. As a case study, this study focuses on Japanese due to its complex and flexible word order. To validate the LM-based method, we test (i) parallels between LMs and human word order preference, and (ii) consistency of the results obtained using the LM-based method with previous linguistic studies. Through our experiments, we tentatively conclude that LMs display sufficient word order knowledge for usage as an analysis tool. Finally, using the LM-based method, we demonstrate the relationship between the canonical word order and topicalization, which had yet to be analyzed by large-scale experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2020

Word embedding and neural network on grammatical gender – A case study of Swedish

We analyze the information provided by the word embeddings about the gra...
research
07/20/2022

Integrating Linguistic Theory and Neural Language Models

Transformer-based language models have recently achieved remarkable resu...
research
02/14/2023

Exploring Category Structure with Contextual Language Models and Lexical Semantic Networks

Recent work on predicting category structure with distributional models,...
research
08/16/2018

Predicting Human Trustfulness from Facebook Language

Trustfulness -- one's general tendency to have confidence in unknown peo...
research
02/27/2019

Evaluation of a length-based method to estimate discard rate and the effect of sampling size

The common fisheries policy aims at eliminating discarding which has bee...
research
04/15/2022

On the Role of Pre-trained Language Models in Word Ordering: A Case Study with BART

Word ordering is a constrained language generation task taking unordered...
research
09/19/2023

Evaluating large language models' ability to understand metaphor and sarcasm using a screening test for Asperger syndrome

Metaphors and sarcasm are precious fruits of our highly-evolved social c...

Please sign up or login with your details

Forgot password? Click here to reset