Back to Patterns: Efficient Japanese Morphological Analysis with Feature-Sequence Trie

05/30/2023
by Naoki Yoshinaga, et al.

Accurate neural models are much less efficient than non-neural models and are useless for processing billions of social media posts or handling user queries in real time with a limited budget. This study revisits the fastest pattern-based NLP methods to make them as accurate as possible, thus yielding a strikingly simple yet surprisingly accurate morphological analyzer for Japanese. The proposed method induces reliable patterns from a morphological dictionary and annotated data. Experimental results on two standard datasets confirm that the method exhibits comparable accuracy to learning-based baselines, while boasting a remarkable throughput of over 1,000,000 sentences per second on a single modern CPU. The source code is available at https://www.tkl.iis.u-tokyo.ac.jp/~ynaga/jagger/
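To make the idea of pattern-based analysis concrete, here is a minimal sketch of longest-match lookup over a character trie. It is not the authors' implementation and does not reproduce the feature-sequence trie itself. The class names (`TrieNode`, `PatternTrie`), the `analyze` function, the `UNK` fallback tag, and the toy patterns are all assumptions made for illustration. In the paper, patterns are induced from a morphological dictionary and annotated data rather than written by hand.

```python
from typing import Dict, List, Optional, Tuple


class TrieNode:
    """One trie node; `tag` is set when a stored pattern ends here."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.tag: Optional[str] = None


class PatternTrie:
    """Hypothetical pattern store: surface string -> part-of-speech tag."""

    def __init__(self) -> None:
        self.root = TrieNode()

    def add(self, surface: str, tag: str) -> None:
        node = self.root
        for ch in surface:
            node = node.children.setdefault(ch, TrieNode())
        node.tag = tag

    def longest_match(self, text: str, start: int) -> Tuple[int, str]:
        """Return (length, tag) of the longest pattern starting at `start`.

        Falls back to a single character tagged "UNK" (an assumption of
        this sketch) when no pattern matches.
        """
        node = self.root
        best_len, best_tag = 1, "UNK"
        for i in range(start, len(text)):
            nxt = node.children.get(text[i])
            if nxt is None:
                break
            node = nxt
            if node.tag is not None:
                best_len, best_tag = i - start + 1, node.tag
        return best_len, best_tag


def analyze(trie: PatternTrie, text: str) -> List[Tuple[str, str]]:
    """Segment and tag `text` greedily, left to right."""
    result: List[Tuple[str, str]] = []
    pos = 0
    while pos < len(text):
        length, tag = trie.longest_match(text, pos)
        result.append((text[pos:pos + length], tag))
        pos += length
    return result


# Toy patterns, hand-written for illustration only.
trie = PatternTrie()
for surface, tag in [("東京", "noun"), ("東京都", "noun"),
                     ("に", "particle"), ("住む", "verb")]:
    trie.add(surface, tag)

print(analyze(trie, "東京都に住む"))
```

Each position in the input is resolved with a single walk down the trie, with no scoring or search over alternative segmentations. That is the general reason pattern-based methods of this kind can be fast.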


