GrammarTagger: A Multilingual, Minimally-Supervised Grammar Profiler for Language Education

04/07/2021
by   Masato Hagiwara, et al.
0

We present GrammarTagger, an open-source grammar profiler which, given an input text, identifies grammatical features useful for language education. The model architecture enables it to learn from a small amount of texts annotated with spans and their labels, which 1) enables easier and more intuitive annotation, 2) supports overlapping spans, and 3) is less prone to error propagation, compared to complex hand-crafted rules defined on constituency/dependency parses. We show that we can bootstrap a grammar profiler model with F_1 ≈ 0.6 from only a couple hundred sentences both in English and Chinese, which can be further boosted via learning a multilingual model. With GrammarTagger, we also build Octanove Learn, a search engine of language learning materials indexed by their reading difficulty and grammatical features. The code and pretrained models are publicly available at <https://github.com/octanove/grammartagger>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2022

Open Source HamNoSys Parser for Multilingual Sign Language Encoding

This paper presents our recent developments in the field of automatic pr...
research
11/12/2015

A Multilingual FrameNet-based Grammar and Lexicon for Controlled Natural Language

Berkeley FrameNet is a lexico-semantic resource for English based on the...
research
06/02/2023

Learning from Partially Annotated Data: Example-aware Creation of Gap-filling Exercises for Language Learning

Since performing exercises (including, e.g., practice tests) forms a cru...
research
07/30/2021

MTVR: Multilingual Moment Retrieval in Videos

We introduce mTVR, a large-scale multilingual video moment retrieval dat...
research
04/27/2023

A Modular Approach for Multilingual Timex Detection and Normalization using Deep Learning and Grammar-based methods

Detecting and normalizing temporal expressions is an essential step for ...
research
03/23/2022

Geometry-Aware Supertagging with Heterogeneous Dynamic Convolutions

The syntactic categories of categorial grammar formalisms are structured...
research
03/07/2017

Learning opacity in Stratal Maximum Entropy Grammar

Opaque phonological patterns are sometimes claimed to be difficult to le...

Please sign up or login with your details

Forgot password? Click here to reset