DeepAI
Log In Sign Up

Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation

10/10/2020
by   Mehrad Moradshahi, et al.
17

We propose Semantic Parser Localizer (SPL), a toolkit that leverages Neural Machine Translation (NMT) systems to localize a semantic parser for a new language. Our methodology is to (1) generate training data automatically in the target language by augmenting machine-translated datasets with local entities scraped from public websites, (2) add a few-shot boost of human-translated sentences and train a novel XLMR-LSTM semantic parser, and (3) test the model on natural utterances curated using human translators. We assess the effectiveness of our approach by extending the current capabilities of Schema2QA, a system for English Question Answering (QA) on the open web, to 10 new languages for the restaurants and hotels domains. Our models achieve an overall test accuracy ranging between 61 hotels domain and between 64 favorably to 69 data and a few examples from validation set. We show our approach outperforms the previous state-of-the-art methodology by more than 30 for restaurants with localized ontologies for the subset of languages tested. Our methodology enables any software developer to add a new language capability to a QA system for a new domain, leveraging machine translation, in less than 24 hours.

READ FULL TEXT
07/05/2022

Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic

It can be challenging to build effective open question answering (open Q...
02/08/2017

Neural Machine Translation with Source-Side Latent Graph Parsing

This paper presents a novel neural machine translation model which joint...
07/09/2021

Using Machine Translation to Localize Task Oriented NLG Output

One of the challenges in a task oriented natural language application li...
04/06/2020

Bootstrapping a Crosslingual Semantic Parser

Datasets for semantic parsing scarcely consider languages other than Eng...
09/10/2018

Multilingual Extractive Reading Comprehension by Runtime Machine Translation

Existing end-to-end neural network models for extractive Reading Compreh...
03/21/2022

Paraphrasing Techniques for Maritime QA system

There has been an increasing interest in incorporating Artificial Intell...
09/16/2020

State-Machine-Based Dialogue Agents with Few-Shot Contextual Semantic Parsers

This paper presents a methodology and toolkit for creating a rule-based ...

Code Repositories

genienlp

This repository contains the code for Genie NLP models


view repo