Anaphora Resolution in Dialogue Systems for South Asian Languages

11/22/2019
by Vinay Annam, et al.

Anaphora resolution is a challenging task that has long been of interest to NLP researchers. Traditional resolution techniques such as eliminative constraints and weighted preferences have been successful in many languages. However, they are ineffective in free word order languages, which include most South Asian languages. Heuristic and rule-based techniques have been typical for these languages, but such techniques are constrained to a particular context and domain. In this paper, we venture a new strategy using neural networks for resolving anaphora in human-human dialogues. The architecture consists chiefly of three components: a shallow parser for extracting features, a feature vector generator that produces the word embeddings, and a neural network model that predicts the antecedent mention of an anaphor. The system has been trained and tested on a Telugu conversation corpus that we generated. Given the advantage of the semantic information in word embeddings, and by appending actor, gender, number, person and part of plural features, the model has reached an F1-score of 86.
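
The feature construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the category inventories, the `Mention` fields, the English glosses used in place of Telugu tokens, and the `agreement_score` function (a stand-in for the trained neural network) are all assumptions made for illustration. The "part of plural" feature is left out because the abstract does not define it.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical category inventories; the paper's actual tag sets are not given here.
ACTORS = ["speaker_a", "speaker_b"]
GENDERS = ["masculine", "feminine", "neuter"]
NUMBERS = ["singular", "plural"]
PERSONS = ["first", "second", "third"]


@dataclass
class Mention:
    """A mention as it might come out of a shallow parser (assumed fields)."""
    text: str
    embedding: List[float]  # word embedding of the mention head
    actor: str              # which dialogue participant uttered it
    gender: str
    number: str
    person: str


def one_hot(value: str, inventory: List[str]) -> List[float]:
    """Encode a categorical value as a one-hot list over its inventory."""
    return [1.0 if value == item else 0.0 for item in inventory]


def feature_vector(m: Mention) -> List[float]:
    """Append actor, gender, number and person features to the embedding."""
    return (
        list(m.embedding)
        + one_hot(m.actor, ACTORS)
        + one_hot(m.gender, GENDERS)
        + one_hot(m.number, NUMBERS)
        + one_hot(m.person, PERSONS)
    )


def pair_features(anaphor: Mention, candidate: Mention) -> List[float]:
    """Concatenate anaphor and candidate vectors into one model input."""
    return feature_vector(anaphor) + feature_vector(candidate)


def agreement_score(pair: List[float]) -> float:
    """Stand-in scorer (NOT a neural network): dot product of the two halves."""
    half = len(pair) // 2
    return sum(a * b for a, b in zip(pair[:half], pair[half:]))


def rank_candidates(
    anaphor: Mention,
    candidates: List[Mention],
    score_fn: Callable[[List[float]], float],
) -> List[Tuple[Mention, float]]:
    """Score every candidate antecedent and sort from best to worst."""
    scored = [(c, score_fn(pair_features(anaphor, c))) for c in candidates]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy example with English glosses and made-up 3-dimensional embeddings.
she = Mention("she", [0.1, 0.1, 0.1], "speaker_a", "feminine", "singular", "third")
sita = Mention("Sita", [0.2, 0.1, 0.0], "speaker_a", "feminine", "singular", "third")
children = Mention("children", [0.0, 0.1, 0.2], "speaker_b", "neuter", "plural", "third")

ranking = rank_candidates(she, [children, sita], agreement_score)
best_antecedent = ranking[0][0].text
```

In a trained system, `score_fn` would be replaced by the learned model's scoring function over the same pair representation; the ranking step itself would stay unchanged.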


research
08/05/2020

An exploration of the encoding of grammatical gender in word embeddings

The vector representation of words, known as word embeddings, has opened...
research
10/22/2020

On the Effects of Using word2vec Representations in Neural Networks for Dialogue Act Recognition

Dialogue act recognition is an important component of a large number of ...
research
12/27/2021

"A Passage to India": Pre-trained Word Embeddings for Indian Languages

Dense word vectors or 'word embeddings' which encode semantic properties...
research
04/18/2018

Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

We introduce a new benchmark, WinoBias, for coreference resolution focus...
research
10/01/2019

Specializing Word Embeddings (for Parsing) by Information Bottleneck

Pre-trained word embeddings like ELMo and BERT contain rich syntactic an...
research
11/02/2018

The Hard-CoRe Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution

We introduce a new benchmark task for coreference resolution, Hard-CoRe,...
research
04/18/2015

A Knowledge-poor Pronoun Resolution System for Turkish

A pronoun resolution system which requires limited syntactic knowledge t...
