Vector Representations of Idioms in Conversational Systems

05/07/2022
by   Tosin Adewumi, et al.
0

We demonstrate, in this study, that an open-domain conversational system trained on idioms or figurative language generates more fitting responses to prompts containing idioms. Idioms are part of everyday speech in many languages, across many cultures, but they pose a great challenge for many Natural Language Processing (NLP) systems that involve tasks such as Information Retrieval (IR) and Machine Translation (MT), besides conversational AI. We utilize the Potential Idiomatic Expression (PIE)-English idioms corpus for the two tasks that we investigate: classification and conversation generation. We achieve state-of-the-art (SoTA) result of 98 the classification task by using the SoTA T5 model. We experiment with three instances of the SoTA dialogue model, Dialogue Generative Pre-trained Transformer (DialoGPT), for conversation generation. Their performances are evaluated using the automatic metric perplexity and human evaluation. The results show that the model trained on the idiom corpus generates more fitting responses to prompts containing idioms 71.9 model not trained on the idioms corpus. We contribute the model checkpoint/demo and code on the HuggingFace hub for public access.

READ FULL TEXT
research
11/01/2019

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

We present a large, tunable neural conversational response generation mo...
research
10/12/2021

Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning

Building open-domain conversational systems (or chatbots) that produce c...
research
04/25/2021

Potential Idiomatic Expression (PIE)-English: Corpus for Classes of Idioms

We present a fairly large, Potential Idiomatic Expression (PIE) dataset ...
research
08/23/2019

Deep Learning Based Chatbot Models

A conversational agent (chatbot) is a piece of software that is able to ...
research
08/22/2023

Learning to generate and corr- uh I mean repair language in real-time

In conversation, speakers produce language incrementally, word by word, ...
research
10/02/2022

Risk-graded Safety for Handling Medical Queries in Conversational AI

Conversational AI systems can engage in unsafe behaviour when handling u...
research
05/02/2021

Intelligent Conversational Android ERICA Applied to Attentive Listening and Job Interview

Following the success of spoken dialogue systems (SDS) in smartphone ass...

Please sign up or login with your details

Forgot password? Click here to reset