DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation

05/08/2023
by   ChaeHun Park, et al.
0

Despite the recent advances in open-domain dialogue systems, building a reliable evaluation metric is still a challenging problem. Recent studies proposed learnable metrics based on classification models trained to distinguish the correct response. However, neural classifiers are known to make overly confident predictions for examples from unseen distributions. We propose DEnsity, which evaluates a response by utilizing density estimation on the feature space derived from a neural classifier. Our metric measures how likely a response would appear in the distribution of human conversations. Moreover, to improve the performance of DEnsity, we utilize contrastive learning to further compress the feature space. Experiments on multiple response evaluation datasets show that DEnsity correlates better with human evaluations than the existing metrics. Our code is available at https://github.com/ddehun/DEnsity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/24/2019

Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings

Despite advances in open-domain dialogue systems, automatic evaluation o...
research
05/26/2023

Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection

Out-of-distribution (OOD) detection is a critical requirement for the de...
research
05/08/2018

One "Ruler" for All Languages: Multi-Lingual Dialogue Evaluation with Adversarial Multi-Task Learning

Automatic evaluating the performance of Open-domain dialogue system is a...
research
05/31/2023

Investigation of the Robustness of Neural Density Fields

Recent advances in modeling density distributions, so-called neural dens...
research
05/23/2023

Improving Heterogeneous Model Reuse by Density Estimation

This paper studies multiparty learning, aiming to learn a model using th...
research
02/08/2023

Disentangling Learning Representations with Density Estimation

Disentangled learning representations have promising utility in many app...
research
06/07/2016

Better Conditional Density Estimation for Neural Networks

The vast majority of the neural network literature focuses on predicting...

Please sign up or login with your details

Forgot password? Click here to reset