INFACT: An Online Human Evaluation Framework for Conversational Recommendation

09/07/2022
by   Ahtsham Manzoor, et al.
0

Conversational recommender systems (CRS) are interactive agents that support their users in recommendation-related goals through multi-turn conversations. Generally, a CRS can be evaluated in various dimensions. Today's CRS mainly rely on offline(computational) measures to assess the performance of their algorithms in comparison to different baselines. However, offline measures can have limitations, for example, when the metrics for comparing a newly generated response with a ground truth do not correlate with human perceptions, because various alternative generated responses might be suitable too in a given dialog situation. Current research on machine learning-based CRS models therefore acknowledges the importance of humans in the evaluation process, knowing that pure offline measures may not be sufficient in evaluating a highly interactive system like a CRS.

READ FULL TEXT
research
06/15/2020

Evaluating Conversational Recommender Systems via User Simulation

Conversational information access is an emerging research area. Currentl...
research
12/09/2021

Self-Supervised Bot Play for Conversational Recommendation with Justifications

Conversational recommender systems offer the promise of interactive, eng...
research
05/22/2023

Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models

The recent success of large language models (LLMs) has shown great poten...
research
12/12/2022

Evaluation of Synthetic Datasets for Conversational Recommender Systems

For researchers leveraging Large-Language Models (LLMs) in the generatio...
research
01/03/2023

Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives

In this paper, we argue that the paradigm commonly adopted for offline e...
research
03/17/2022

Conversational Recommendation: A Grand AI Challenge

Animated avatars, which look and talk like humans, are iconic visions of...
research
06/25/2019

Deep Conversational Recommender in Travel

When traveling to a foreign country, we are often in dire need of an int...

Please sign up or login with your details

Forgot password? Click here to reset