Evaluating Mixed-initiative Conversational Search Systems via User Simulation

04/17/2022
by   Ivan Sekulić, et al.
0

Clarifying the underlying user information need by asking clarifying questions is an important feature of modern conversational search system. However, evaluation of such systems through answering prompted clarifying questions requires significant human effort, which can be time-consuming and expensive. In this paper, we propose a conversational User Simulator, called USi, for automatic evaluation of such conversational search systems. Given a description of an information need, USi is capable of automatically answering clarifying questions about the topic throughout the search session. Through a set of experiments, including automated natural language generation metrics and crowdsourcing studies, we show that responses generated by USi are both inline with the underlying information need and comparable to human-generated answers. Moreover, we make the first steps towards multi-turn interactions, where conversational search systems asks multiple questions to the (simulated) user with a goal of clarifying the user need. To this end, we expand on currently available datasets for studying clarifying questions, i.e., Qulac and ClariQ, by performing a crowdsourcing-based multi-turn data acquisition. We show that our generative, GPT2-based model, is capable of providing accurate and natural answers to unseen clarifying questions in the single-turn setting and discuss capabilities of our model in the multi-turn setting. We provide the code, data, and the pre-trained model to be used for further research on the topic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2023

Exploiting Simulated User Feedback for Conversational Search: Ranking, Rewriting, and Beyond

This research aims to explore various methods for assessing user feedbac...
research
04/17/2023

An In-depth Investigation of User Response Simulation for Conversational Search

Conversational search has seen increased recent attention in both the IR...
research
04/17/2023

Reward-free Policy Imitation Learning for Conversational Search

Existing conversational search studies mainly focused on asking better c...
research
09/07/2021

POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling

Conversational search systems, such as Google Assistant and Microsoft Co...
research
11/09/2019

Interactive Classification by Asking Informative Questions

Natural language systems often rely on a single, potentially ambiguous i...
research
04/27/2021

Meta-evaluation of Conversational Search Evaluation Metrics

Conversational search systems, such as Google Assistant and Microsoft Co...
research
04/29/2020

Conversations with Search Engines

In this paper, we address the problem of answering complex information n...

Please sign up or login with your details

Forgot password? Click here to reset