Reward-free Policy Imitation Learning for Conversational Search

04/17/2023
by   Zhenduo Wang, et al.
0

Existing conversational search studies mainly focused on asking better clarifying questions and/or improving search result quality. These works aim at retrieving better responses according to the search context, and their performances are evaluated on either single-turn tasks or multi-turn tasks under naive conversation policy settings. This leaves some questions about their applicability in real-world multi-turn conversations where realistically, each and every action needs to be made by the system itself, and search session efficiency is often an important concern of conversational search systems. While some recent works have identified the need for improving search efficiency in conversational search, they mostly require extensive data annotations and use hand-crafted rewards or heuristics to train systems that can achieve reasonable performance in a restricted number of turns, which has limited generalizability in practice. In this paper, we propose a reward-free conversation policy imitation learning framework, which can train a conversation policy without annotated conversation data or manually designed rewards. The trained conversation policy can be used to guide the conversational retrieval models to balance conversational search quality and efficiency. To evaluate the proposed conversational search system, we propose a new multi-turn-multi-response conversational evaluation metric named Expected Conversational Reciprocal Rank (ECRR). ECRR is designed to evaluate entire multi-turn conversational search sessions towards comprehensively evaluating both search result quality and search efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2022

Evaluating Mixed-initiative Conversational Search Systems via User Simulation

Clarifying the underlying user information need by asking clarifying que...
research
04/17/2023

An In-depth Investigation of User Response Simulation for Conversational Search

Conversational search has seen increased recent attention in both the IR...
research
05/25/2023

ConvGQR: Generative Query Reformulation for Conversational Search

In conversational search, the user's real search intent for the current ...
research
01/01/2022

Simulating and Modeling the Risk of Conversational Search

In conversational search, agents can interact with users by asking clari...
research
06/01/2016

Conversational Contextual Cues: The Case of Personalization and History for Response Ranking

We investigate the task of modeling open-domain, multi-turn, unstructure...
research
12/19/2017

Attentive Memory Networks: Efficient Machine Reading for Conversational Search

Recent advances in conversational systems have changed the search paradi...
research
05/24/2020

Query Resolution for Conversational Search with Limited Supervision

In this work we focus on multi-turn passage retrieval as a crucial compo...

Please sign up or login with your details

Forgot password? Click here to reset