SWAN: A Generic Framework for Auditing Textual Conversational Systems

05/15/2023
by   Tetsuya Sakai, et al.
0

We present a simple and generic framework for auditing a given textual conversational system, given some samples of its conversation sessions as its input. The framework computes a SWAN (Schematised Weighted Average Nugget) score based on nugget sequences extracted from the conversation sessions. Following the approaches of S-measure and U-measure, SWAN utilises nugget positions within the conversations to weight the nuggets based on a user model. We also present a schema of twenty (+1) criteria that may be worth incorporating in the SWAN framework. In our future work, we plan to devise conversation sampling methods that are suitable for the various criteria, construct seed user turns for comparing multiple systems, and validate specific instances of SWAN for the purpose of preventing negative impacts of conversational systems on users and society. This paper was written while preparing for the ICTIR 2023 keynote (to be given on July 23, 2023).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2021

SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures

Current open-domain conversational models can easily be made to talk in ...
research
08/03/2023

Memory Sandbox: Transparent and Interactive Memory Management for Conversational Agents

The recent advent of large language models (LLM) has resulted in high-pe...
research
02/02/2021

MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations

We study conversational dialog in which there are many possible response...
research
02/17/2017

soc2seq: Social Embedding meets Conversation Model

While liking or upvoting a post on a mobile app is easy to do, replying ...
research
11/06/2022

Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads

Target-specific stance detection on social media, which aims at classify...
research
05/14/2018

Conversations Gone Awry: Detecting Early Signs of Conversational Failure

One of the main challenges online social systems face is the prevalence ...
research
07/19/2019

Towards automatic estimation of conversation floors within F-formations

The detection of free-standing conversing groups has received significan...

Please sign up or login with your details

Forgot password? Click here to reset