Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers

12/07/2022
by   Shih-Hong Huang, et al.
0

Real-time crowd-powered systems, such as Chorus/Evorus, VizWiz, and Apparition, have shown how incorporating humans into automated systems could supplement where the automatic solutions fall short. However, one unspoken bottleneck of applying such architectures to more scenarios is the longer latency of including humans in the loop of automated systems. For the applications that have hard constraints in turnaround times, human-operated components' longer latency and large speed variation seem to be apparent deal breakers. This paper explicates and quantifies these limitations by using a human-powered text-based backend to hold conversations with users through a voice-only smart speaker. Smart speakers must respond to users' requests within seconds, so the workers behind the scenes only have a few seconds to compose answers. We measured the end-to-end system latency and the conversation quality with eight pairs of participants, showing the challenges and superiority of such systems.

READ FULL TEXT

page 4

page 6

page 7

page 8

research
01/08/2018

Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

Crowd-powered conversational assistants have been shown to be more robus...
research
08/10/2017

"Is there anything else I can help you with?": Challenges in Deploying an On-Demand Crowd-Powered Conversational Agent

Intelligent conversational assistants, such as Apple's Siri, Microsoft's...
research
03/22/2023

Real-World Community-in-the-Loop Smart Video Surveillance – A Case Study at a Community College

Smart Video surveillance systems have become important recently for ensu...
research
10/21/2019

On Automating Conversations

From 2016 to 2018, we developed and deployed Chorus, a system that blend...
research
03/30/2023

What Types of Questions Require Conversation to Answer? A Case Study of AskReddit Questions

The proliferation of automated conversational systems such as chatbots, ...
research
11/05/2020

BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers

We present a novel online end-to-end neural diarization system, BW-EDA-E...

Please sign up or login with your details

Forgot password? Click here to reset