Development of a Conversation State Prediction System

07/03/2021
by   Sujay Uday Rittikar, et al.
0

With the evolution of the concept of Speaker diarization using LSTM, it is relatively easier to understand the speaker identities for specific segments of input audio stream data than manually tagging the data. With such a concept, it is highly desirable to consider the possibility of using the identified speaker identities to aid in recognizing the Speaker States in a conversation. In this study, the Markov Chains are used to identify and update the Speaker States for the next conversations between the same set of speakers, to enable identification of their states in the most natural and long conversations. The model is based on several audio samples from natural conversations of three or greater than three speakers in two datasets with overall total error percentages for recognized states being lesser than or equal to 12 findings imply that the proposed extension to the Speaker diarization is effective to predict the states for a conversation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2018

MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System

The Multitarget Challenge aims to assess how well current speech technol...
research
03/28/2023

Egocentric Auditory Attention Localization in Conversations

In a noisy conversation environment such as a dinner party, people often...
research
03/01/2017

Tracing Linguistic Relations in Winning and Losing Sides of Explicit Opposing Groups

Linguistic relations in oral conversations present how opinions are cons...
research
06/03/2020

Improving Speaker Identification using Network Knowledge in Criminal Conversational Data

Criminal investigations rely on the collection of conversational data. T...
research
09/02/2018

Contextual Neural Model for Translating Bilingual Multi-Speaker Conversations

Recent works in neural machine translation have begun to explore documen...
research
03/26/2014

Constrained speaker linking

In this paper we study speaker linking (a.k.a. partitioning) given const...
research
10/24/2020

Stop Bugging Me! Evading Modern-Day Wiretapping Using Adversarial Perturbations

Mass surveillance systems for voice over IP (VoIP) conversations pose a ...

Please sign up or login with your details

Forgot password? Click here to reset