Speaker Diarization With Lexical Information

11/27/2018
by   Tae Jin Park, et al.
0

This work presents a novel approach to leverage lexical information for speaker diarization. We introduce a speaker diarization system that can directly integrate lexical as well as acoustic information into a speaker clustering process. Thus, we propose an adjacency matrix integration technique to integrate word level speaker turn probabilities with speaker embeddings in a comprehensive way. Our proposed method works without any reference transcript. Words, and word boundary information are provided by an ASR system. We show that our proposed method improves a baseline speaker diarization system solely based on speaker embeddings, achieving a meaningful improvement on the CALLHOME American English Speech dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2018

Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks

While there has been substantial amount of work in speaker diarization r...
research
09/11/2023

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Large language models (LLMs) have shown great promise for capturing cont...
research
05/21/2018

Speaker Clustering Using Dominant Sets

Speaker clustering is the task of forming speaker-specific groups based ...
research
05/31/2020

A Unified Feature Representation for Lexical Connotations

Ideological attitudes and stance are often expressed through subtle mean...
research
08/26/2020

On the Optimality of Vagueness: "Around", "Between", and the Gricean Maxims

Why is our language vague? We argue that in contexts in which a cooperat...
research
06/02/2017

Prosodic Event Recognition using Convolutional Neural Networks with Context Information

This paper demonstrates the potential of convolutional neural networks (...
research
07/18/2021

Exploring the Potential of Lexical Paraphrases for Mitigating Noise-Induced Comprehension Errors

Listening in noisy environments can be difficult even for individuals wi...

Please sign up or login with your details

Forgot password? Click here to reset