Unsupervised Topic Segmentation of Meetings with BERT Embeddings

06/24/2021
by   Alessandro Solbiati, et al.
0

Topic segmentation of meetings is the task of dividing multi-person meeting transcripts into topic blocks. Supervised approaches to the problem have proven intractable due to the difficulties in collecting and accurately annotating large datasets. In this paper we show how previous unsupervised topic segmentation methods can be improved using pre-trained neural architectures. We introduce an unsupervised approach based on BERT embeddings that achieves a 15.5 two popular datasets for meeting transcripts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2020

Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence

Topic models extract meaningful groups of words from documents, allowing...
research
10/07/2020

ELMo and BERT in semantic change detection for Russian

We study the effectiveness of contextualized embeddings for the task of ...
research
08/21/2023

Unsupervised Dialogue Topic Segmentation in Hyperdimensional Space

We present HyperSeg, a hyperdimensional computing (HDC) approach to unsu...
research
11/29/2021

Changepoint Analysis of Topic Proportions in Temporal Text Data

Changepoint analysis deals with unsupervised detection and/or estimation...
research
02/12/2020

Improving automated segmentation of radio shows with audio embeddings

Audio features have been proven useful for increasing the performance of...
research
11/21/2022

TCBERT: A Technical Report for Chinese Topic Classification BERT

Bidirectional Encoder Representations from Transformers or BERT <cit.> h...
research
10/22/2020

GAN based Unsupervised Segmentation: Should We Match the Exact Number of Objects

The unsupervised segmentation is an increasingly popular topic in biomed...

Please sign up or login with your details

Forgot password? Click here to reset