Serial Speakers: a Dataset of TV Series

02/17/2020
by   Xavier Bost, et al.
0

For over a decade, TV series have been drawing increasing interest, both from the audience and from various academic fields. But while most viewers are hooked on the continuous plots of TV serials, the few annotated datasets available to researchers focus on standalone episodes of classical TV series. We aim at filling this gap by providing the multimedia/speech processing communities with Serial Speakers, an annotated dataset of 161 episodes from three popular American TV serials: Breaking Bad, Game of Thrones and House of Cards. Serial Speakers is suitable both for investigating multimedia retrieval in realistic use case scenarios, and for addressing lower level speech related tasks in especially challenging conditions. We publicly release annotations for every speech turn (boundaries, speaker) and scene boundary, along with annotations for shot boundaries, recurring shots, and interacting speakers in a subset of episodes. Because of copyright restrictions, the textual content of the speech turns is encrypted in the public version of the dataset, but we provide the users with a simple online tool to recover the plain text from their own subtitle files.

READ FULL TEXT
research
08/06/2022

Analysing the Memorability of a Procedural Crime-Drama TV Series, CSI

We investigate the memorability of a 5-season span of a popular crime-dr...
research
12/18/2018

Constrained speaker diarization of TV series based on visual patterns

Speaker diarization, usually denoted as the 'who spoke when' task, turns...
research
05/09/2020

Building a Manga Dataset "Manga109" with Annotations for Multimedia Applications

Manga, or comics, which are a type of multimodal artwork, have been left...
research
12/18/2018

Détection de locuteurs dans les séries TV

Speaker diarization of audio streams turns out to be particularly challe...
research
11/08/2016

A Surrogate-based Generic Classifier for Chinese TV Series Reviews

With the emerging of various online video platforms like Youtube, Youku ...
research
12/18/2018

Audiovisual speaker diarization of TV series

Speaker diarization may be difficult to achieve when applied to narrativ...
research
04/28/2021

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection

Scenes play a crucial role in breaking the storyline of movies and TV ep...

Please sign up or login with your details

Forgot password? Click here to reset