NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling

07/12/2020
by   Shareef Babu Kalluri, et al.
0

Many commercial and forensic applications of speech demand the extraction of information about the speaker characteristics, which falls into the broad category of speaker profiling. The speaker characteristics needed for profiling include physical traits of the speaker like height, age, and gender of the speaker along with the native language of the speaker. Many of the datasets available have only partial information for speaker profiling. In this paper, we attempt to overcome this limitation by developing a new dataset which has speech data from five different Indian languages along with English. The metadata information for speaker profiling applications like linguistic information, regional information, and physical characteristics of a speaker are also collected. We call this dataset as NITK-IISc Multilingual Multi-accent Speaker Profiling (NISP) dataset. The description of the dataset, potential applications, and baseline results for speaker profiling on this dataset are provided in this paper.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2023

Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge

In this paper, we describe the systems developed by the SJTU X-LANCE tea...
research
03/16/2022

Speaker Information Can Guide Models to Better Inductive Biases: A Case Study On Predicting Code-Switching

Natural language processing (NLP) models trained on people-generated dat...
research
08/16/2021

NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition

This document provides a brief description of the National Institute of ...
research
06/26/2019

The UN Security Council debates 1995-2017

This paper presents a new dataset containing 65,393 speeches held in the...
research
10/24/2021

Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling

Speaker profiling, which aims to estimate speaker characteristics such a...
research
11/23/2018

Learning pronunciation from a foreign language in speech synthesis networks

Although there are more than 65,000 languages in the world, the pronunci...
research
02/20/2023

Towards Measuring and Scoring Speaker Diarization Fairness

Speaker diarization, or the task of finding "who spoke and when", is now...

Please sign up or login with your details

Forgot password? Click here to reset