audino: A Modern Annotation Tool for Audio and Speech

06/09/2020
by   Manraj Singh Grover, et al.
0

In this paper, we introduce a collaborative and modern annotation tool for audio and speech: audino. The tool allows annotators to define and describe temporal segmentation in audios. These segments can be labelled and transcribed easily using a dynamically generated form. An admin can centrally control user roles and project assignment through the admin dashboard. The dashboard also enables describing labels and their values. The annotations can easily be exported in JSON format for further processing. The tool allows audio data to be uploaded and assigned to a user through a key-based API. The flexibility available in the annotation tool enables annotation for Speech Scoring, Voice Activity Detection (VAD), Speaker Diarisation, Speaker Identification, Speech Recognition, Emotion Recognition tasks and more. The MIT open source license allows it to be used for academic and commercial projects.

READ FULL TEXT

page 1

page 3

research
08/14/2023

Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations

Although automatic emotion recognition (AER) has recently drawn signific...
research
03/03/2020

Seshat: A tool for managing and verifying annotation campaigns of audio data

We introduce Seshat, a new, simple and open-source software to efficient...
research
03/13/2023

A processing framework to access large quantities of whispered speech found in ASMR

Whispering is a ubiquitous mode of communication that humans use daily. ...
research
12/02/2022

NEAL: An open-source tool for audio annotation

Passive acoustic monitoring is used widely in ecology, biodiversity, and...
research
08/13/2019

IMS-Speech: A Speech to Text Tool

We present the IMS-Speech, a web based tool for German and English speec...
research
03/06/2020

Multi-Time-Scale Convolution for Emotion Recognition from Speech Audio Signals

Robustness against temporal variations is important for emotion recognit...
research
12/17/2019

Libri-Light: A Benchmark for ASR with Limited or No Supervision

We introduce a new collection of spoken English audio suitable for train...

Please sign up or login with your details

Forgot password? Click here to reset