ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development

07/17/2023
by   Yanir Marmor, et al.
0

We introduce "ivrit.ai", a comprehensive Hebrew speech dataset, addressing the distinct lack of extensive, high-quality resources for advancing Automated Speech Recognition (ASR) technology in Hebrew. With over 3,300 speech hours and a over a thousand diverse speakers, ivrit.ai offers a substantial compilation of Hebrew speech across various contexts. It is delivered in three forms to cater to varying research needs: raw unprocessed audio; data post-Voice Activity Detection, and partially transcribed data. The dataset stands out for its legal accessibility, permitting use at no cost, thereby serving as a crucial resource for researchers, developers, and commercial entities. ivrit.ai opens up numerous applications, offering vast potential to enhance AI capabilities in Hebrew. Future efforts aim to expand ivrit.ai further, thereby advancing Hebrew's standing in AI research and technology.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/03/2023

AI in HCI Design and User Experience

In this chapter, we review and discuss the transformation of AI technolo...
research
03/27/2018

Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus

Speech recognition has received a less attention in Bengali literature d...
research
08/24/2023

Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion

There are growing implications surrounding generative AI in the speech d...
research
07/04/2019

Toward Fairness in AI for People with Disabilities: A Research Roadmap

AI technologies have the potential to dramatically impact the lives of p...
research
03/21/2023

Transformers in Speech Processing: A Survey

The remarkable success of transformers in the field of natural language ...
research
07/22/2022

Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

As for other forms of AI, speech recognition has recently been examined ...
research
08/09/2021

Experiences with the Introduction of AI-based Tools for Moderation Automation of Voice-based Participatory Media Forums

Voice-based discussion forums where users can record audio messages whic...

Please sign up or login with your details

Forgot password? Click here to reset