A Demand-Driven Perspective on Generative Audio AI

07/10/2023
by Sangshin Oh, et al.

Successful deployment of AI research requires understanding the demands of industry. In this paper, we present the results of a survey of professional audio engineers, conducted to determine research priorities and define research tasks. Based on the survey, we also summarize the current challenges in audio quality and controllability. Our analysis emphasizes that dataset availability is currently the main bottleneck for high-quality audio generation. Finally, we suggest potential solutions, supported by empirical evidence, for some of the issues the survey revealed.

