Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images

10/17/2022
by   Hien Ohnaka, et al.
0

We propose a method for synthesizing environmental sounds from visually represented onomatopoeias and sound sources. An onomatopoeia is a word that imitates a sound structure, i.e., the text representation of sound. From this perspective, onoma-to-wave has been proposed to synthesize environmental sounds from the desired onomatopoeia texts. Onomatopoeias have another representation: visual-text representations of sounds in comics, advertisements, and virtual reality. A visual onomatopoeia (visual text of onomatopoeia) contains rich information that is not present in the text, such as a long-short duration of the image, so the use of this representation is expected to synthesize diverse sounds. Therefore, we propose visual onoma-to-wave for environmental sound synthesis from visual onomatopoeia. The method can transfer visual concepts of the visual text and sound-source image to the synthesized sound. We also propose a data augmentation method focusing on the repetition of onomatopoeias to enhance the performance of our method. An experimental evaluation shows that the methods can synthesize diverse environmental sounds from visual text and sound-source images.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2023

Environmental sound conversion from vocal imitations and sound event labels

One way of expressing an environmental sound is using vocal imitations, ...
research
02/11/2021

Onoma-to-wave: Environmental sound synthesis from onomatopoeic words

In this paper, we propose a new framework for environmental sound synthe...
research
08/27/2019

Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion

Synthesizing and converting environmental sounds have the potential for ...
research
08/26/2023

ORES: Open-vocabulary Responsible Visual Synthesis

Avoiding synthesizing specific visual concepts is an essential challenge...
research
08/16/2022

How Should We Evaluate Synthesized Environmental Sounds

Although several methods of environmental sound synthesis have been prop...
research
05/28/2023

CAPTDURE: Captioned Sound Dataset of Single Sources

In conventional studies on environmental sound separation and synthesis ...
research
09/25/2012

Environmental Sounds Spectrogram Classification using Log-Gabor Filters and Multiclass Support Vector Machines

This paper presents novel approaches for efficient feature extraction us...

Please sign up or login with your details

Forgot password? Click here to reset