Sound Search by Text Description or Vocal Imitation?

07/19/2019
by   Yichi Zhang, et al.
0

Searching sounds by text labels is often difficult, as text descriptions cannot describe the audio content in detail. Query by vocal imitation bridges such gap and provides a novel way to sound search. Several algorithms for sound search by vocal imitation have been proposed and evaluated in a simulation environment, however, they have not been deployed into a real search engine nor evaluated by real users. This pilot work conducts a subjective study to compare these two approaches to sound search, and tries to answer the question of which approach works better for what kinds of sounds. To do so, we developed two web-based search engines for sound, one by vocal imitation (Vroom!) and the other by text description (TextSearch). We also developed an experimental framework to host these engines to collect statistics of user behaviors and ratings. Results showed that Vroom! received significantly higher search satisfaction ratings than TextSearch did for sound categories that were difficult for subjects to describe by text. Results also showed a better overall ease-of-use rating for Vroom! than TextSearch on the limited sound library in our experiments. These findings suggest advantages of vocal-imitation-based search for sound in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2017

Framework for evaluation of sound event detection in web videos

The largest source of sound events is web videos. Most videos lack sound...
research
11/01/2020

Search Engine Similarity Analysis: A Combined Content and Rankings Approach

How different are search engines? The search engine wars are a favorite ...
research
04/27/2023

Boosting Big Brother: Attacking Search Engines with Encodings

Search engines are vulnerable to attacks against indexing and searching ...
research
04/10/2022

Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation

Imitating musical instruments with the human voice is an efficient way o...
research
05/24/2020

How Does That Sound? Multi-Language SpokenName2Vec Algorithm Using Speech Generation and Deep Learning

Searching for information about a specific person is an online activity ...
research
04/08/2020

Search Result Clustering in Collaborative Sound Collections

The large size of nowadays' online multimedia databases makes retrieving...
research
06/01/2021

WebMIaS on Docker: Deploying Math-Aware Search in a Single Line of Code

Math informational retrieval (MIR) search engines are absent in the wide...

Please sign up or login with your details

Forgot password? Click here to reset