Separating Sounds from a Single Image

07/15/2020
by Lingyu Zhu, et al.

Recently, visual information has been widely used to aid sound source separation, the task of identifying the sound components of a given mixture with the help of visual cues. Appearance cues in particular play an important role in separating sounds. However, how well the network processes each modality is often ignored. In this paper, we investigate how well appearance information, extracted from a single image, supports recovering the original component signals from an audio mixture. We introduce an efficient appearance attention module that improves sound separation performance by enhancing the distinction between the predicted semantic representations, and that precisely locates sound sources without extra computation. Moreover, we use the ground-truth category information to study the capacity of each sub-network. We compare the proposed methods with recent baselines on the MUSIC dataset. Project page: https://ly-zhu.github.io/separating-sounds-from-single-image


