Visually Guided Sound Source Separation using Cascaded Opponent Filter Network

06/04/2020
by Lingyu Zhu, et al.

The objective of this paper is to recover the original component signals from an audio mixture with the aid of visual cues of the sound sources. This task is usually referred to as visually guided sound source separation. The proposed Cascaded Opponent Filter (COF) framework consists of multiple stages, which recursively refine the sound separation based on appearance and motion information. A key element is a novel opponent filter module that identifies and relocates residual components between sound sources. Finally, we propose a Sound Source Location Masking (SSLM) technique, which, together with COF, produces a pixel-level mask of the source location. The entire system is trained end-to-end using a large set of unlabelled videos. We compare COF with recent baselines and obtain state-of-the-art performance on three challenging datasets (MUSIC, A-MUSIC, and A-NATURAL). The implementation and pre-trained models will be made publicly available.
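To make the idea of relocating residual components between sources more concrete, the toy sketch below moves energy between two spectrogram estimates over several stages. It is not the authors' implementation: the function names `opponent_transfer` and `cascade` are hypothetical, and the transfer masks are assumed to be given as inputs, whereas in the paper they would be predicted from appearance and motion information.

```python
import numpy as np


def opponent_transfer(est_a, est_b, mask_a_to_b, mask_b_to_a):
    """Move residual energy between two magnitude-spectrogram estimates.

    Assumption: mask_a_to_b holds, per time-frequency bin, the fraction of
    est_a that should belong to source B (and vice versa), in [0, 1].
    """
    moved_ab = mask_a_to_b * est_a  # part of A reassigned to B
    moved_ba = mask_b_to_a * est_b  # part of B reassigned to A
    new_a = est_a - moved_ab + moved_ba
    new_b = est_b - moved_ba + moved_ab
    return new_a, new_b


def cascade(est_a, est_b, stage_masks):
    """Apply several transfer stages in sequence, one mask pair per stage."""
    for mask_a_to_b, mask_b_to_a in stage_masks:
        est_a, est_b = opponent_transfer(est_a, est_b, mask_a_to_b, mask_b_to_a)
    return est_a, est_b


# Toy data: random non-negative "spectrograms" (frequency bins x frames).
rng = np.random.default_rng(0)
est_a = rng.random((4, 6))
est_b = rng.random((4, 6))
stage_masks = [(rng.random((4, 6)), rng.random((4, 6))) for _ in range(3)]

ref_a, ref_b = cascade(est_a, est_b, stage_masks)
```

In this simplified form each stage only reassigns energy, so the sum of the two estimates is unchanged and both stay non-negative as long as the masks lie in [0, 1].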


research
04/17/2021

Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations

The objective of this paper is to perform audio-visual sound source sepa...
research
07/15/2020

Separating Sounds from a Single Image

Recently, visual information has been widely used to aid the sound sourc...
research
04/11/2019

The Sound of Motions

Sounds originate from object motions and vibrations of surrounding air. ...
research
03/25/2021

Weakly-supervised Audio-visual Sound Source Detection and Separation

Learning how to localize and separate individual object sounds in the au...
research
04/16/2019

Co-Separating Sounds of Visual Objects

Learning how objects sound from video is challenging, since they often h...
research
11/15/2022

Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning

The criteria for measuring music similarity are important for developing...
research
07/28/2023

Automated approach for source location in shallow waters

This paper proposes a fully automated method for recovering the location...
