Deep Learning Frameworks Applied For Audio-Visual Scene Classification

06/12/2021
by   Lam Pham, et al.
0

In this paper, we present deep learning frameworks for audio-visual scene classification (SC) and indicate how individual visual and audio features as well as their combination affect SC performance. Our extensive experiments, which are conducted on DCASE (IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events) Task 1B development dataset, achieve the best classification accuracy of 82.2 input only, visual input only, and both audio-visual input, respectively. The highest classification accuracy of 93.9 audio-based and visual-based frameworks, shows an improvement of 16.5 with DCASE baseline.

READ FULL TEXT
research
12/16/2021

An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification

This paper presents a task of audio-visual scene classification (SC) whe...
research
04/30/2023

Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification

In this paper, we present a deep learning based multimodal system for cl...
research
05/28/2021

Audio-visual scene classification: analysis of DCASE 2021 Challenge submissions

This paper presents the details of the Audio-Visual Scene Classification...
research
06/19/2018

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

In the past, Acoustic Scene Classification systems have been based on ha...
research
12/09/2021

On The Effect Of Coding Artifacts On Acoustic Scene Classification

Previous DCASE challenges contributed to an increase in the performance ...
research
04/25/2022

Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy

Recently, audio-visual scene classification (AVSC) has attracted increas...
research
02/25/2019

Audio Caption: Listen and Tell

Increasing amount of research has shed light on machine perception of au...

Please sign up or login with your details

Forgot password? Click here to reset