Convolutional Neural Networks and x-vector Embedding for DCASE2018 Acoustic Scene Classification Challenge

10/01/2018
by   Hossein Zeinali, et al.
0

In this paper, the Brno University of Technology (BUT) team submissions for Task 1 (Acoustic Scene Classification, ASC) of the DCASE-2018 challenge are described. Also, the analysis of different methods on the leaderboard set is provided. The proposed approach is a fusion of two different Convolutional Neural Network (CNN) topologies. The first one is the common two-dimensional CNNs which is mainly used in image classification. The second one is a one-dimensional CNN for extracting fixed-length audio segment embeddings, so called x-vectors, which has also been used in speech processing, especially for speaker recognition. In addition to the different topologies, two types of features were tested: log mel-spectrogram and CQT features. Finally, the outputs of different systems are fused using a simple output averaging in the best performing system. Our submissions ranked third among 24 teams in the ASC sub-task A (task1a).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2019

Acoustic Scene Classification Using Fusion of Attentive Convolutional Neural Networks for DCASE2019 Challenge

In this report, the Brno University of Technology (BUT) team submissions...
research
10/16/2019

BUT System Description to VoxCeleb Speaker Recognition Challenge 2019

In this report, we describe the submission of Brno University of Technol...
research
06/20/2017

A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification

In Acoustic Scene Classification (ASC) two major approaches have been fo...
research
07/08/2016

CNN-LTE: a Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Recognition

We describe in this report our audio scene recognition system submitted ...
research
09/23/2019

Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features

We present our system submission to the ASVspoof 2019 Challenge Physical...
research
06/19/2018

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

In the past, Acoustic Scene Classification systems have been based on ha...
research
07/06/2020

Acoustic Scene Classification with Spectrogram Processing Strategies

Recently, convolutional neural networks (CNN) have achieved the state-of...

Please sign up or login with your details

Forgot password? Click here to reset