Dynamic Multi-scale Convolution for Dialect Identification

08/02/2021
by   Tianlong Kong, et al.
0

Time Delay Neural Networks (TDNN)-based methods are widely used in dialect identification. However, in previous work with TDNN application, subtle variant is being neglected in different feature scales. To address this issue, we propose a new architecture, named dynamic multi-scale convolution, which consists of dynamic kernel convolution, local multi-scale learning, and global multi-scale pooling. Dynamic kernel convolution captures features between short-term and long-term context adaptively. Local multi-scale learning, which represents multi-scale features at a granular level, is able to increase the range of receptive fields for convolution operation. Besides, global multi-scale pooling is applied to aggregate features from different bottleneck layers in order to collect information from multiple aspects. The proposed architecture significantly outperforms state-of-the-art system on the AP20-OLR-dialect-task of oriental language recognition (OLR) challenge 2020, with the best average cost performance (Cavg) of 0.067 and the best equal error rate (EER) of 6.52 9 parameters of proposed model are 91

READ FULL TEXT
research
10/14/2022

MCTNet: A Multi-Scale CNN-Transformer Network for Change Detection in Optical Remote Sensing Images

For the task of change detection (CD) in remote sensing images, deep con...
research
05/24/2018

Multi-Scale DenseNet-Based Electricity Theft Detection

Electricity theft detection issue has drawn lots of attention during las...
research
02/12/2021

Broad-UNet: Multi-scale feature learning for nowcasting tasks

Weather nowcasting consists of predicting meteorological components in t...
research
07/23/2019

PointAtrousGraph: Deep Hierarchical Encoder-Decoder with Atrous Convolution for Point Clouds

Motivated by the success of encoding multi-scale contextual information ...
research
07/07/2012

Object Recognition with Multi-Scale Pyramidal Pooling Networks

We present a Multi-Scale Pyramidal Pooling Network, featuring a novel py...
research
06/14/2022

Learning Behavior Representations Through Multi-Timescale Bootstrapping

Natural behavior consists of dynamics that are both unpredictable, can s...
research
05/17/2023

Two-Stream Regression Network for Dental Implant Position Prediction

In implant prosthesis treatment, the design of surgical guide requires l...

Please sign up or login with your details

Forgot password? Click here to reset