Insights into End-to-End Learning Scheme for Language Identification

04/02/2018
by   Weicheng Cai, et al.
0

A novel interpretable end-to-end learning scheme for language identification is proposed. It is in line with the classical GMM i-vector methods both theoretically and practically. In the end-to-end pipeline, a general encoding layer is employed on top of the front-end CNN, so that it can encode the variable-length input sequence into an utterance level vector automatically. After comparing with the state-of-the-art GMM i-vector methods, we give insights into CNN, and reveal its role and effect in the whole pipeline. We further introduce a general encoding layer, illustrating the reason why they might be appropriate for language identification. We elaborate on several typical encoding layers, including a temporal average pooling layer, a recurrent encoding layer and a novel learnable dictionary encoding layer. We conducted experiment on NIST LRE07 closed-set task, and the results show that our proposed end-to-end systems achieve state-of-the-art performance.

READ FULL TEXT

page 2

page 4

research
04/02/2018

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification

A novel learnable dictionary encoding layer is proposed in this paper fo...
research
09/09/2018

End-to-end Language Identification using NetFV and NetVLAD

In this paper, we apply the NetFV and NetVLAD layers for the end-to-end ...
research
05/26/2023

Set-based Neural Network Encoding

We propose an approach to neural network weight encoding for generalizat...
research
05/30/2022

End-to-End Topology-Aware Machine Learning for Power System Reliability Assessment

Conventional power system reliability suffers from the long run time of ...
research
02/20/2019

Utterance-level end-to-end language identification using attention-based CNN-BLSTM

In this paper, we present an end-to-end language identification framewor...
research
08/20/2015

DeepWriterID: An End-to-end Online Text-independent Writer Identification System

Owing to the rapid growth of touchscreen mobile terminals and pen-based ...
research
02/05/2020

Identification of Indian Languages using Ghost-VLAD pooling

In this work, we propose a new pooling strategy for language identificat...

Please sign up or login with your details

Forgot password? Click here to reset