
-
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model
Identifying multiple speakers without knowing where a speaker's voice is...
read it
-
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network
Anomalous audio in speech recordings is often caused by speaker voice di...
read it
-
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models
Many applications of speech technology require more and more audio data....
read it
-
Speaker Re-identification with Speaker Dependent Speech Enhancement
While the use of deep neural networks has significantly boosted speaker ...
read it
-
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification
Identifying multiple speakers without knowing where a speaker's voice is...
read it
-
Locality-Sensitive Hashing Scheme based on Longest Circular Co-Substring
Locality-Sensitive Hashing (LSH) is one of the most popular methods for ...
read it
-
GraphLIME: Local Interpretable Model Explanations for Graph Neural Networks
Graph structured data has wide applicability in various domains such as ...
read it
-
Robust Speaker Recognition Using Speech Enhancement And Attention Model
In this paper, a novel architecture for speaker recognition is proposed ...
read it
-
Multi-task Sentence Encoding Model for Semantic Retrieval in Question Answering Systems
Question Answering (QA) systems are used to provide proper responses to ...
read it
-
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model
In this paper, a hierarchical attention network to generate utterance-le...
read it
-
Contextual Joint Factor Acoustic Embeddings
Embedding acoustic information into fixed length representations is of i...
read it
-
Improving Robustness In Speaker Identification Using A Two-Stage Attention Model
In this paper a novel framework to tackle speaker recognition using a tw...
read it
-
Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging
Environmental audio tagging is a newly proposed task to predict the pres...
read it
-
Hierarchical learning for DNN-based acoustic scene classification
In this paper, we present a deep neural network (DNN)-based acoustic sce...
read it
-
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Environmental audio tagging aims to predict only the presence or absence...
read it
-
Fully DNN-based Multi-label regression for audio tagging
Acoustic event detection for content analysis in most cases relies on lo...
read it