Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models

05/16/2020
by   Qiang Huang, et al.
0

Many applications of speech technology require more and more audio data. Automatic assessment of the quality of the collected recordings is important to ensure they meet the requirements of the related applications. However, effective and high performing assessment remains a challenging task without a clean reference. In this paper, a novel model for audio quality assessment is proposed by jointly using bidirectional long short-term memory and an attention mechanism. The former is to mimic a human auditory perception ability to learn information from a recording, and the latter is to further discriminate interferences from desired signals by highlighting target related features. To evaluate our proposed approach, the TIMIT dataset is used and augmented by mixing with various natural sounds. In our experiments, two tasks are explored. The first task is to predict an utterance quality score, and the second is to identify where an anomalous distortion takes place in a recording. The obtained results show that the use of our proposed approach outperforms a strong baseline method and gains about 5 metrics, Linear Correlation Coefficient and Spearman Rank Correlation Coefficient, and F1.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

Nowadays, most of the objective speech quality assessment tools (e.g., p...
research
11/04/2022

CCATMos: Convolutional Context-aware Transformer Network for Non-intrusive Speech Quality Assessment

Speech quality assessment has been a critical component in many voice co...
research
10/21/2020

Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network

Anomalous audio in speech recordings is often caused by speaker voice di...
research
10/26/2020

Improving pronunciation assessment via ordinal regression with anchored reference samples

Sentence level pronunciation assessment is important for Computer Assist...
research
12/12/2016

An Attention-Driven Approach of No-Reference Image Quality Assessment

In this paper, we present a novel method of no-reference image quality a...
research
11/12/2022

Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings

Automatic speech quality assessment is essential for audio researchers, ...
research
06/27/2022

Audio Similarity is Unreliable as a Proxy for Audio Quality

Many audio processing tasks require perceptual assessment. However, the ...

Please sign up or login with your details

Forgot password? Click here to reset