Personalized Audio Quality Preference Prediction

02/16/2023
by   Chung-Che Wang, et al.
0

This paper proposes to use both audio input and subject information to predict the personalized preference of two audio segments with the same content in different qualities. A siamese network is used to compare the inputs and predict the preference. Several different structures for each side of the siamese network are investigated, and an LDNet with PANNs' CNN6 as the encoder and a multi-layer perceptron block as the decoder outperforms a baseline model using only audio input the most, where the overall accuracy grows from 77.56 to 78.04 information, including age, gender, and the specifications of headphones or earphones, is more effective than using only a part of them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2020

Learning Audio Embeddings with User Listening Data for Content-based Music Recommendation

Personalized recommendation on new track releases has always been a chal...
research
10/30/2017

Content-based Representations of audio using Siamese neural networks

In this paper, we focus on the problem of content-based retrieval for au...
research
07/08/2020

SiENet: Siamese Expansion Network for Image Extrapolation

Different from image inpainting, image outpainting has relative less con...
research
06/22/2023

Siamese SIREN: Audio Compression with Implicit Neural Representations

Implicit Neural Representations (INRs) have emerged as a promising metho...
research
03/04/2019

On measuring the iconicity of a face

For a given identity in a face dataset, there are certain iconic images ...
research
04/04/2019

Preference Neural Network

This paper proposes a preference neural network (PNN) to address the pro...
research
10/22/2022

GCT: Gated Contextual Transformer for Sequential Audio Tagging

Audio tagging aims to assign predefined tags to audio clips to indicate ...

Please sign up or login with your details

Forgot password? Click here to reset