The Sillwood Technologies System for the VoiceMOS Challenge 2022

04/08/2022
by   Jiameng Gao, et al.
0

In this paper we describe our entry for the VoiceMOS Challenge 2022 for both the main and out-of-domain (OOD) track of the competition. Our system is based on finetuning pre-trained self-supervised waveform prediction models, while improving its generalisation ability through stochastic weight averaging. Further, we use influence functions to identity possible low-quality data within the training set to further increase our model's performance for the OOD track. Our system ranked 5th and joint 7th for the main track and OOD track, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2022

Fusion of Self-supervised Learned Models for MOS Prediction

We participated in the mean opinion score (MOS) prediction challenge, 20...
research
08/17/2023

The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023

This paper is the system description of the DKU-MSXF System for the trac...
research
03/10/2021

Team Phoenix at WASSA 2021: Emotion Analysis on News Stories with Pre-Trained Language Models

Emotion is fundamental to humanity. The ability to perceive, understand ...
research
04/04/2020

Pre-Trained and Attention-Based Neural Networks for Building Noetic Task-Oriented Dialogue Systems

The NOESIS II challenge, as the Track 2 of the 8th Dialogue System Techn...
research
03/23/2022

Prompt-based Pre-trained Model for Personality and Interpersonal Reactivity Prediction

This paper describes the LingJing team's method to the Workshop on Compu...
research
04/05/2022

UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022

We present the UTokyo-SaruLab mean opinion score (MOS) prediction system...
research
08/14/2023

The Sound Demixing Challenge 2023 x2013 Cinematic Demixing Track

This paper summarizes the cinematic demixing (CDX) track of the Sound De...

Please sign up or login with your details

Forgot password? Click here to reset