Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning

10/26/2022
by   Yi Chang, et al.
0

Speech emotion recognition (SER) has been a popular research topic in human-computer interaction (HCI). As edge devices are rapidly springing up, applying SER to edge devices is promising for a huge number of HCI applications. Although deep learning has been investigated to improve the performance of SER by training complex models, the memory space and computational capability of edge devices represents a constraint for embedding deep learning models. We propose a neural structured learning (NSL) framework through building synthesized graphs. An SER model is trained on a source dataset and used to build graphs on a target dataset. A lightweight model is then trained with the speech samples and graphs together as the input. Our experiments demonstrate that training a lightweight SER model on the target dataset with speech samples and graphs can not only produce small SER models, but also enhance the model performance over models with speech samples only.

READ FULL TEXT
research
04/16/2018

Multi-Modal Emotion recognition on IEMOCAP Dataset using Deep Learning

Emotion recognition has become an important field of research in Human C...
research
11/23/2020

Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer

Automatic classification of speech commands has revolutionized human com...
research
05/01/2023

Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

Non-speech emotion recognition has a wide range of applications includin...
research
06/25/2021

EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference

Collaborative inference enables resource-constrained edge devices to mak...
research
06/04/2020

A Siamese Neural Network with Modified Distance Loss For Transfer Learning in Speech Emotion Recognition

Automatic emotion recognition plays a significant role in the process of...
research
08/22/2021

Training and Profiling a Pediatric Emotion Recognition Classifier on Mobile Devices

Implementing automated emotion recognition on mobile devices could provi...
research
05/09/2023

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild” Edge Applications

Unsupervised speech models are becoming ubiquitous in the speech and mac...

Please sign up or login with your details

Forgot password? Click here to reset