Multi-task learning (MTL) aims to improve the performance of a primary t...
Advances in passive acoustic monitoring and machine learning have led to...
With the rise of foundation models, a new artificial intelligence paradi...
This survey paper provides a comprehensive overview of the recent
advanc...
Language use has been shown to correlate with depression, but large-scal...
After the inception of emotion recognition or affective computing, it ha...
Despite recent advancements in speech emotion recognition (SER) models,
...
The employment of foundation models is steadily expanding, especially wi...
We conducted a data collection on the basis of the Google AudioSet datab...
The ACM Multimedia 2023 Computational Paralinguistics Challenge addresse...
Over the past few decades, multimodal emotion recognition has made remar...
ChatGPT has shown the potential of emerging general artificial intellige...
Driven by the need for larger and more diverse datasets to pre-train and...
Recent years have seen a rapid increase in digital medicine research in ...
Heart sound auscultation has been demonstrated to be beneficial in clini...
Charisma is considered as one's ability to attract and potentially also
...
Telling stories is an integral part of human communication which can evo...
Recent work has reported that AI classifiers trained on audio recordings...
Since early in the coronavirus disease 2019 (COVID-19) pandemic, there h...
The UK COVID-19 Vocal Audio Dataset is designed for the training and
eva...
Speech emotion recognition (SER) has been a popular research topic in
hu...
Speech emotion recognition (SER) is the task of recognising human's emot...
Speech is the fundamental mode of human communication, and its synthesis...
The Barlow Twins self-supervised learning objective requires neither neg...
Humour is a substantial element of human affect and cognition. Its autom...
Vocal bursts play an important role in communicating affect, making them...
Chronic obstructive pulmonary disease (COPD) causes lung inflammation an...
Despite the recent progress in speech emotion recognition (SER),
state-o...
Recognising continuous emotions and action unit (AU) intensities from fa...
More than two years after its outbreak, the COVID-19 pandemic continues ...
In this paper, we propose the Redundancy Reduction Twins Network (RRTN),...
In this work, we explore a novel few-shot personalisation architecture f...
Automatically recognising apparent emotions from face and voice is hard,...
The ACM Multimedia 2022 Computational Paralinguistics Challenge addresse...
Previous studies have shown the correlation between sensor data collecte...
Although running is a common leisure activity and a core training regime...
Stress is a major threat to well-being that manifests in a variety of
ph...
Digital health applications are becoming increasingly important for asse...
Video-to-speech synthesis (also known as lip-to-speech) refers to the
tr...
Detecting COVID-19 from audio signals, such as breathing and coughing, c...
Respiratory sound classification is an important tool for remote screeni...
Emotional voice conversion (EVC) focuses on converting a speech utteranc...
In this paper, we present our submission to 3rd Affective Behavior Analy...
Emotion and a broader range of affective driver states can be a life dec...
Among the seventeen Sustainable Development Goals (SDGs) proposed within...
Due to the development of machine learning and speech processing, speech...
What audio embedding approach generalizes best to a wide range of downst...
Professional athletes increasingly use automated analysis of meta- and s...
The COVID-19 pandemic has caused massive humanitarian and economic damag...
Algorithms and Machine Learning (ML) are increasingly affecting everyday...