Björn W. Schuller

research

∙ 09/18/2023

Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits

Multi-task learning (MTL) aims to improve the performance of a primary t...

0 Xiangheng He, et al. ∙

research

∙ 09/15/2023

Exploring Meta Information for Audio-based Zero-shot Bird Classification

Advances in passive acoustic monitoring and machine learning have led to...

0 Alexander Gebhard, et al. ∙

research

∙ 08/26/2023

A Wide Evaluation of ChatGPT on Affective Computing Tasks

With the rise of foundation models, a new artificial intelligence paradi...

0 Mostafa M. Amin, et al. ∙

research

∙ 08/24/2023

Sparks of Large Audio Models: A Survey and Outlook

This survey paper provides a comprehensive overview of the recent advanc...

0 Siddique Latif, et al. ∙

research

∙ 08/22/2023

Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model

Language use has been shown to correlate with depression, but large-scal...

0 Yuezhou Zhang, et al. ∙

research

∙ 08/21/2023

Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models

After the inception of emotion recognition or affective computing, it ha...

0 Zixing Zhang, et al. ∙

research

∙ 07/12/2023

Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

Despite recent advancements in speech emotion recognition (SER) models, ...

0 Siddique Latif, et al. ∙

research

∙ 07/06/2023

Can ChatGPT's Responses Boost Traditional Natural Language Processing?

The employment of foundation models is steadily expanding, especially wi...

0 Mostafa M. Amin, et al. ∙

research

∙ 05/23/2023

Happy or Evil Laughter? Analysing a Database of Natural Audio Samples

We conducted a data collection on the basis of the Google AudioSet datab...

0 Aljoscha Düsterhöft, et al. ∙

research

∙ 04/28/2023

The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share Requests

The ACM Multimedia 2023 Computational Paralinguistics Challenge addresse...

1 Björn W. Schuller, et al. ∙

research

∙ 04/18/2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning

Over the past few decades, multimodal emotion recognition has made remar...

0 Zheng Lian, et al. ∙

research

∙ 03/03/2023

Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT

ChatGPT has shown the potential of emerging general artificial intellige...

0 Mostafa M. Amin, et al. ∙

research

∙ 03/01/2023

audb – Sharing and Versioning of Audio and Annotation Data in Python

Driven by the need for larger and more diverse datasets to pre-train and...

0 Hagen Wierstorf, et al. ∙

research

∙ 01/25/2023

HEAR4Health: A blueprint for making computer audition a staple of modern healthcare

Recent years have seen a rapid increase in digital medicine research in ...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 01/23/2023

A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era

Heart sound auscultation has been demonstrated to be beneficial in clini...

0 Zhao Ren, et al. ∙

research

∙ 12/31/2022

Computational Charisma – A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence

Charisma is considered as one's ability to attract and potentially also ...

0 Björn W. Schuller, et al. ∙

research

∙ 12/21/2022

Automatic Emotion Modelling in Written Stories

Telling stories is an integral part of human communication which can evo...

0 Lukas Christ, et al. ∙

research

∙ 12/15/2022

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Recent work has reported that AI classifiers trained on audio recordings...

0 Harry Coppock, et al. ∙

research

∙ 12/15/2022

Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

Since early in the coronavirus disease 2019 (COVID-19) pandemic, there h...

0 Davide Pigoli, et al. ∙

research

∙ 12/15/2022

A large-scale and PCR-referenced vocal audio dataset for COVID-19

The UK COVID-19 Vocal Audio Dataset is designed for the training and eva...

0 Jobie Budd, et al. ∙

research

∙ 10/26/2022

Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning

Speech emotion recognition (SER) has been a popular research topic in hu...

0 Yi Chang, et al. ∙

research

∙ 10/26/2022

Fast Yet Effective Speech Emotion Recognition with Self-distillation

Speech emotion recognition (SER) is the task of recognising human's emot...

0 Zhao Ren, et al. ∙

research

∙ 10/06/2022

An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era

Speech is the fundamental mode of human communication, and its synthesis...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 09/28/2022

Audio Barlow Twins: Self-Supervised Audio Representation Learning

The Barlow Twins self-supervised learning objective requires neither neg...

10 Jonah Anton, et al. ∙

research

∙ 09/28/2022

Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results

Humour is a substantial element of human affect and cognition. Its autom...

0 Lukas Christ, et al. ∙

research

∙ 09/15/2022

Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts

Vocal bursts play an important role in communicating affect, making them...

0 Vincent Karas, et al. ∙

research

∙ 07/26/2022

Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease

Chronic obstructive pulmonary disease (COPD) causes lung inflammation an...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 07/12/2022

Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition

Despite the recent progress in speech emotion recognition (SER), state-o...

0 Siddique Latif, et al. ∙

research

∙ 07/03/2022

Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?

Recognising continuous emotions and action unit (AU) intensities from fa...

0 Mani Kumar Tellamekala, et al. ∙

research

∙ 06/20/2022

COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection

More than two years after its outbreak, the COVID-19 pandemic continues ...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 06/18/2022

Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression

In this paper, we propose the Redundancy Reduction Twins Network (RRTN),...

0 Xin Jing, et al. ∙

research

∙ 06/14/2022

Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction

In this work, we explore a novel few-shot personalisation architecture f...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 06/12/2022

COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition

Automatically recognising apparent emotions from face and voice is hard,...

0 Mani Kumar Tellamekala, et al. ∙

research

∙ 05/13/2022

The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, Mosquitoes

The ACM Multimedia 2022 Computational Paralinguistics Challenge addresse...

0 Björn W. Schuller, et al. ∙

research

∙ 05/10/2022

Depression Diagnosis and Forecast based on Mobile Phone Sensor Data

Previous studies have shown the correlation between sensor data collecte...

0 Xiangheng He, et al. ∙

research

∙ 05/09/2022

Fatigue Prediction in Outdoor Running Conditions using Audio Data

Although running is a common leisure activity and a core training regime...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 05/09/2022

Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features

Stress is a major threat to well-being that manifests in a variety of ph...

7 Andreas Triantafyllopoulos, et al. ∙

research

∙ 05/06/2022

Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting

Digital health applications are becoming increasingly important for asse...

0 Alexander Kathan, et al. ∙

research

∙ 05/04/2022

SVTS: Scalable Video-to-Speech Synthesis

Video-to-speech synthesis (also known as lip-to-speech) refers to the tr...

11 Rodrigo Mira, et al. ∙

research

∙ 03/31/2022

A Temporal-oriented Broadcast ResNet for COVID-19 Detection

Detecting COVID-19 from audio signals, such as breathing and coughing, c...

0 Xin Jing, et al. ∙

research

∙ 03/30/2022

Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis

Respiratory sound classification is an important tool for remote screeni...

0 Yi Chang, et al. ∙

research

∙ 03/29/2022

An Overview Analysis of Sequence-to-Sequence Emotional Voice Conversion

Emotional voice conversion (EVC) focuses on converting a speech utteranc...

0 Zijiang Yang, et al. ∙

research

∙ 03/24/2022

Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition

In this paper, we present our submission to 3rd Affective Behavior Analy...

0 Vincent Karas, et al. ∙

research

∙ 03/14/2022

Audiovisual Affect Assessment and Autonomous Automobiles: Applications

Emotion and a broader range of affective driver states can be a life dec...

0 Björn W. Schuller, et al. ∙

research

∙ 03/10/2022

Climate Change Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet

Among the seventeen Sustainable Development Goals (SDGs) proposed within...

0 Björn W. Schuller, et al. ∙

research

∙ 03/09/2022

Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition

Due to the development of machine learning and speech processing, speech...

0 Yi Chang, et al. ∙

research

∙ 03/06/2022

HEAR 2021: Holistic Evaluation of Audio Representations

What audio embedding approach generalizes best to a wide range of downst...

17 Joseph Turian, et al. ∙

research

∙ 02/18/2022

Predicting Sex and Stroke Success – Computer-aided Player Grunt Analysis in Tennis Matches

Professional athletes increasingly use automated analysis of meta- and s...

0 Lukas Stappen, et al. ∙

research

∙ 02/17/2022

A Summary of the ComParE COVID-19 Challenges

The COVID-19 pandemic has caused massive humanitarian and economic damag...

13 Harry Coppock, et al. ∙

research

∙ 02/02/2022

Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems

Algorithms and Machine Learning (ML) are increasingly affecting everyday...

0 Mostafa M. Mohamed, et al. ∙
