Björn Schuller

research

∙ 07/05/2023

Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions

We introduce two rule-based models to modify the prosody of speech synth...

0 Felix Burkhardt, et al. ∙

research

∙ 06/29/2023

Speech-based Age and Gender Prediction with Transformers

We report on the curation of several publicly available datasets for age...

0 Felix Burkhardt, et al. ∙

research

∙ 05/23/2023

Improving Speech Emotion Recognition Performance using Differentiable Architecture Search

Speech Emotion Recognition (SER) is a critical enabler of emotion-aware ...

0 Thejan Rajapakshe, et al. ∙

research

∙ 05/19/2023

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

In this paper, we propose to utilise diffusion models for data augmentat...

0 Ibrahim Malik, et al. ∙

research

∙ 03/09/2023

hierarchical network with decoupled knowledge distillation for speech emotion recognition

The goal of Speech Emotion Recognition (SER) is to enable computers to r...

0 Ziping Zhao, et al. ∙

research

∙ 11/14/2022

AI-Based Emotion Recognition: Promise, Peril, and Prescriptions for Prosocial Path

Automated emotion recognition (AER) technology can detect humans' emotio...

0 Siddique Latif, et al. ∙

research

∙ 10/27/2022

Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression

This is the Proceedings of the ACII Affective Vocal Bursts Workshop and ...

0 Alice Baird, et al. ∙

research

∙ 07/14/2022

Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

This is the Proceedings of the ICML Expressive Vocalization (ExVo) Compe...

0 Alice Baird, et al. ∙

research

∙ 07/07/2022

The ACII 2022 Affective Vocal Bursts Workshop Competition: Understanding a critically understudied modality of emotional expression

The ACII Affective Vocal Bursts Workshop Competition is focused on u...

12 Alice Baird, et al. ∙

research

∙ 06/26/2022

Data Augmentation for Dementia Detection in Spoken Language

Dementia is a growing problem as our society ages, and detection methods...

0 Anna Hlédiková, et al. ∙

research

∙ 06/15/2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Emotion classification of speech and assessment of the emotion strength ...

0 Rui Liu, et al. ∙

research

∙ 05/03/2022

The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

The ICML Expressive Vocalization (ExVo) Competition is focused on unders...

21 Alice Baird, et al. ∙

research

∙ 04/19/2022

Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition

Despite the recent advancement in speech emotion recognition (SER) withi...

0 Siddique Latif, et al. ∙

research

∙ 09/16/2021

A Machine Learning Framework for Automatic Prediction of Human Semen Motility

In this paper, human semen samples from the visem dataset collected by t...

28 Sandra Ottl, et al. ∙

research

∙ 05/04/2021

An Estimation of Online Video User Engagement from Features of Continuous Emotions

Portraying emotion and trustworthiness is known to increase the appeal o...

1 Lukas Stappen, et al. ∙

research

∙ 04/19/2021

Fitbeat: COVID-19 Estimation based on Wristband Heart Rate

This study investigates the potential of deep learning methods to identi...

0 Shuo Liu, et al. ∙

research

∙ 03/10/2021

EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition

In this manuscript, the topic of multi-corpus Speech Emotion Recognition...

0 Maurice Gerczuk, et al. ∙

research

∙ 02/18/2021

An Enhanced Adversarial Network with Combined Latent Features for Spatio-Temporal Facial Affect Estimation in the Wild

Affective Computing has recently attracted the attention of the research...

0 Decky Aspandi Latif, et al. ∙

research

∙ 01/15/2021

The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements

Truly real-life data presents a strong, but exciting challenge for senti...

10 Lukas Stappen, et al. ∙

research

∙ 01/12/2021

Personalized Federated Deep Learning for Pain Estimation From Face Images

Standard machine learning approaches require centralizing the users' dat...

37 Ognjen Rudovic, et al. ∙

research

∙ 05/18/2020

An Overview on Audio, Signal, Speech, Language Processing for COVID-19

Recently, there has been an increased attention towards innovating, enha...

0 Gauri Deshpande, et al. ∙

research

∙ 03/05/2020

Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data

Recent improvements in Generative Adversarial Neural Networks (GANs) hav...

22 Kazi Nazmul Haque, et al. ∙

research

∙ 02/03/2020

Adversarial-based neural network for affect estimations in the wild

There is a growing interest in affective computing research nowadays giv...

10 Decky Aspandi Latif, et al. ∙

research

∙ 02/03/2020

Adversarial-based neural networks for affect estimations in the wild

There is a growing interest in affective computing research nowadays giv...

0 Decky Aspandi Latif, et al. ∙

research

∙ 11/16/2019

N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System

N-HANS is a Python toolkit for in-the-wild audio enhancement, including ...

0 Shuo Liu, et al. ∙

research

∙ 10/18/2019

Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions

The Poisson equation is commonly encountered in engineering, including i...

57 Ali Girayhan Özbay, et al. ∙

research

∙ 08/30/2019

On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction

In this article, we study laughter found in child-robot interaction wher...

0 Anton Batliner, et al. ∙

research

∙ 07/23/2019

EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings

Despite remarkable advances in emotion recognition, they are severely re...

0 Jing Han, et al. ∙

research

∙ 07/10/2019

AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mi...

0 Fabien Ringeval, et al. ∙

research

∙ 06/24/2019

Single-Channel Speech Separation with Auxiliary Speaker Embeddings

We present a novel source separation model to decompose asingle-channel ...

0 Shuo Liu, et al. ∙

research

∙ 06/07/2019

Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach

Human behavior expression and experience are inherently multi-modal, and...

9 Ognjen Rudovic, et al. ∙

research

∙ 04/15/2019

Synthesising 3D Facial Motion from "In-the-Wild" Speech

Synthesising 3D facial motion from speech is a crucial problem manifesti...

12 Panagiotis Tzirakis, et al. ∙

research

∙ 03/17/2019

Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability

The ethical decisions behind the acquisition and analysis of audio, vide...

1 Alice Baird, et al. ∙

research

∙ 03/13/2019

Voice command generation using Progressive Wavegans

Generative Adversarial Networks (GANs) have become exceedingly popular i...

7 Thomas Wiest, et al. ∙

research

∙ 02/14/2019

On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error

The concordance correlation coefficient (CCC) is one of the most widely ...

4 Vedhas Pandit, et al. ∙

research

∙ 01/09/2019

SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

Natural human-computer interaction and audio-visual human behaviour sens...

8 Jean Kossaifi, et al. ∙

research

∙ 10/26/2018

Scaling Speech Enhancement in Unseen Environments with Noise Embeddings

We address the problem of speech enhancement generalisation to unseen en...

0 Gil Keren, et al. ∙

research

∙ 09/26/2018

Dynamic Difficulty Awareness Training for Continuous Emotion Prediction

Time-continuous emotion prediction has become an increasingly compelling...

0 Zixing Zhang, et al. ∙

research

∙ 09/21/2018

Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives

Over the past few years, adversarial training has become an extremely ac...

0 Jing Han, et al. ∙

research

∙ 05/03/2018

Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification

The performance of speaker-related systems usually degrades heavily in p...

0 Siyang Song, et al. ∙

research

∙ 05/03/2018

audEERING's approach to the One-Minute-Gradual Emotion Challenge

This paper describes audEERING's submissions as well as additional evalu...

0 Andreas Triantafyllopoulos, et al. ∙

research

∙ 04/29/2018

Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond

Automatic understanding of human affect using visual signals is of great...

0 Dimitrios Kollias, et al. ∙

research

∙ 03/26/2018

Calibrated Prediction Intervals for Neural Network Regressors

Ongoing developments in neural network models are continually advancing ...

0 Gil Keren, et al. ∙

research

∙ 02/07/2018

Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora

Scientific disciplines, such as Behavioural Psychology, Anthropology and...

0 Johannes Wagner, et al. ∙

research

∙ 02/04/2018

Personalized Machine Learning for Robot Perception of Affect and Engagement in Autism Therapy

Robots have great potential to facilitate future therapies for children ...

0 Ognjen Rudovic, et al. ∙

research

∙ 01/10/2018

Weakly Supervised One-Shot Detection with Attention Siamese Networks

We consider the task of weakly supervised one-shot detection. In this ta...

0 Gil Keren, et al. ∙

research

∙ 12/12/2017

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks

auDeep is a Python toolkit for deep unsupervised representation learning...

0 Michael Freitag, et al. ∙

research

∙ 07/27/2017

Learning Audio Sequence Representations for Acoustic Event Classification

Acoustic Event Classification (AEC) has become a significant task for ma...

0 Zixing Zhang, et al. ∙

research

∙ 05/30/2017

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments

Eliminating the negative effect of non-stationary environmental noise is...

0 Zixing Zhang, et al. ∙

research

∙ 05/29/2017

The Principle of Logit Separation

We consider neural network training, in applications in which there are ...

0 Gil Keren, et al. ∙

Björn Schuller

Featured Co-authors

Sign in with Google

Consider DeepAI Pro