The misuse of real photographs with conflicting image captions in news i...
In real-world scenarios, reinforcement learning under sparse-reward
syne...
The sparsity of extrinsic rewards poses a serious challenge for reinforc...
Aggregating messages is a key component for the communication of multi-a...
Wound image segmentation is a critical component for the clinical diagno...
Despite the remarkable efforts that have been made, the classification of gigapixels
w...
We present an approach to learn voice-face representations from the talk...
Breast cancer is the most common malignancy in women, being responsible ...
Automated tagging of video advertisements has been a critical yet challe...
In this paper, we introduce the first Challenge on Multi-modal Aerial Vi...
Recently, deep reinforcement learning (RL) algorithms have made great
pr...
Recently, deep Reinforcement Learning (RL) algorithms have achieved
dram...
Ultrasound tongue imaging is widely used for speech production research,...
We review the AIM 2020 challenge on virtual image relighting and illumin...
In speech production research, different imaging modalities have been
em...
High quality labeled datasets have allowed deep learning to achieve
impr...
As an important component of multimedia analysis tasks, audio classifica...
The aim of multi-agent reinforcement learning systems is to provide
inte...
Multi-face alignment aims to identify geometry structures of multiple hu...
A challenge in speech production research is to predict future tongue
mo...
Previous attempts at data augmentation are designed manually, and the
a...
Sound event detection (SED) is typically posed as a supervised learning
...
Audio tagging aims to infer descriptive labels from audio clips. Audio
t...
Valuable training data is often owned by independent organizations and
l...
Audio tagging has attracted increasing attention since the last decade and h...
Acoustic scene classification is an intricate problem for a machine. As ...
Motivated by the fact that characteristics of different sound classes ar...
Motivated by the fact that characteristics of different sound classes ar...
The issue of electricity theft detection has drawn considerable attention during las...
Audio scene classification, the problem of predicting class labels of au...
During the last decades, the number of new full-reference image quality
...
This article describes the development of a platform designed to visuali...
This article describes a contour-based 3D tongue deformation visualizati...