
-
Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image
Ultrasound tongue imaging is widely used for speech production research,...
read it
-
AIM 2020: Scene Relighting and Illumination Estimation Challenge
We review the AIM 2020 challenge on virtual image relighting and illumin...
read it
-
Quantification of Transducer Misalignment in Ultrasound Tongue Imaging
In speech production research, different imaging modalities have been em...
read it
-
Audio Tagging by Cross Filtering Noisy Labels
High quality labeled datasets have allowed deep learning to achieve impr...
read it
-
Multi-Representation Knowledge Distillation For Audio Classification
As an important component of multimedia analysis tasks, audio classifica...
read it
-
Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems
The aim of multi-agent reinforcement learning systems is to provide inte...
read it
-
FoxNet: A Multi-face Alignment Method
Multi-face alignment aims to identify geometry structures of multiple hu...
read it
-
Predicting tongue motion in unlabeled ultrasound videos using convolutional LSTM neural network
A challenge in speech production research is to predict future tongue mo...
read it
-
Learning data augmentation policies using augmented random search
Previous attempts for data augmentation are designed manually, and the a...
read it
-
Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data
Sound event detection (SED) is typically posed as a supervised learning ...
read it
-
General audio tagging with ensembling convolutional neural network and statistical features
Audio tagging aims to infer descriptive labels from audio clips. Audio t...
read it
-
Collaborative Deep Learning Across Multiple Data Centers
Valuable training data is often owned by independent organizations and l...
read it
-
Sample Mixed-Based Data Augmentation for Domestic Audio Tagging
Audio tagging has attracted increasing attention since last decade and h...
read it
-
Sample Dropout for Audio Scene Classification Using Multi-Scale Dense Connected Convolutional Neural Network
Acoustic scene classification is an intricate problem for a machine. As ...
read it
-
Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features
Motivated by the fact that characteristics of different sound classes ar...
read it
-
Environmental Sound Classification Based on Multi-temporal Resolution CNN Network Combining with Multi-level Features
Motivated by the fact that characteristics of different sound classes ar...
read it
-
Multi-Scale DenseNet-Based Electricity Theft Detection
Electricity theft detection issue has drawn lots of attention during las...
read it
-
Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network
Audio scene classification, the problem of predicting class labels of au...
read it
-
Full-reference image quality assessment-based B-mode ultrasound image similarity measure
During the last decades, the number of new full-reference image quality ...
read it
-
Development of a 3D tongue motion visualization platform based on ultrasound image sequences
This article describes the development of a platform designed to visuali...
read it
-
Contour-based 3d tongue motion visualization using ultrasound image sequences
This article describes a contour-based 3D tongue deformation visualizati...
read it