Kele Xu

research

∙ 06/05/2023

Cheap-fake Detection with LLM using Prompt Engineering

The misuse of real photographs with conflicting image captions in news i...

0 Guangyang Wu, et al. ∙

research

∙ 08/24/2022

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning

In real-world scenarios, reinforcement learning under sparse-reward syne...

0 Zijian Gao, et al. ∙

research

∙ 08/24/2022

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration

The sparsity of extrinsic rewards poses a serious challenge for reinforc...

0 Zijian Gao, et al. ∙

research

∙ 08/10/2022

Diversifying Message Aggregation in Multi-Agent Communication via Normalized Tensor Nuclear Norm Regularization

Aggregating messages is a key component for the communication of multi-a...

0 Yuanzhao Zhai, et al. ∙

research

∙ 07/12/2022

Wound Segmentation with Dynamic Illumination Correction and Dual-view Semantic Fusion

Wound image segmentation is a critical component for the clinical diagno...

0 Honghui Liu, et al. ∙

research

∙ 07/12/2022

Trusted Multi-Scale Classification Framework for Whole Slide Image

Despite remarkable efforts been made, the classification of gigapixels w...

0 Ming Feng, et al. ∙

research

∙ 04/28/2022

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

We present an approach to learn voice-face representations from the talk...

7 Boqing Zhu, et al. ∙

research

∙ 11/08/2021

HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization

Breast cancer is the most common malignancy in women, being responsible ...

10 Eduardo Conde-Sousa, et al. ∙

research

∙ 08/02/2021

Multimodal Feature Fusion for Video Advertisements Tagging Via Stacking Ensemble

Automated tagging of video advertisements has been a critical yet challe...

0 Qingsong Zhou, et al. ∙

research

∙ 07/02/2021

NTIRE 2021 Multi-modal Aerial View Object Classification Challenge

In this paper, we introduce the first Challenge on Multi-modal Aerial Vi...

9 Jerrick Liu, et al. ∙

research

∙ 05/25/2021

KnowSR: Knowledge Sharing among Homogeneous Agents in Multi-agent Reinforcement Learning

Recently, deep reinforcement learning (RL) algorithms have made great pr...

0 Zijian Gao, et al. ∙

research

∙ 03/27/2021

KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning

Recently, deep Reinforcement Learning (RL) algorithms have achieved dram...

0 Zijian Gao, et al. ∙

research

∙ 01/27/2021

Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image

Ultrasound tongue imaging is widely used for speech production research,...

0 Kele Xu, et al. ∙

research

∙ 09/27/2020

AIM 2020: Scene Relighting and Illumination Estimation Challenge

We review the AIM 2020 challenge on virtual image relighting and illumin...

0 Majed El Helou, et al. ∙

research

∙ 08/06/2020

Quantification of Transducer Misalignment in Ultrasound Tongue Imaging

In speech production research, different imaging modalities have been em...

0 Tamás Gábor Csapó, et al. ∙

research

∙ 07/16/2020

Audio Tagging by Cross Filtering Noisy Labels

High quality labeled datasets have allowed deep learning to achieve impr...

0 Boqing Zhu, et al. ∙

research

∙ 02/22/2020

Multi-Representation Knowledge Distillation For Audio Classification

As an important component of multimedia analysis tasks, audio classifica...

0 Liang Gao, et al. ∙

research

∙ 10/05/2019

Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems

The aim of multi-agent reinforcement learning systems is to provide inte...

0 Mingyang Geng, et al. ∙

research

∙ 04/22/2019

FoxNet: A Multi-face Alignment Method

Multi-face alignment aims to identify geometry structures of multiple hu...

0 Yuxiang WU, et al. ∙

research

∙ 02/19/2019

Predicting tongue motion in unlabeled ultrasound videos using convolutional LSTM neural network

A challenge in speech production research is to predict future tongue mo...

0 Chaojie Zhao, et al. ∙

research

∙ 11/12/2018

Learning data augmentation policies using augmented random search

Previous attempts for data augmentation are designed manually, and the a...

0 Mingyang Geng, et al. ∙

research

∙ 11/01/2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data

Sound event detection (SED) is typically posed as a supervised learning ...

0 Dezhi Wang, et al. ∙

research

∙ 10/30/2018

General audio tagging with ensembling convolutional neural network and statistical features

Audio tagging aims to infer descriptive labels from audio clips. Audio t...

0 Kele Xu, et al. ∙

research

∙ 10/16/2018

Collaborative Deep Learning Across Multiple Data Centers

Valuable training data is often owned by independent organizations and l...

0 Kele Xu, et al. ∙

research

∙ 08/12/2018

Sample Mixed-Based Data Augmentation for Domestic Audio Tagging

Audio tagging has attracted increasing attention since last decade and h...

0 Shengyun Wei, et al. ∙

research

∙ 06/12/2018

Sample Dropout for Audio Scene Classification Using Multi-Scale Dense Connected Convolutional Neural Network

Acoustic scene classification is an intricate problem for a machine. As ...

0 Dawei Feng, et al. ∙

research

∙ 05/24/2018

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features

Motivated by the fact that characteristics of different sound classes ar...

0 Boqing Zhu, et al. ∙

research

∙ 05/24/2018

Environmental Sound Classification Based on Multi-temporal Resolution CNN Network Combining with Multi-level Features

Motivated by the fact that characteristics of different sound classes ar...

0 Boqing Zhu, et al. ∙

research

∙ 05/24/2018

Multi-Scale DenseNet-Based Electricity Theft Detection

Electricity theft detection issue has drawn lots of attention during las...

0 Bo Li, et al. ∙

research

∙ 05/18/2018

Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network

Audio scene classification, the problem of predicting class labels of au...

0 Kele Xu, et al. ∙

research

∙ 01/10/2017

Full-reference image quality assessment-based B-mode ultrasound image similarity measure

During the last decades, the number of new full-reference image quality ...

0 Kele Xu, et al. ∙

research

∙ 05/19/2016

Development of a 3D tongue motion visualization platform based on ultrasound image sequences

This article describes the development of a platform designed to visuali...

0 Kele Xu, et al. ∙

research

∙ 05/19/2016

Contour-based 3d tongue motion visualization using ultrasound image sequences

This article describes a contour-based 3D tongue deformation visualizati...

0 Kele Xu, et al. ∙

Kele Xu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro