Shuhui Wang

research

∙ 09/11/2023

Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval

Current research on cross-modal retrieval is mostly English-oriented, as...

0 Yabing Wang, et al. ∙

research

∙ 08/14/2023

Orthogonal Temporal Interpolation for Zero-Shot Video Recognition

Zero-shot video recognition (ZSVR) is a task that aims to recognize vide...

0 Yan Zhu, et al. ∙

research

∙ 03/30/2023

ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing

Recent studies have shown that higher accuracy on ImageNet usually leads...

0 Xiaodan Li, et al. ∙

research

∙ 02/01/2023

Stable Attribute Group Editing for Reliable Few-shot Image Generation

Few-shot image generation aims to generate data of an unseen category ba...

0 Guanqi Ding, et al. ∙

research

∙ 11/22/2022

The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation

Few-shot image generation is a challenging task since it aims to generat...

0 Lingxiao Li, et al. ∙

research

∙ 08/05/2022

First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume

In neuroimaging analysis, functional magnetic resonance imaging (fMRI) c...

11 Wei Dai, et al. ∙

research

∙ 07/26/2022

Multi-Attention Network for Compressed Video Referring Object Segmentation

Referring video object segmentation aims to segment the object referred ...

0 Weidong Chen, et al. ∙

research

∙ 07/18/2022

Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

Weakly supervised Referring Expression Grounding (REG) aims to ground a ...

0 Xuejing Liu, et al. ∙

research

∙ 04/02/2022

Unsupervised Coherent Video Cartoonization with Perceptual Motion Consistency

In recent years, creative content generations like style transfer and ne...

0 Zhenhuan Liu, et al. ∙

research

∙ 04/02/2022

IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

Conditional image generation is an active research topic including text2...

0 Zhenhuan Liu, et al. ∙

research

∙ 03/16/2022

Attribute Group Editing for Reliable Few-shot Image Generation

Few-shot image generation is a challenging task even using the state-of-...

3 Guanqi Ding, et al. ∙

research

∙ 12/20/2021

General Greedy De-bias Learning

Neural networks often make predictions relying on the spurious correlati...

13 Xinzhe Han, et al. ∙

research

∙ 11/23/2021

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Event analysis in untrimmed videos has attracted increasing attention du...

0 Zhaobo Qi, et al. ∙

research

∙ 11/23/2021

Self-Regulated Learning for Egocentric Video Activity Anticipation

Future activity anticipation is a challenging problem in egocentric visi...

0 Zhaobo Qi, et al. ∙

research

∙ 11/19/2021

DVCFlow: Modeling Information Flow Towards Human-like Video Captioning

Dense video captioning (DVC) aims to generate multi-sentence description...

0 Xu Yan, et al. ∙

research

∙ 10/11/2021

Semi-Autoregressive Image Captioning

Current state-of-the-art approaches for image captioning typically adopt...

0 Xu Yan, et al. ∙

research

∙ 07/27/2021

Greedy Gradient Ensemble for Robust Visual Question Answering

Language bias is a critical issue in Visual Question Answering (VQA), wh...

0 Xinzhe Han, et al. ∙

research

∙ 07/13/2021

Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation

Due to the domain discrepancy in visual domain adaptation, the performan...

0 Shuhao Cui, et al. ∙

research

∙ 07/07/2021

Learning Invariant Representation with Consistency and Diversity for Semi-supervised Source Hypothesis Transfer

Semi-supervised domain adaptation (SSDA) aims to solve tasks in target d...

0 Xiaodong Wang, et al. ∙

research

∙ 04/19/2021

Mining Latent Structures for Multimedia Recommendation

Multimedia content is of predominance in the modern Web era. Investigati...

9 Jinghao Zhang, et al. ∙

research

∙ 03/04/2021

QAIR: Practical Query-efficient Black-Box Attacks for Image Retrieval

We study the query-based attack against image retrieval to evaluate its ...

0 Xiaodan Li, et al. ∙

research

∙ 12/10/2020

Composite Adversarial Attacks

Adversarial attack is a technique for deceiving Machine Learning (ML) mo...

0 Xiaofeng Mao, et al. ∙

research

∙ 11/30/2020

Heuristic Domain Adaptation

In visual domain adaptation (DA), separating the domain-specific charact...

0 Shuhao Cui, et al. ∙

research

∙ 10/16/2020

Semantic Editing On Segmentation Map Via Multi-Expansion Loss

Semantic editing on segmentation map has been proposed as an intermediat...

0 Jianfeng He, et al. ∙

research

∙ 08/25/2020

Label Decoupling Framework for Salient Object Detection

To get more accurate saliency maps, recent methods mainly focus on aggre...

8 Jun Wei, et al. ∙

research

∙ 08/11/2020

Sharp Multiple Instance Learning for DeepFake Video Detection

With the rapid development of facial manipulation techniques, face forge...

2 Xiaodan Li, et al. ∙

research

∙ 04/10/2020

Parsing-based View-aware Embedding Network for Vehicle Re-Identification

Vehicle Re-Identification is to find images of the same vehicle from var...

0 Dechao Meng, et al. ∙

research

∙ 04/10/2020

State-Relabeling Adversarial Active Learning

Active learning is to design label-efficient algorithms by sampling the ...

0 Beichen Zhang, et al. ∙

research

∙ 03/30/2020

Gradually Vanishing Bridge for Adversarial Domain Adaptation

In unsupervised domain adaptation, rich domain-specific characteristics ...

0 Shuhao Cui, et al. ∙

research

∙ 03/27/2020

Towards Discriminability and Diversity: Batch Nuclear-norm Maximization under Label Insufficient Situations

The learning of the deep networks largely relies on the data with human-...

0 Shuhao Cui, et al. ∙

research

∙ 11/26/2019

F3Net: Fusion, Feedback and Focus for Salient Object Detection

Most of existing salient object detection models have achieved great pro...

17 Jun Wei, et al. ∙

research

∙ 09/05/2019

Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding

Weakly supervised referring expression grounding (REG) aims at localizin...

10 Xuejing Liu, et al. ∙

research

∙ 08/28/2019

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

Weakly supervised referring expression grounding aims at localizing the ...

11 Xuejing Liu, et al. ∙

research

∙ 08/14/2019

Harmonized Multimodal Learning with Gaussian Process Latent Variable Models

Multimodal learning aims to discover the relationship between multiple m...

8 Guoli Song, et al. ∙

research

∙ 04/18/2019

Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization

We address the unsupervised open domain recognition (UODR) problem, wher...

0 Junbao Zhuo, et al. ∙

research

∙ 03/05/2018

Less Is More: Picking Informative Frames for Video Captioning

In video captioning task, the best practice has been achieved by attenti...

0 Yangyu Chen, et al. ∙

Shuhui Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro