b'Xilin Chen'

research

∙ 08/25/2023

Dual Compensation Residual Networks for Class Imbalanced Learning

Learning generalizable representation and classifier for class-imbalance...

0 Ruibing Hou, et al. ∙

research

∙ 06/19/2023

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Large language models (LLMs) have demonstrated remarkable prowess in lan...

0 Shaolei Zhang, et al. ∙

research

∙ 04/24/2023

Function-Consistent Feature Distillation

Feature distillation makes the student mimic the intermediate features o...

0 Dongyang Liu, et al. ∙

research

∙ 03/09/2023

Diversity-Measurable Anomaly Detection

Reconstruction-based anomaly detection models achieve their purpose by s...

0 Wenrui Liu, et al. ∙

research

∙ 04/14/2022

Clothes-Changing Person Re-identification with RGB Modality Only

The key to address clothes-changing person re-identification (re-id) is ...

0 Xinqian Gu, et al. ∙

research

∙ 11/08/2021

SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning

Teaching machines to recognize a new category based on few training samp...

0 Fengyuan Yang, et al. ∙

research

∙ 08/05/2021

UniCon: Unified Context Network for Robust Active Speaker Detection

We introduce a new efficient framework, the Unified Context Network (Uni...

1 Yuanhang Zhang, et al. ∙

research

∙ 07/20/2021

Locality-aware Channel-wise Dropout for Occluded Face Recognition

Face recognition remains a challenging task in unconstrained scenarios, ...

4 Mingjie He, et al. ∙

research

∙ 06/24/2021

Feature Completion for Occluded Person Re-Identification

Person re-identification (reID) plays an important role in computer visi...

0 Ruibing Hou, et al. ∙

research

∙ 04/18/2021

Continuity-Discrimination Convolutional Neural Network for Visual Object Tracking

This paper proposes a novel model, named Continuity-Discrimination Convo...

6 Shen Li, et al. ∙

research

∙ 04/06/2021

Visual Alignment Constraint for Continuous Sign Language Recognition

Vision-based Continuous Sign Language Recognition (CSLR) aims to recogni...

0 Yuecong Min, et al. ∙

research

∙ 12/03/2020

Attributes Aware Face Generation with Generative Adversarial Networks

Recent studies have shown remarkable success in face image generations. ...

7 Zheng Yuan, et al. ∙

research

∙ 11/15/2020

Learn an Effective Lip Reading Model without Pains

Lip reading, also known as visual speech recognition, aims to recognize ...

0 Dalu Feng, et al. ∙

research

∙ 09/02/2020

IAUnet: Global Context-Aware Feature Learning for Person Re-Identification

Person re-identification (reID) by CNNs based networks has achieved favo...

0 Ruibing Hou, et al. ∙

research

∙ 07/18/2020

Temporal Complementary Learning for Video Person Re-Identification

This paper proposes a Temporal Complementary Learning Network that extra...

0 Ruibing Hou, et al. ∙

research

∙ 07/17/2020

Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation

Scene graph aims to faithfully reveal humans' perception of image conten...

0 Wenbin Wang, et al. ∙

research

∙ 07/16/2020

Appearance-Preserving 3D Convolution for Video-based Person Re-identification

Due to the imperfect person detection results and posture changes, tempo...

0 Xinqian Gu, et al. ∙

research

∙ 05/08/2020

Synchronous Bidirectional Learning for Multilingual Lip Reading

Lip reading has received increasing attention in recent years. This pape...

0 Mingshuang Luo, et al. ∙

research

∙ 04/29/2020

Single-Side Domain Generalization for Face Anti-Spoofing

Existing domain generalization methods for face anti-spoofing endeavor t...

0 Yunpei Jia, et al. ∙

research

∙ 04/13/2020

Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training

Although two-stage object detectors have continuously advanced the state...

0 Hongkai Zhang, et al. ∙

research

∙ 04/09/2020

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

Image-level weakly supervised semantic segmentation is a challenging pro...

52 Yude Wang, et al. ∙

research

∙ 04/04/2020

Cross-domain Face Presentation Attack Detection via Multi-domain Disentangled Representation Learning

Face presentation attack detection (PAD) has been an urgent problem to b...

3 Guoqing Wang, et al. ∙

research

∙ 03/31/2020

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Answering questions that require reading texts in an image is challengin...

0 Difei Gao, et al. ∙

research

∙ 03/13/2020

Mutual Information Maximization for Effective Lip Reading

Lip reading has received an increasing research interest in recent years...

0 Xing Zhao, et al. ∙

research

∙ 03/12/2020

Deformation Flow Based Two-Stream Network for Lip Reading

Lip reading is the task of recognizing the speech content by analyzing m...

0 Jingyun Xiao, et al. ∙

research

∙ 03/09/2020

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading

Lip-reading aims to infer the speech content from the lip movement seque...

0 Mingshuang Luo, et al. ∙

research

∙ 03/06/2020

Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition

Recent advances in deep learning have heightened interest among research...

0 Yuan-Hang Zhang, et al. ∙

research

∙ 02/15/2020

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

We propose UniViLM: a Unified Video and Language pre-training Model for ...

16 Huaishao Luo, et al. ∙

research

∙ 02/13/2020

Emotion Recognition for In-the-wild Videos

This paper is a brief introduction to our submission to the seven basic ...

0 Hanyu Liu, et al. ∙

research

∙ 02/07/2020

M^3T: Multi-Modal Continuous Valence-Arousal Estimation in the Wild

This report describes a multi-modal multi-task (M^3T) approach underlyin...

0 Yuan-Hang Zhang, et al. ∙

research

∙ 11/04/2019

Deep Heterogeneous Hashing for Face Video Retrieval

Retrieving videos of a particular person with face image as a query via ...

22 Shishi Qiao, et al. ∙

research

∙ 11/04/2019

FCSR-GAN: Joint Face Completion and Super-resolution via Multi-task Learning

Combined variations containing low-resolution and occlusion often presen...

12 Jiancheng Cai, et al. ∙

research

∙ 11/01/2019

Learning-based Real-time Detection of Intrinsic Reflectional Symmetry

Reflectional symmetry is ubiquitous in nature. While extrinsic reflectio...

4 Yi-Ling Qiao, et al. ∙

research

∙ 10/30/2019

LaplacianNet: Learning on 3D Meshes with Laplacian Encoding and Pooling

3D models are commonly used in computer vision and graphics. With the wi...

23 Yi-Ling Qiao, et al. ∙

research

∙ 10/25/2019

RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

Heart rate (HR) is an important physiological signal that reflects the p...

38 Xuesong Niu, et al. ∙

research

∙ 10/24/2019

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition

Facial action units (AUs) recognition is essential for emotion analysis ...

0 Xuesong Niu, et al. ∙

research

∙ 10/17/2019

Cross Attention Network for Few-shot Classification

Few-shot classification aims to recognize unlabeled samples from unseen ...

0 Ruibing Hou, et al. ∙

research

∙ 10/11/2019

Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval

Image-text retrieval of natural scenes has been a popular research topic...

26 Sijin Wang, et al. ∙

research

∙ 09/24/2019

Object-Contextual Representations for Semantic Segmentation

In this paper, we address the problem of semantic segmentation and focus...

13 Yuhui Yuan, et al. ∙

research

∙ 09/09/2019

Self-supervised Scale Equivariant Network for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation has attracted much research inte...

13 Yude Wang, et al. ∙

research

∙ 08/16/2019

Transferable Contrastive Network for Generalized Zero-Shot Learning

Zero-shot learning (ZSL) is a challenging problem that aims to recognize...

0 Huajie Jiang, et al. ∙

research

∙ 08/11/2019

Temporal Knowledge Propagation for Image-to-Video Person Re-identification

In many scenarios of Person Re-identification (Re-ID), the gallery set c...

7 Xinqian Gu, et al. ∙

research

∙ 08/08/2019

From Two Graphs to N Questions: A VQA Dataset for Compositional Reasoning on Vision and Commonsense

Visual Question Answering (VQA) is a challenging task for evaluating the...

4 Difei Gao, et al. ∙

research

∙ 07/29/2019

Interlaced Sparse Self-Attention for Semantic Segmentation

In this paper, we present a so-called interlaced sparse self-attention a...

2 Lang Huang, et al. ∙

research

∙ 07/19/2019

Interaction-and-Aggregation Network for Person Re-identification

Person re-identification (reID) benefits greatly from deep convolutional...

5 Ruibing Hou, et al. ∙

research

∙ 07/19/2019

VRSTC: Occlusion-Free Video Person Re-Identification

Video person re-identification (re-ID) plays an important role in survei...

3 Ruibing Hou, et al. ∙

research

∙ 07/16/2019

Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection

Recent researches attempt to improve the detection performance by adopti...

5 Hongkai Zhang, et al. ∙

research

∙ 06/22/2019

Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation

Non-Autoregressive Transformer (NAT) aims to accelerate the Transformer ...

1 Chenze Shao, et al. ∙

research

∙ 04/01/2019

Weakly Supervised Object Detection with Segmentation Collaboration

Weakly supervised object detection aims at learning precise object detec...

0 Xiaoyan Li, et al. ∙

research

∙ 03/31/2019

Fully Learnable Group Convolution for Acceleration of Deep Neural Networks

Benefitted from its great success on many tasks, deep learning is increa...

0 Xijun Wang, et al. ∙

Xilin Chen

Featured Co-authors

Sign in with Google

Consider DeepAI Pro