Gangshan Wu

research

∙ 08/25/2023

Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation

Current prevailing Video Object Segmentation (VOS) methods usually perfo...

0 Jiaming Zhang, et al. ∙

research

∙ 08/09/2023

Robust Object Modeling for Visual Tracking

Object modeling has become a core part of recent tracking frameworks. Cu...

0 Yidong Cai, et al. ∙

research

∙ 07/31/2023

Lightweight Super-Resolution Head for Human Pose Estimation

Heatmap-based methods have become the mainstream method for pose estimat...

0 Haonan Wang, et al. ∙

research

∙ 07/14/2023

MaxSR: Image Super-Resolution Using Improved MaxViT

While transformer models have been demonstrated to be effective for natu...

0 Bincheng Yang, et al. ∙

research

∙ 05/25/2023

MixFormerV2: Efficient Fully Transformer Tracking

Transformer-based trackers have achieved strong accuracy on the standard...

0 Yutao Cui, et al. ∙

research

∙ 04/26/2023

Video Frame Interpolation with Densely Queried Bilateral Correlation

Video Frame Interpolation (VFI) aims to synthesize non-existent intermed...

0 Chang Zhou, et al. ∙

research

∙ 04/17/2023

Efficient Video Action Detection with Token Dropout and Context Refinement

Streaming video clips with large-scale video tokens impede vision transf...

0 Lei Chen, et al. ∙

research

∙ 04/11/2023

SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes

Multi-object tracking in sports scenes plays a critical role in gatherin...

0 Yutao Cui, et al. ∙

research

∙ 03/28/2023

CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection

The relation modeling between actors and scene context advances video ac...

0 Lei Chen, et al. ∙

research

∙ 03/28/2023

LinK: Linear Kernel for LiDAR-based 3D Perception

Extending the success of 2D Large Kernel to 3D perception is challenging...

0 Tao Lu, et al. ∙

research

∙ 03/28/2023

STMixer: A One-Stage Sparse Action Detector

Traditional video action detectors typically adopt the two-stage pipelin...

0 Tao Wu, et al. ∙

research

∙ 03/01/2023

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Effectively extracting inter-frame motion and appearance information is ...

0 Guozhen Zhang, et al. ∙

research

∙ 02/13/2023

CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets

Current RGB-D scene recognition approaches often train two standalone ba...

1 Jiange Yang, et al. ∙

research

∙ 11/30/2022

From Coarse to Fine: Hierarchical Pixel Integration for Lightweight Image Super-Resolution

Image super-resolution (SR) serves as a fundamental tool for the process...

0 Jie Liu, et al. ∙

research

∙ 07/09/2022

Human-centric Spatio-Temporal Video Grounding via the Combination of Mutual Matching Network and TubeDETR

In this technical report, we represent our solution for the Human-centri...

0 Fan Yu, et al. ∙

research

∙ 05/02/2022

APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Classification

Point-cloud-based 3D classification task involves aggregating features f...

0 Tao Lu, et al. ∙

research

∙ 04/18/2022

Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution

Runtime and memory consumption are two important aspects for efficient i...

0 Zongcai Du, et al. ∙

research

∙ 03/21/2022

MixFormer: End-to-End Tracking with Iterative Mixed Attention

Tracking often uses a multi-stage pipeline of feature extraction, target...

0 Yutao Cui, et al. ∙

research

∙ 03/01/2022

Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Generic Boundary Detection (GBD) aims at locating general boundaries tha...

0 Yuhong Wang, et al. ∙

research

∙ 12/31/2021

Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning

Deep convolutional neural networks have been demonstrated to be effectiv...

9 Bin-Cheng Yang, et al. ∙

research

∙ 11/27/2021

AdaDM: Enabling Normalization for Image Super-Resolution

Normalization like Batch Normalization (BN) is a milestone technique to ...

31 Jie Liu, et al. ∙

research

∙ 10/24/2021

A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

The existing few-shot video classification methods often employ a meta-l...

0 Zhenxi Zhu, et al. ∙

research

∙ 09/13/2021

Mutual Supervision for Dense Object Detection

The classification and regression head are both indispensable components...

0 Ziteng Gao, et al. ∙

research

∙ 09/10/2021

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

Temporal grounding aims to localize a video moment which is semantically...

0 Zhenzhi Wang, et al. ∙

research

∙ 09/09/2021

Self Supervision to Distillation for Long-Tailed Visual Recognition

Deep learning has achieved remarkable progress for visual recognition on...

0 Tianhao Li, et al. ∙

research

∙ 08/18/2021

Target Adaptive Context Aggregation for Video Scene Graph Generation

This paper deals with a challenging task of video scene graph generation...

3 Yao Teng, et al. ∙

research

∙ 06/06/2021

SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Three-dimensional face dense alignment and reconstruction in the wild is...

0 Zeyu Ruan, et al. ∙

research

∙ 05/20/2021

Anchor-based Plain Net for Mobile Image Super-Resolution

Along with the rapid development of real-world applications, higher requ...

0 Zongcai Du, et al. ∙

research

∙ 05/16/2021

MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

Spatio-temporal action detection is an important and challenging problem...

0 Yixuan Li, et al. ∙

research

∙ 04/20/2021

MGSampler: An Explainable Sampling Strategy for Video Action Recognition

Frame sampling is a fundamental problem in video action recognition due ...

0 Yuan Zhi, et al. ∙

research

∙ 04/01/2021

Target Transformed Regression for Accurate Tracking

Accurate tracking is still a challenging task due to appearance variatio...

0 Yutao Cui, et al. ∙

research

∙ 02/03/2021

Relaxed Transformer Decoders for Direct Action Proposal Generation

Temporal action proposal generation is an important and challenging task...

0 Jiaqi Tang, et al. ∙

research

∙ 12/18/2020

TDN: Temporal Difference Networks for Efficient Action Recognition

Temporal modeling still remains challenging for action recognition in vi...

0 Limin Wang, et al. ∙

research

∙ 09/24/2020

Residual Feature Distillation Network for Lightweight Image Super-Resolution

Recent advances in single image super-resolution (SISR) explored the pow...

0 Jie Liu, et al. ∙

research

∙ 09/15/2020

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the AIM 2020 challenge on efficient single image supe...

6 Kai Zhang, et al. ∙

research

∙ 07/20/2020

Context-Aware RCNN: A Baseline for Action Detection in Videos

Video action detection approaches usually conduct actor-centric action r...

0 Jianchao Wu, et al. ∙

research

∙ 04/15/2020

Fully Convolutional Online Tracking

Discriminative training has turned out to be effective for robust tracki...

0 Yutao Cui, et al. ∙

research

∙ 01/14/2020

Actions as Moving Points

The existing action tubelet detectors mainly depend on heuristic anchor ...

14 Yixuan Li, et al. ∙

research

∙ 11/23/2019

Simple and Lightweight Human Pose Estimation

Recent research on human pose estimation has achieved significant improv...

0 Zhe Zhang, et al. ∙

research

∙ 08/12/2019

LIP: Local Importance-based Pooling

Spatial downsampling layers are favored in convolutional neural networks...

9 Ziteng Gao, et al. ∙

research

∙ 05/27/2019

Dynamically Visual Disambiguation of Keyword-based Image Search

Due to the high cost of manual annotation, learning directly from the we...

0 Yazhou Yao, et al. ∙

research

∙ 04/28/2019

Translate-to-Recognize Networks for RGB-D Scene Recognition

Cross-modal transfer is helpful to enhance modality-specific discriminat...

4 Dapeng Du, et al. ∙

research

∙ 04/23/2019

Learning Actor Relation Graphs for Group Activity Recognition

Modeling relation between actors is important for recognizing group acti...

0 Jianchao Wu, et al. ∙
