Zongxin Yang

research

∙ 09/18/2023

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation

Audio-visual video segmentation (AVVS) aims to generate pixel-level maps...

0 Kexin Li, et al. ∙

research

∙ 09/10/2023

Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation

Audio-driven talking-head synthesis is a popular research topic for virt...

0 Yuan Gan, et al. ∙

research

∙ 08/25/2023

Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation

Tracking any given object(s) spatially and temporally is a common purpos...

0 Yuanyou Xu, et al. ∙

research

∙ 07/31/2023

JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery

In this study, we focus on the problem of 3D human mesh recovery from a ...

0 Jiahao Li, et al. ∙

research

∙ 07/23/2023

TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering

In this paper, we focus on the task of generalizable neural human render...

0 Xiao Pan, et al. ∙

research

∙ 07/13/2023

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion

Large-scale pre-trained vision-language models allow for the zero-shot t...

0 Shuo Huang, et al. ∙

research

∙ 07/05/2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking

The Associating Objects with Transformers (AOT) framework has exhibited ...

0 Yuanyou Xu, et al. ∙

research

∙ 07/05/2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation

The Associating Objects with Transformers (AOT) framework has exhibited ...

0 Jiahao Li, et al. ∙

research

∙ 07/03/2023

Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition

In real-world scenarios, collected and annotated data often exhibit the ...

0 Chao Liang, et al. ∙

research

∙ 06/10/2023

Shuffled Autoregression For Motion Interpolation

This work aims to provide a deep-learning solution for the motion interp...

0 Shuo Huang, et al. ∙

research

∙ 05/17/2023

Pyramid Diffusion Models For Low-light Image Enhancement

Recovering noise-covered details from low-light images is challenging, a...

0 Dewei Zhou, et al. ∙

research

∙ 05/11/2023

Segment and Track Anything

This report presents a framework called Segment And Track Anything (SAMT...

0 Yangming Cheng, et al. ∙

research

∙ 05/08/2023

Video Object Segmentation in Panoptic Wild Scenes

In this paper, we introduce semi-supervised video object segmentation (V...

0 Yuanyou Xu, et al. ∙

research

∙ 03/26/2023

Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation

Video-based 3D human pose and shape estimations are evaluated by intra-f...

0 Xiaolong Shen, et al. ∙

research

∙ 10/18/2022

Decoupling Features in Hierarchical Propagation for Video Object Segmentation

This paper focuses on developing a more effective method of hierarchical...

9 Zongxin Yang, et al. ∙

research

∙ 07/26/2022

V^2L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval

Product retrieval is of great importance in the ecommerce domain. This p...

4 Wenhao Wang, et al. ∙

research

∙ 03/29/2022

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation

In this paper, we focus on the unsupervised Video Object Segmentation (V...

9 Xiao Pan, et al. ∙

research

∙ 03/22/2022

Associating Objects with Scalable Transformers for Video Object Segmentation

This paper investigates how to realize better and more efficient embeddi...

5 Zongxin Yang, et al. ∙

research

∙ 06/04/2021

Associating Objects with Transformers for Video Object Segmentation

This paper investigates how to realize better and more efficient embeddi...

0 Zongxin Yang, et al. ∙

research

∙ 06/02/2021

Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation

Referring video object segmentation (RVOS) aims to segment video objects...

0 Chen Liang, et al. ∙

research

∙ 04/08/2021

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

Compared to 2D object bounding-box labeling, it is very difficult for hu...

0 Zongxin Yang, et al. ∙

research

∙ 10/13/2020

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

This paper investigates the principles of embedding learning to tackle t...

6 Zongxin Yang, et al. ∙

research

∙ 03/18/2020

Collaborative Video Object Segmentation by Foreground-Background Integration

In this paper, we investigate the principles of embedding learning betwe...

11 Zongxin Yang, et al. ∙

research

∙ 12/29/2019

Very Long Natural Scenery Image Prediction by Outpainting

Comparing to image inpainting, image outpainting receives less attention...

6 Zongxin Yang, et al. ∙

research

∙ 09/25/2019

Gated Channel Transformation for Visual Recognition

In this work, we propose a generally applicable transformation unit for ...

0 Zongxin Yang, et al. ∙

Zongxin Yang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro