Jinxiang Liu

Featured Co-authors

Yu Wang
276 publications
Qi Tian
238 publications
Ya Zhang
123 publications
Yanfeng Wang
79 publications
Siheng Chen
75 publications
Weidi Xie
75 publications
Jianlong Chang
20 publications
Chen Ju
13 publications
Chaofan Ma
9 publications
Fei Zhang
9 publications
Peisen Zhao
9 publications

research

∙ 07/25/2023

Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation

The goal of the audio-visual segmentation (AVS) task is to segment the s...

0 Jinxiang Liu, et al. ∙

research

∙ 05/18/2023

Annotation-free Audio-Visual Segmentation

The objective of Audio-Visual Segmentation (AVS) is to localise the soun...

12 Jinxiang Liu, et al. ∙

research

∙ 03/17/2023

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Learning from a large corpus of data, pre-trained models have achieved i...

0 Chaofan Ma, et al. ∙

research

∙ 02/20/2023

Constraint and Union for Partially-Supervised Temporal Sentence Grounding

Temporal sentence grounding aims to detect the event timestamps describe...

0 Chen Ju, et al. ∙

research

∙ 12/19/2022

Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) learns to detect a...

0 Chen Ju, et al. ∙

research

∙ 06/26/2022

Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation

We present a simple yet effective self-supervised framework for audio-vi...

6 Jinxiang Liu, et al. ∙

research

∙ 09/24/2021

A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer

Human pose transfer has typically been modeled as a 2D image-to-image tr...

0 Jinxiang Liu, et al. ∙
