Jun Xiao

research

∙ 09/18/2023

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation

Audio-visual video segmentation (AVVS) aims to generate pixel-level maps...

0 Kexin Li, et al. ∙

research

∙ 07/30/2023

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

Video-based scene graph generation (VidSGG) is an approach that aims to ...

0 Wenqing Wang, et al. ∙

research

∙ 07/27/2023

Improved Neural Radiance Fields Using Pseudo-depth and Fusion

Since the advent of Neural Radiance Fields, novel view synthesis has rec...

0 Jingliang Li, et al. ∙

research

∙ 05/24/2023

Mitigating Biased Activation in Weakly-supervised Object Localization via Counterfactual Learning

In this paper, we focus on an under-explored issue of biased activation ...

0 Feifei Shao, et al. ∙

research

∙ 03/23/2023

Taking A Closer Look at Visual Relation: Unbiased Video Scene Graph Generation with Decoupled Label Learning

Current video-based scene graph generation (VidSGG) methods have been fo...

0 Wenqing Wang, et al. ∙

research

∙ 01/03/2023

Further Improving Weakly-supervised Object Localization via Causal Knowledge Distillation

Weakly-supervised object localization aims to indicate the category as w...

0 Feifei Shao, et al. ∙

research

∙ 08/13/2022

DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis

In recent years, supervised or unsupervised learning-based MVS methods a...

21 Jingliang Li, et al. ∙

research

∙ 08/04/2022

Multi-scale Sampling and Aggregation Network For High Dynamic Range Imaging

High dynamic range (HDR) imaging is a fundamental problem in image proce...

0 Jun Xiao, et al. ∙

research

∙ 08/03/2022

Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation

Recently, increasing efforts have been focused on Weakly Supervised Scen...

11 Xingchen Li, et al. ∙

research

∙ 08/02/2022

Unified Normalization for Accelerating and Stabilizing Transformers

Solid results from Transformers have made them prevailing architectures ...

0 Qiming Yang, et al. ∙

research

∙ 07/27/2022

NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation

Nearly all existing scene graph generation (SGG) models have overlooked ...

2 Lin Li, et al. ∙

research

∙ 07/22/2022

Rethinking the Reference-based Distinctive Image Captioning

Distinctive Image Captioning (DIC) – generating distinctive captions tha...

0 Yangjun Mao, et al. ∙

research

∙ 07/20/2022

Explicit Image Caption Editing

Given an image and a reference caption, the image caption editing task a...

0 Zhen Wang, et al. ∙

research

∙ 07/18/2022

Rethinking Data Augmentation for Robust Visual Question Answering

Data Augmentation (DA) – generating extra training samples beyond origin...

0 Long Chen, et al. ∙

research

∙ 07/06/2022

Learning Regularized Multi-Scale Feature Flow for High Dynamic Range Imaging

Reconstructing ghosting-free high dynamic range (HDR) images of dynamic ...

0 Qian Ye, et al. ∙

research

∙ 05/31/2022

A Knowledge-Enhanced Adversarial Model for Cross-lingual Structured Sentiment Analysis

Structured sentiment analysis, which aims to extract the complex semanti...

0 Qi Zhang, et al. ∙

research

∙ 04/25/2022

Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives

Reasoning about causal and temporal event relations in videos is a new d...

7 Shaoning Xiao, et al. ∙

research

∙ 04/16/2022

Bidirectional Self-Training with Multiple Anisotropic Prototypes for Domain Adaptive Semantic Segmentation

A thriving trend for domain adaptive segmentation endeavors to generate ...

5 Yulei Lu, et al. ∙

research

∙ 03/22/2022

DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts

Limited by the computational efficiency and accuracy, generating complex...

0 Yidi Li, et al. ∙

research

∙ 02/25/2022

Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning

The expensive annotation cost is notoriously known as a main constraint ...

9 Feifei Shao, et al. ∙

research

∙ 12/29/2021

ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation

Depth estimation is a crucial step for 3D reconstruction with panorama i...

10 Chuanqing Zhuang, et al. ∙

research

∙ 12/08/2021

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

Today's VidSGG models are all proposal-based methods, i.e., they first g...

0 Kaifeng Gao, et al. ∙

research

∙ 12/02/2021

Consensus Graph Representation Learning for Better Grounded Image Captioning

The contemporary visual captioning models frequently hallucinate objects...

0 Wenqiao Zhang, et al. ∙

research

∙ 12/02/2021

Relational Graph Learning for Grounded Video Description Generation

Grounded video description (GVD) encourages captioning models to attend ...

0 Wenqiao Zhang, et al. ∙

research

∙ 11/09/2021

Unified Group Fairness on Federated Learning

Federated learning (FL) has emerged as an important machine learning par...

0 Fengda Zhang, et al. ∙

research

∙ 10/03/2021

Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering

Today's VQA models still tend to capture superficial linguistic correlat...

0 Long Chen, et al. ∙

research

∙ 09/22/2021

Natural Language Video Localization with Learnable Moment Proposals

Given an untrimmed video and a natural language query, Natural Language ...

0 Shaoning Xiao, et al. ∙

research

∙ 08/19/2021

Video Relation Detection via Tracklet based Visual Transformer

Video Visual Relation Detection (VidVRD), has received significant atten...

0 Kaifeng Gao, et al. ∙

research

∙ 08/19/2021

Progressive and Selective Fusion Network for High Dynamic Range Imaging

This paper considers the problem of generating an HDR image of a scene f...

0 Qian Ye, et al. ∙

research

∙ 06/01/2021

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning

Centralized Training with Decentralized Execution (CTDE) has been a popu...

0 Jiahui Li, et al. ∙

research

∙ 05/26/2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e.,...

0 Feifei Shao, et al. ∙

research

∙ 05/12/2021

VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching

The prevailing framework for matching multimodal inputs is based on a tw...

0 Wenbo Ma, et al. ∙

research

∙ 04/21/2021

Improving Weakly-supervised Object Localization via Causal Intervention

The recent emerged weakly supervised object localization (WSOL) methods ...

18 Feifei Shao, et al. ∙

research

∙ 04/15/2021

Efficient Ring-topology Decentralized Federated Learning with Deep Generative Models for Industrial Artificial Intelligent

By leveraging deep learning based technologies, the data-driven based ap...

0 Zhao Wang, et al. ∙

research

∙ 03/22/2021

Human-like Controllable Image Captioning with Verb-specific Semantic Roles

Controllable Image Captioning (CIC) – generating image descriptions foll...

0 Long Chen, et al. ∙

research

∙ 03/15/2021

Boundary Proposal Network for Two-Stage Natural Language Video Localization

We aim to address the problem of Natural Language Video Localization (NL...

0 Shaoning Xiao, et al. ∙

research

∙ 12/18/2020

ROBY: Evaluating the Robustness of a Deep Model by its Decision Boundaries

With the successful application of deep learning models in many real-wor...

0 Jinyin Chen, et al. ∙

research

∙ 10/21/2020

GFL: A Decentralized Federated Learning Framework Based On Blockchain

Due to people's emerging concern about data privacy, federated learning(...

0 Yifan Hu, et al. ∙

research

∙ 10/18/2020

Federated Unsupervised Representation Learning

To leverage enormous unlabeled data on distributed edge devices, we form...

0 Fengda Zhang, et al. ∙

research

∙ 09/03/2020

Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding

The prevailing framework for solving referring expression grounding is b...

2 Long Chen, et al. ∙

research

∙ 08/11/2020

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling

Visual Storytelling (VIST) is a task to tell a narrative story about a c...

7 Jiacheng Li, et al. ∙

research

∙ 07/29/2020

Accurate 2D soft segmentation of medical image via SoftGAN network

Accurate 2D lung nodules segmentation from medical Computed Tomography (...

0 Changwei Wang, et al. ∙

research

∙ 07/09/2020

Deep Multi-task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing

Multi-task learning is an effective learning strategy for deep-learning-...

0 Rui Zhao, et al. ∙

research

∙ 05/26/2020

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation

Fashion outfit recommendation has attracted increasing attentions from o...

0 Xingchen Li, et al. ∙

research

∙ 03/14/2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Despite Visual Question Answering (VQA) has realized impressive progress...

5 Long Chen, et al. ∙

research

∙ 03/03/2020

Evaluation Framework For Large-scale Federated Learning

Federated learning is proposed as a machine learning setting to enable d...

0 Lifeng Liu, et al. ∙

research

∙ 07/01/2019

Weak Supervision Enhanced Generative Network for Question Generation

Automatic question generation according to an answer within the given pa...

0 Yutong Wang, et al. ∙

research

∙ 04/22/2019

Galaxy Learning – A Position Paper

The recent rapid development of artificial intelligence (AI, mainly driv...

0 Chao Wu, et al. ∙

research

∙ 12/06/2018

Scene Dynamics: Counterfactual Critic Multi-Agent Training for Scene Graph Generation

Scene graphs -- objects as nodes and visual relationships as edges -- de...

6 Long Chen, et al. ∙

research

∙ 10/24/2018

Textually Guided Ranking Network for Attentional Image Retweet Modeling

Retweet prediction is a challenging problem in social media sites (SMS)....

0 Zhou Zhao, et al. ∙

