Xu Yang

research

∙ 09/15/2023

FedDCSR: Federated Cross-domain Sequential Recommendation via Disentangled Representation Learning

Cross-domain Sequential Recommendation (CSR) which leverages user sequen...

0 Hongyu Zhang, et al. ∙

research

∙ 07/06/2023

Temporal Difference Learning for High-Dimensional PIDEs with Jumps

In this paper, we propose a deep learning framework for solving high-dim...

1 Liwei Lu, et al. ∙

research

∙ 06/17/2023

Genes in Intelligent Agents

Training intelligent agents in Reinforcement Learning (RL) is much more ...

0 Fu Feng, et al. ∙

research

∙ 05/24/2023

Exploring Diverse In-Context Configurations for Image Captioning

After discovering that Language Models (LMs) can be good in-context few-...

0 Xu Yang, et al. ∙

research

∙ 05/03/2023

Learngene: Inheriting Condensed Knowledge from the Ancestry Model to Descendant Models

During the continuous evolution of one organism's ancestry, its genes ac...

0 Qiufeng Wang, et al. ∙

research

∙ 05/03/2023

Transforming Visual Scene Graphs to Image Captions

We propose to Transform Scene Graphs (TSG) into more descriptive caption...

0 Xu Yang, et al. ∙

research

∙ 04/04/2023

SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering

Visual question answering (VQA) is a critical multimodal task in which a...

0 Xinyao Shu, et al. ∙

research

∙ 03/13/2023

Spatial Attention and Syntax Rule Enhanced Tree Decoder for Offine Handwritten Mathematical Expression Recognition

Offline Handwritten Mathematical Expression Recognition (HMER) has been ...

0 Zihao Lin, et al. ∙

research

∙ 01/05/2023

Adaptively Clustering Neighbor Elements for Image Captioning

We design a novel global-local Transformer named Ada-ClustFormer (ACF) t...

0 Zihua Wang, et al. ∙

research

∙ 01/05/2023

Learning Trajectory-Word Alignments for Video-Language Tasks

Aligning objects with words plays a critical role in Image-Language BERT...

0 Xu Yang, et al. ∙

research

∙ 11/19/2022

Spikeformer: A Novel Architecture for Training High-Performance Low-Latency Spiking Neural Network

Spiking neural networks (SNNs) have made great progress on both performa...

0 Yudong Li, et al. ∙

research

∙ 10/12/2022

Projective Transformation Rectification for Camera-captured Chest X-ray Photograph Interpretation with Synthetic Data

Automatic interpretation on smartphone-captured chest X-ray (CXR) photog...

0 Chak Fong Chong, et al. ∙

research

∙ 10/04/2022

Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning

Humans tend to decompose a sentence into different parts like sth do sth...

0 Xu Yang, et al. ∙

research

∙ 08/20/2022

MemoNav: Selecting Informative Memories for Visual Navigation

Image-goal navigation is a challenging task, as it requires the agent to...

0 Hongxin Li, et al. ∙

research

∙ 06/29/2022

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen composi...

0 Xiangyu Li, et al. ∙

research

∙ 06/27/2022

iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition

Online exams via video conference software like Zoom have been adopted i...

0 Xu Yang, et al. ∙

research

∙ 04/21/2022

Weakly Aligned Feature Fusion for Multimodal Object Detection

To achieve accurate and robust object detection in the real-world scenar...

7 Lu Zhang, et al. ∙

research

∙ 04/21/2022

Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation

Segmenting unseen objects is a crucial ability for the robot since it ma...

0 Lu Zhang, et al. ∙

research

∙ 12/17/2021

Towards End-to-End Image Compression and Analysis with Transformers

We propose an end-to-end image compression and analysis model with Trans...

17 Yuanchao Bai, et al. ∙

research

∙ 11/22/2021

Auto-Encoding Score Distribution Regression for Action Quality Assessment

Action quality assessment (AQA) from videos is a challenging vision task...

7 Boyu Zhang, et al. ∙

research

∙ 11/03/2021

Schwarz Waveform Relaxation Physics-Informed Neural Networks for Solving Advection-Diffusion-Reaction Equations

This paper develops a physics-informed neural network (PINN) based on th...

0 Emmanuel Lorin, et al. ∙

research

∙ 10/28/2021

Sliding Sequential CVAE with Time Variant Socially-aware Rethinking for Trajectory Prediction

Pedestrian trajectory prediction is a key technology in many application...

0 Hao Zhou, et al. ∙

research

∙ 10/13/2021

Seismic Tomography with Random Batch Gradient Reconstruction

Seismic tomography solves high-dimensional optimization problems to imag...

0 Yixiao Hu, et al. ∙

research

∙ 10/08/2021

How Can AI Recognize Pain and Express Empathy

Sensory and emotional experiences such as pain and empathy are relevant ...

0 Siqi Cao, et al. ∙

research

∙ 08/24/2021

Auto-Parsing Network for Image Captioning and Visual Question Answering

We propose an Auto-Parsing Network (APN) to discover and exploit the inp...

0 Xu Yang, et al. ∙

research

∙ 07/26/2021

Towards Unbiased Visual Emotion Recognition via Causal Intervention

Although much progress has been made in visual emotion recognition, rese...

0 Yuedong Chen, et al. ∙

research

∙ 07/12/2021

Deep unfitted Nitsche method for elliptic interface problems

In this paper, we propose a deep unfitted Nitsche method for computing e...

0 Hailong Guo, et al. ∙

research

∙ 03/09/2021

Doubly Contrastive Deep Clustering

Deep clustering successfully provides more effective features than conve...

21 Zhiyuan Dang, et al. ∙

research

∙ 03/05/2021

Causal Attention for Vision-Language Tasks

We present a novel attention mechanism: Causal Attention (CATT), to remo...

0 Xu Yang, et al. ∙

research

∙ 12/31/2020

Incremental Embedding Learning via Zero-Shot Translation

Modern deep learning methods have achieved great success in machine lear...

0 Kun Wei, et al. ∙

research

∙ 09/30/2020

Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

Change Captioning is a task that aims to describe the difference between...

0 Xiangxi Shi, et al. ∙

research

∙ 07/25/2020

Unfitted Nitsche's method for computing band structures in phononic crystals with impurities

In this paper, we propose an unfitted Nitsche's method to compute the ba...

0 Hailong Guo, et al. ∙

research

∙ 06/03/2020

Learning to Scan: A Deep Reinforcement Learning Approach for Personalized Scanning in CT Imaging

Computed Tomography (CT) takes X-ray measurements on the subjects to rec...

0 Ziju Shen, et al. ∙

research

∙ 03/09/2020

Deconfounded Image Captioning: A Causal Retrospect

The dataset bias in vision-language tasks is becoming one of the main pr...

0 Xu Yang, et al. ∙

research

∙ 02/12/2020

Semi-classical limit for the varying-mass Schrödinger equation with random inhomogeneities

The varying-mass Schrödinger equation (VMSE) has been successfully appli...

0 Shi Chen, et al. ∙

research

∙ 08/19/2019

Unfitted Nitsche's method for computing wave modes in topological materials

In this paper, we propose an unfitted Nitsche's method for computing wav...

0 Hailong Guo, et al. ∙

research

∙ 05/24/2019

mu-Forcing: Training Variational Recurrent Autoencoders for Text Generation

It has been previously observed that training Variational Recurrent Auto...

0 Dayiheng Liu, et al. ∙

research

∙ 04/30/2019

Deep Spectral Clustering using Dual Autoencoder Network

The clustering methods have recently absorbed even-increasing attention ...

0 Xu Yang, et al. ∙

research

∙ 04/18/2019

Learning to Collocate Neural Modules for Image Captioning

We do not speak word by word from scratch; our brain quickly structures ...

0 Xu Yang, et al. ∙

research

∙ 03/26/2019

Unpaired Image Captioning via Scene Graph Alignments

Deep neural networks have achieved great success on the image captioning...

0 Jiuxiang Gu, et al. ∙

research

∙ 01/09/2019

The Cross-Modality Disparity Problem in Multispectral Pedestrian Detection

Aggregating extra features of novel modality brings great advantages for...

0 Lu Zhang, et al. ∙

research

∙ 12/06/2018

Auto-Encoding Graphical Inductive Bias for Descriptive Image Captioning

We propose Scene Graph Auto-Encoder (SGAE) that incorporates the languag...

16 Xu Yang, et al. ∙

research

∙ 12/06/2018

Auto-Encoding Scene Graphs for Image Captioning

We propose Scene Graph Auto-Encoder (SGAE) that incorporates the languag...

0 Xu Yang, et al. ∙

research

∙ 08/01/2018

Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features

Due to the fact that it is prohibitively expensive to completely annotat...

2 Xu Yang, et al. ∙

research

∙ 11/04/2014

A Weighted Common Subgraph Matching Algorithm

We propose a weighted common subgraph (WCS) matching algorithm to find t...

0 Xu Yang, et al. ∙

Xu Yang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro