Houqiang Li

research

∙ 08/23/2023

Sign Language Translation with Iterative Prototype

This paper presents IP-SLT, a simple yet effective framework for sign la...

0 Huijie Yao, et al. ∙

research

∙ 08/19/2023

UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding

In the era of Large Language Models (LLMs), tremendous strides have been...

0 Hao Feng, et al. ∙

research

∙ 08/17/2023

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

In fisheye images, rich distinct distortion patterns are regularly distr...

0 Hao Feng, et al. ∙

research

∙ 08/17/2023

Text-Only Training for Visual Storytelling

Visual storytelling aims to generate a narrative based on a sequence of ...

0 Yuechen Wang, et al. ∙

research

∙ 08/16/2023

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Controllable video generation has gained significant attention in recent...

0 Shengming Yin, et al. ∙

research

∙ 08/14/2023

Masked Motion Predictors are Strong 3D Action Representation Learners

In 3D human action recognition, limited supervised data makes it challen...

0 Yunyao Mao, et al. ∙

research

∙ 08/11/2023

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

Recent progress in weakly supervised object detection is featured by a c...

0 Yufei Yin, et al. ∙

research

∙ 08/08/2023

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Reconstructing interacting hands from monocular RGB data is a challengin...

0 Weichao Zhao, et al. ∙

research

∙ 07/17/2023

AltFreezing for More General Video Face Forgery Detection

Existing face forgery detection models try to discriminate fake images b...

0 Zhendong Wang, et al. ∙

research

∙ 06/19/2023

LVVC: A Learned Versatile Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before ...

0 Xihua Sheng, et al. ∙

research

∙ 06/09/2023

Exploring Effective Mask Sampling Modeling for Neural Image Compression

Image compression aims to reduce the information redundancy in images. M...

0 Lin Liu, et al. ∙

research

∙ 06/03/2023

MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

Recent approaches have utilized self-supervised auxiliary tasks as repre...

0 Haolin Song, et al. ∙

research

∙ 05/26/2023

Detect Any Shadow: Segment Anything for Video Shadow Detection

Segment anything model (SAM) has achieved great success in the field of ...

0 Yonghui Wang, et al. ∙

research

∙ 05/16/2023

Hybrid and Collaborative Passage Reranking

In passage retrieval system, the initial passage retrieval results may b...

0 Zongmeng Zhang, et al. ∙

research

∙ 05/08/2023

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

Hand gesture serves as a crucial role during the expression of sign lang...

0 Hezhen Hu, et al. ∙

research

∙ 04/18/2023

Deep Unrestricted Document Image Rectification

In recent years, tremendous efforts have been made on document image rec...

0 Hao Feng, et al. ∙

research

∙ 04/12/2023

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

Recent researches on unsupervised person re-identification (reID) have d...

0 Liping Bao, et al. ∙

research

∙ 03/24/2023

HandNeRF: Neural Radiance Fields for Animatable Interacting Hands

We propose a novel framework to reconstruct accurate appearance and geom...

0 Zhiyang Guo, et al. ∙

research

∙ 03/21/2023

Human Pose as Compositional Tokens

Human pose is typically represented by a coordinate vector of body joint...

0 Zigang Geng, et al. ∙

research

∙ 03/16/2023

DIRE for Diffusion-Generated Image Detection

Diffusion models have shown remarkable success in visual synthesis, but ...

0 Zhendong Wang, et al. ∙

research

∙ 03/16/2023

Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation

We study unsupervised domain adaptation (UDA) for semantic segmentation....

0 Xinyue Huo, et al. ∙

research

∙ 03/01/2023

ROCO: A Roundabout Traffic Conflict Dataset

Traffic conflicts have been studied by the transportation research commu...

0 Depu Meng, et al. ∙

research

∙ 02/10/2023

BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization

In this work, we are dedicated to leveraging the BERT pre-training succe...

0 Weichao Zhao, et al. ∙

research

∙ 01/25/2023

Discriminative Experience Replay for Efficient Multi-agent Reinforcement Learning

In cooperative multi-agent tasks, parameter sharing among agents is a co...

0 Xunhan Hu, et al. ∙

research

∙ 01/21/2023

Recurrent Contour-based Instance Segmentation with Progressive Learning

Contour-based instance segmentation has been actively studied, thanks to...

0 Hao Feng, et al. ∙

research

∙ 01/13/2023

OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object Detection

The recent trend for multi-camera 3D object detection is through the uni...

0 Xiaomeng Chu, et al. ∙

research

∙ 11/28/2022

Hand-Object Interaction Image Generation

In this work, we are dedicated to a new task, i.e., hand-object interact...

0 Hezhen Hu, et al. ∙

research

∙ 11/28/2022

CLIP2GAN: Towards Bridging Text with the Latent Space of GANs

In this work, we are dedicated to text-guided image generation and propo...

0 Yixuan Wang, et al. ∙

research

∙ 11/22/2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

We present SinDiffusion, leveraging denoising diffusion models to captur...

0 Weilun Wang, et al. ∙

research

∙ 11/16/2022

Stare at What You See: Masked Image Modeling without Reconstruction

Masked Autoencoders (MAE) have been prevailing paradigms for large-scale...

0 Hongwei Xue, et al. ∙

research

∙ 10/31/2022

DanZero: Mastering GuanDan Game with Reinforcement Learning

Card game AI has always been a hot topic in the research of artificial i...

0 Yudong Lu, et al. ∙

research

∙ 10/21/2022

Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding

Temporal language grounding (TLG) aims to localize a video segment in an...

0 Yuechen Wang, et al. ∙

research

∙ 10/15/2022

UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior

Document images captured by mobile devices are usually degraded by uncon...

0 Yonghui Wang, et al. ∙

research

∙ 10/15/2022

Geometric Representation Learning for Document Image Rectification

In document image rectification, there exist rich geometric constraints ...

0 Hao Feng, et al. ∙

research

∙ 09/14/2022

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment

The pre-trained image-text models, like CLIP, have demonstrated the stro...

0 Hongwei Xue, et al. ∙

research

∙ 08/26/2022

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

In 3D action recognition, there exists rich complementary information be...

0 Yunyao Mao, et al. ∙

research

∙ 08/23/2022

Low-Light Video Enhancement with Synthetic Event Guidance

Low-light video enhancement (LLVE) is an important yet challenging task ...

1 Lin Liu, et al. ∙

research

∙ 07/14/2022

Unified 2D and 3D Pre-Training of Molecular Representations

Molecular representation learning has attracted much attention recently....

0 Jinhua Zhu, et al. ∙

research

∙ 07/14/2022

Neighbor Correspondence Matching for Flow-based Video Frame Synthesis

Video frame synthesis, which consists of interpolation and extrapolation...

0 Zhaoyang Jia, et al. ∙

research

∙ 06/30/2022

Semantic Image Synthesis via Diffusion Models

Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkabl...

6 Weilun Wang, et al. ∙

research

∙ 06/14/2022

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

In this work, we explore neat yet effective Transformer-based frameworks...

19 Jiajun Deng, et al. ∙

research

∙ 06/08/2022

Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer

The increased integration of renewable energy poses a slew of technical ...

0 Minrui Wang, et al. ∙

research

∙ 05/23/2022

A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation

Non-Autoregressive generation is a sequence generation paradigm, which r...

0 Weizhen Qi, et al. ∙

research

∙ 05/08/2022

Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods

Actor-critic Reinforcement Learning (RL) algorithms have achieved impres...

1 Qing Li, et al. ∙

research

∙ 05/07/2022

Multi-Target Active Object Tracking with Monte Carlo Tree Search and Target Motion Modeling

In this work, we are dedicated to multi-target active object tracking (A...

14 Zheng Chen, et al. ∙

research

∙ 05/05/2022

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

Cooperative multi-agent reinforcement learning (MARL) has made prominent...

2 Mingyu Yang, et al. ∙

research

∙ 04/25/2022

Estimation of Reliable Proposal Quality for Temporal Action Detection

Temporal action detection (TAD) aims to locate and recognize the actions...

0 Junshan Hu, et al. ∙

research

∙ 04/06/2022

Domain-Agnostic Prior for Transfer Semantic Segmentation

Unsupervised domain adaptation (UDA) is an important topic in the comput...

0 Xinyue Huo, et al. ∙

research

∙ 04/06/2022

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

Recent years have witnessed the great breakthrough of deep reinforcement...

5 Youpeng Zhao, et al. ∙

research

∙ 03/30/2022

Large-Scale Pre-training for Person Re-identification with Noisy Labels

This paper aims to address the problem of pre-training for person re-ide...

4 Dengpan Fu, et al. ∙

Houqiang Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro