Lin Ma

research

∙ 07/21/2023

Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields

Despite the tremendous progress in neural radiance fields (NeRF), we sti...

0 Wenbo Hu, et al. ∙

research

∙ 06/13/2023

E2E-LOAD: End-to-End Long-form Online Action Detection

Recently, there has been a growing trend toward feature-based approaches...

0 Shuqiang Cao, et al. ∙

research

∙ 06/13/2023

PaVa: a novel Path-based Valley-seeking clustering algorithm

Clustering methods are being applied to a wider range of scenarios invol...

0 Lin Ma, et al. ∙

research

∙ 05/22/2023

Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking

Existing end-to-end Multi-Object Tracking (e2e-MOT) methods have not sur...

0 Feng Yan, et al. ∙

research

∙ 05/08/2023

A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues

Conditional inference on joint textual and visual clues is a multi-modal...

0 Yunxin Li, et al. ∙

research

∙ 05/05/2023

LMEye: An Interactive Perception Network for Large Language Models

Training a Large Visual Language Model (LVLM) from scratch, like GPT-4, ...

0 Yunxin Li, et al. ∙

research

∙ 05/03/2023

A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text

Pretrained Vision-Language Models (VLMs) have achieved remarkable perfor...

0 Yunxin Li, et al. ∙

research

∙ 04/03/2023

Zero-Shot Semantic Segmentation with Decoupled One-Pass Network

Recently, the zero-shot semantic segmentation problem has attracted incr...

0 Cong Han, et al. ∙

research

∙ 03/31/2023

Adaptive Sparse Pairwise Loss for Object Re-Identification

Object re-identification (ReID) aims to find instances with the same ide...

0 Xiao Zhou, et al. ∙

research

∙ 03/13/2023

TriDet: Temporal Action Detection with Relative Boundary Modeling

In this paper, we present a one-stage framework TriDet for temporal acti...

0 Dingfeng Shi, et al. ∙

research

∙ 12/24/2022

DiP: Learning Discriminative Implicit Parts for Person Re-Identification

In person re-identification (ReID) tasks, many works explore the learnin...

0 Dengjie Li, et al. ∙

research

∙ 12/07/2022

Multiple Object Tracking Challenge Technical Report for Team MT_IoT

This is a brief technical report of our proposed method for Multiple-Obj...

0 Feng Yan, et al. ∙

research

∙ 10/22/2022

HAM: Hierarchical Attention Model with High Performance for 3D Visual Grounding

This paper tackles an emerging and challenging vision-language task, 3D ...

0 Jiaming Chen, et al. ∙

research

∙ 10/11/2022

Planning Assembly Sequence with Graph Transformer

Assembly sequence planning (ASP) is the essential process for modern man...

0 Lin Ma, et al. ∙

research

∙ 10/10/2022

Contrastive Video-Language Learning with Fine-grained Frame Sampling

Despite recent progress in video and language representation learning, t...

0 Zixu Wang, et al. ∙

research

∙ 10/08/2022

Contextual Modeling for 3D Dense Captioning on Point Clouds

3D dense captioning, as an emerging vision-language task, aims to identi...

0 Yufeng Zhong, et al. ∙

research

∙ 09/19/2022

Reweighting Clicks with Dwell Time in Recommendation

The click behavior is the most widely-used user positive feedback in rec...

0 Ruobing Xie, et al. ∙

research

∙ 09/16/2022

Weakly Supervised Semantic Segmentation via Progressive Patch Learning

Most of the existing semantic segmentation approaches with image-level c...

0 Jinlong Li, et al. ∙

research

∙ 09/16/2022

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation

Generating precise class-aware pseudo ground-truths, a.k.a, class activa...

0 Jinlong Li, et al. ∙

research

∙ 09/07/2022

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection

Fusing LiDAR and camera information is essential for achieving accurate ...

0 Yang Jiao, et al. ∙

research

∙ 08/30/2022

A Circular Window-based Cascade Transformer for Online Action Detection

Online action detection aims at the accurate action prediction of the cu...

15 Shuqiang Cao, et al. ∙

research

∙ 07/23/2022

Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations

Visual Entailment with natural language explanations aims to infer the r...

0 Qian Yang, et al. ∙

research

∙ 07/11/2022

MT-Net Submission to the Waymo 3D Detection Leaderboard

In this technical report, we introduce our submission to the Waymo 3D De...

0 Shaoxiang Chen, et al. ∙

research

∙ 07/03/2022

Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement

Getting rid of the fundamental limitations in fitting to the paired trai...

0 Zhangkai Ni, et al. ∙

research

∙ 05/13/2022

Leveraging Global Binary Masks for Structure Segmentation in Medical Images

Deep learning (DL) models for medical image segmentation are highly infl...

0 Mahdieh Kazemimoghadam, et al. ∙

research

∙ 03/10/2022

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach

Temporal Sentence Grounding in Videos (TSGV), which aims to ground a nat...

0 Xiaohan Lan, et al. ∙

research

∙ 03/10/2022

MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes

3D dense captioning is a recently-proposed novel task, where point cloud...

0 Yang Jiao, et al. ∙

research

∙ 03/10/2022

Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding

Recently, one-stage visual grounders attract high attention due to the c...

0 Yang Jiao, et al. ∙

research

∙ 12/02/2021

Controllable Video Captioning with an Exemplar Sentence

In this paper, we investigate a novel and challenging task, namely contr...

0 Yitian Yuan, et al. ∙

research

∙ 12/02/2021

Syntax Customized Video Captioning by Imitating Exemplar Sentences

Enhancing the diversity of sentences to describe video contents is an im...

0 Yitian Yuan, et al. ∙

research

∙ 10/09/2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation

Referring Image Segmentation (RIS) aims at segmenting the target object ...

0 Yang Jiao, et al. ∙

research

∙ 10/01/2021

Video Temporal Relationship Mining for Data-Efficient Person Re-identification

This paper is a technical report to our submission to the ICCV 2021 VIPr...

10 Siyu Chen, et al. ∙

research

∙ 07/27/2021

Discriminative-Generative Representation Learning for One-Class Anomaly Detection

As a kind of generative self-supervised learning methods, generative adv...

0 Xuan Xia, et al. ∙

research

∙ 05/29/2021

ECMO: Peripheral Transplantation to Rehost Embedded Linux Kernels

Dynamic analysis based on the full-system emulator QEMU is widely used f...

0 Muhui Jiang, et al. ∙

research

∙ 05/29/2021

Revisiting Challenges for Selective Data Protection of Real Applications

Selective data protection is a promising technique to defend against the...

0 Lin Ma, et al. ∙

research

∙ 05/09/2021

Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior

Rain is a common natural phenomenon. Taking images in the rain however o...

9 Kaihao Zhang, et al. ∙

research

∙ 03/24/2021

Relation-aware Instance Refinement for Weakly Supervised Visual Grounding

Visual grounding, which aims to build a correspondence between visual ob...

3 Yongfei Liu, et al. ∙

research

∙ 03/12/2021

Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop Removal

Rain streaks and rain drops are two natural phenomena, which degrade ima...

0 Kaihao Zhang, et al. ∙

research

∙ 02/18/2021

SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering

Medical visual question answering (Med-VQA) has tremendous potential in ...

0 Bo Liu, et al. ∙

research

∙ 01/05/2021

Similarity Reasoning and Filtration for Image-Text Matching

Image-text matching plays a critical role in bridging the vision and lan...

5 Haiwen Diao, et al. ∙

research

∙ 12/30/2020

Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network

In this work, we aim to learn an unpaired image enhancement model, which...

28 Zhangkai Ni, et al. ∙

research

∙ 12/30/2020

Towards Unsupervised Deep Image Enhancement with Generative Adversarial Network

Improving the aesthetic quality of images is challenging and eager for t...

4 Zhangkai Ni, et al. ∙

research

∙ 11/18/2020

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

We tackle human image synthesis, including human motion imitation, appea...

6 Wen Liu, et al. ∙

research

∙ 10/16/2020

Parsimonious Quantile Regression of Financial Asset Tail Dynamics via Sequential Learning

We propose a parsimonious quantile regression framework to learn the dyn...

0 Xing Yan, et al. ∙

research

∙ 09/02/2020

Intrinsic Relationship Reasoning for Small Object Detection

The small objects in images and videos are usually not independent indiv...

6 Kui Fu, et al. ∙

research

∙ 07/17/2020

Consensus-Aware Visual-Semantic Embedding for Image-Text Matching

Image-text matching plays a central role in bridging vision and language...

3 Haoran Wang, et al. ∙

research

∙ 04/04/2020

Deblurring by Realistic Blurring

Existing deep learning methods for image deblurring typically train mode...

67 Kaihao Zhang, et al. ∙

research

∙ 03/18/2020

STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition

Effective and Efficient spatio-temporal modeling is essential for action...

0 Xu Li, et al. ∙

research

∙ 03/16/2020

Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos

The task of temporally grounding textual queries in videos is to localiz...

8 Yijun Song, et al. ∙

research

∙ 03/01/2020

Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension

Referring expression comprehension (REF) aims at identifying a particula...

0 Zhenfang Chen, et al. ∙
