Longyin Wen

research

∙ 03/22/2023

Text with Knowledge Graph Augmented Transformer for Video Captioning

Video captioning aims to describe the content of videos using natural la...

0 Xin Gu, et al. ∙

research

∙ 03/06/2023

DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training

Large-scale pre-trained multi-modal models (e.g., CLIP) demonstrate stro...

0 Wei Li, et al. ∙

research

∙ 07/07/2022

Dual-Stream Transformer for Generic Event Boundary Captioning

This paper describes our champion solution for the CVPR2022 Generic Even...

0 Xin Gu, et al. ∙

research

∙ 06/25/2022

SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection

This report presents the algorithm used in the submission of Generic Eve...

0 Dexiang Hong, et al. ∙

research

∙ 06/07/2022

Structured Context Transformer for Generic Event Boundary Detection

Generic Event Boundary Detection (GEBD) aims to detect moments where hum...

0 Congcong Li, et al. ∙

research

∙ 03/29/2022

End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection

Generic event boundary detection aims to localize the generic, taxonomy-...

0 Congcong Li, et al. ∙

research

∙ 08/16/2021

Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark

Automatic security inspection using computer vision technology is a chal...

8 Boying Wang, et al. ∙

research

∙ 07/19/2021

VisDrone-CC2020: The Vision Meets Drone Crowd Counting Challenge Results

Crowd counting on the drone platform is an interesting topic in computer...

2 Dawei Du, et al. ∙

research

∙ 07/01/2021

Generic Event Boundary Detection Challenge at CVPR 2021 Technical Report: Cascaded Temporal Attention Network (CASTANET)

This report presents the approach used in the submission of Generic Even...

0 Dexiang Hong, et al. ∙

research

∙ 05/06/2021

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark

To promote the development of object detection, tracking and counting a...

18 Longyin Wen, et al. ∙

research

∙ 05/27/2020

Efficient Pig Counting in Crowds with Keypoints Tracking and Spatial-aware Temporal Response Filtering

Pig counting is a crucial task for large-scale pig farming, which is usu...

21 Guang Chen, et al. ∙

research

∙ 03/29/2020

Spatial Attention Pyramid Network for Unsupervised Domain Adaptation

Unsupervised domain adaptation is critical in various computer vision ta...

0 Congcong Li, et al. ∙

research

∙ 03/18/2020

Rethinking Object Detection in Retail Stores

The conventional standard for object detection uses a bounding box to repr...

0 Yuanqiang Cai, et al. ∙

research

∙ 03/16/2020

Multi-Drone based Single Object Tracking with Agent Sharing Network

Drones equipped with cameras can dynamically track the target in the air ...

0 Pengfei Zhu, et al. ∙

research

∙ 01/16/2020

Vision Meets Drones: Past, Present and Future

Drones, or general UAVs, equipped with cameras have been rapidly deployed w...

15 Pengfei Zhu, et al. ∙

research

∙ 12/20/2019

Learning Semantic Neural Tree for Human Parsing

The majority of existing human parsing methods formulate the task as sem...

6 Ruyi Ji, et al. ∙

research

∙ 12/11/2019

SiamMan: Siamese Motion-aware Network for Visual Tracking

In this paper, we present a novel siamese motion-aware network (SiamMan)...

0 Wenzhang Zhou, et al. ∙

research

∙ 12/04/2019

Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network

This paper proposes a space-time multi-scale attention network (STANet) ...

0 Longyin Wen, et al. ∙

research

∙ 09/25/2019

Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

Fine-grained visual categorization (FGVC) is an important but challengin...

17 Ruyi Ji, et al. ∙

research

∙ 09/25/2019

Guided Attention Network for Object Detection and Counting on Drones

Object detection and counting are related but challenging problems, espe...

12 Yuanqiang Cai, et al. ∙

research

∙ 08/06/2019

Multi-view Deep Subspace Clustering Networks

Multi-view subspace clustering aims to discover the inherent structure b...

1 Pengfei Zhu, et al. ∙

research

∙ 07/29/2019

ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

The ChaLearn large-scale gesture recognition challenge has been run twic...

1 Jun Wan, et al. ∙

research

∙ 04/10/2019

Data Priming Network for Automatic Check-Out

Automatic Check-Out (ACO) has received increased interest in recent years. ...

8 Congcong Li, et al. ∙

research

∙ 04/04/2019

Spatiotemporal CNN for Video Object Segmentation

In this paper, we present a unified, end-to-end trainable spatiotemporal...

0 Kai Xu, et al. ∙

research

∙ 12/10/2018

Learning Non-Uniform Hypergraph for Multi-Object Tracking

The majority of Multi-Object Tracking (MOT) algorithms based on the trac...

0 Longyin Wen, et al. ∙

research

∙ 11/06/2018

Evolvement Constrained Adversarial Learning for Video Style Transfer

Video style transfer is a useful component for applications such as augm...

0 Wenbo Li, et al. ∙

research

∙ 10/19/2018

ScratchDet: Exploring to Train Single-Shot Object Detectors from Scratch

Current state-of-the-art object detectors are fine-tuned from the off-th...

0 Rui Zhu, et al. ∙

research

∙ 07/23/2018

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

Pedestrian detection in crowded scenes is a challenging problem since th...

0 Shifeng Zhang, et al. ∙

research

∙ 04/20/2018

Vision Meets Drones: A Challenge

In this paper we present a large-scale visual object detection and track...

0 Pengfei Zhu, et al. ∙

research

∙ 11/18/2017

Single-Shot Refinement Neural Network for Object Detection

For object detection, the two-stage approach (e.g., Faster R-CNN) has be...

0 Shifeng Zhang, et al. ∙

research

∙ 06/13/2017

Contrast Enhancement Estimation for Digital Image Forensics

Inconsistency in contrast enhancement can be used to expose image forger...

0 Longyin Wen, et al. ∙

research

∙ 03/18/2016

Geometric Hypergraph Learning for Visual Tracking

Graph-based representation is widely used in the visual tracking field by fi...

0 Dawei Du, et al. ∙

research

∙ 11/13/2015

UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking

In recent years, numerous effective multi-object tracking (MOT) methods ...

0 Longyin Wen, et al. ∙
