
-
Incorporating Vision Bias into Click Models for Image-oriented Search Engine
Most typical click models assume that the probability of a document to b...
read it
-
TransTrack: Multiple-Object Tracking with Transformer
Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...
read it
-
OneNet: Towards End-to-End One-Stage Object Detection
End-to-end one-stage object detection trailed thus far. This paper disco...
read it
-
Slimmable Generative Adversarial Networks
Generative adversarial networks (GANs) have achieved remarkable progress...
read it
-
F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation
Although deep learning based methods have achieved great progress in uns...
read it
-
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
We present Sparse R-CNN, a purely sparse method for object detection in ...
read it
-
A robust statistical method for Genome-wide association analysis of human copy number variation
Conducting genome-wide association studies (GWAS) in copy number variati...
read it
-
Learning the Best Pooling Strategy for Visual Semantic Embedding
Visual Semantic Embedding (VSE) is a dominant approach for vision-langua...
read it
-
Towards Good Practices for Video Object Segmentation
Semi-supervised video object segmentation is an interesting yet challeng...
read it
-
Deformable Tube Network for Action Detection in Videos
We address the problem of spatio-temporal action detection in videos. Ex...
read it
-
A Context-and-Spatial Aware Network for Multi-Person Pose Estimation
Multi-person pose estimation is a fundamental yet challenging task in co...
read it
-
Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information
Multi-person pose estimation is an important but challenging problem in ...
read it
-
Generative Dual Adversarial Network for Generalized Zero-shot Learning
This paper studies the problem of generalized zero-shot learning which r...
read it
-
Mask Propagation Network for Video Object Segmentation
In this work, we propose a mask propagation network to treat the video s...
read it
-
Knowing Where to Look? Analysis on Attention of Visual Question Answering System
Attention mechanisms have been widely used in Visual Question Answering ...
read it
-
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Leveraging both visual frames and audio has been experimentally proven e...
read it
-
An Introduction to Image Synthesis with Generative Adversarial Nets
There has been a drastic growth of research in Generative Adversarial Ne...
read it
-
Modularized Morphing of Neural Networks
In this work we study the problem of network morphism, an effective lear...
read it
-
Surveillance Video Parsing with Single Frame Supervision
Surveillance video parsing, which segments the video frames into several...
read it
-
Network Morphism
We present in this paper a systematic study on how to morph a well-train...
read it