In this paper, we consider feature screening for ultrahigh dimensional
c...
In applications such as gene regulatory network analysis based on single...
Most existing scene text detectors focus on detecting characters or word...
Gene regulatory network (GRN) refers to the complex network formed by
re...
Unsupervised domain adaptation (UDA) aims to enhance the generalization
...
Generally, humans are more skilled at perceiving differences between
hig...
We present a new neighbor sampling method on temporal graphs. In a tempo...
A more realistic object detection paradigm, Open-World Object Detection,...
Supervised learning is dominant in person search, but it requires elabor...
Few-shot learning aims to adapt knowledge learned from previous tasks to...
Video scene parsing is a long-standing challenging task in computer visi...
This paper studies the context aggregation problem in semantic image
seg...
Autonomous highlight detection is crucial for enhancing the efficiency o...
Retrieving occlusion relation among objects in a single image is challen...
The nonlocal-based blocks are designed for capturing long-range
spatial-...
Given input images, scene graph generation (SGG) aims to produce
compreh...
Contrastive learning applied to self-supervised representation learning ...
Semi-Supervised Learning (SSL) has shown its strong ability in utilizing...
Leveraging the advances of natural language processing, most recent scen...
The training loss function that enforces certain training sample distrib...
Data in the real world tends to exhibit a long-tailed label distribution...
Although single-image super-resolution (SISR) methods have achieved grea...
In this paper, we propose MINE to perform novel view synthesis and depth...
Fine-grained visual classification (FGVC) which aims at recognizing obje...
Fine-grained image recognition is very challenging due to the difficulty...
Most typical click models assume that the probability of a document to b...
Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...
End-to-end one-stage object detection trailed thus far. This paper disco...
Generative adversarial networks (GANs) have achieved remarkable progress...
Although deep learning based methods have achieved great progress in
uns...
We present Sparse R-CNN, a purely sparse method for object detection in
...
Conducting genome-wide association studies (GWAS) in copy number variati...
Visual Semantic Embedding (VSE) is a dominant approach for vision-langua...
Semi-supervised video object segmentation is an interesting yet challeng...
We address the problem of spatio-temporal action detection in videos.
Ex...
Multi-person pose estimation is a fundamental yet challenging task in
co...
Multi-person pose estimation is an important but challenging problem in
...
This paper studies the problem of generalized zero-shot learning which
r...
In this work, we propose a mask propagation network to treat the video
s...
Attention mechanisms have been widely used in Visual Question Answering ...
Leveraging both visual frames and audio has been experimentally proven
e...
There has been a drastic growth of research in Generative Adversarial Ne...
In this work we study the problem of network morphism, an effective lear...
Surveillance video parsing, which segments the video frames into several...
We present in this paper a systematic study on how to morph a well-train...