
-
Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search
The goal of person search is to localize and match query persons from sc...
read it
-
Learning Audio-Visual Correlations from Variational Cross-Modal Generation
People can easily imagine the potential sound while seeing an event. Thi...
read it
-
Modeling the Probabilistic Distribution of Unlabeled Data forOne-shot Medical Image Segmentation
Existing image segmentation networks mainly leverage large-scale labeled...
read it
-
AINet: Association Implantation for Superpixel Segmentation
Recently, some approaches are proposed to harness deep convolutional net...
read it
-
Supervision by Registration and Triangulation for Landmark Detection
We present Supervision by Registration and Triangulation (SRT), an unsup...
read it
-
Learning to Anticipate Egocentric Actions by Imagination
Anticipating actions before they are executed is crucial for a wide rang...
read it
-
SemGloVe: Semantic Co-occurrences for GloVe from BERT
GloVe learns word embeddings by leveraging statistical information from ...
read it
-
Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective
The re-ranking approach leverages high-confidence retrieved samples to r...
read it
-
Doubly Robust Adaptive LASSO for Effect Modifier Discovery
Effect modification occurs when the effect of the treatment on an outcom...
read it
-
ActBERT: Learning Global-Local Video-Text Representations
In this paper, we introduce ActBERT for self-supervised learning of join...
read it
-
Large-scale multilingual audio visual dubbing
We describe a system for large-scale audiovisual translation and dubbing...
read it
-
Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation
Domain adaptive semantic segmentation aims to train a model performing s...
read it
-
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
Corporate mergers and acquisitions (M A) account for billions of dolla...
read it
-
LID 2020: The Learning from Imperfect Data Challenge Results
Learning from imperfect data becomes an issue in many industrial applica...
read it
-
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration
This paper investigates the principles of embedding learning to tackle t...
read it
-
Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning
We present a simple few-shot named entity recognition (NER) system based...
read it
-
DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
Differentiable Architecture Search (DARTS) has attracted extensive atten...
read it
-
Tasks Integrated Networks: Joint Detection and Retrieval for Image Search
The traditional object retrieval task aims to learn a discriminative fea...
read it
-
Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization
Cross-view geo-localization is to spot images of the same geographic tar...
read it
-
Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition in the Wild
In this paper, the Point Adversarial Self Mining (PASM) approach, a simp...
read it
-
DONet: Dual Objective Networks for Skin Lesion Segmentation
Skin lesion segmentation is a crucial step in the computer-aided diagnos...
read it
-
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
With the arising concerns for the AI systems provided with direct access...
read it
-
Inter-Image Communication for Weakly Supervised Localization
Weakly supervised localization aims at finding target object regions usi...
read it
-
Single Image Brightening via Multi-Scale Exposure Fusion with Hybrid Learning
A small ISO and a small exposure time are usually used to capture an ima...
read it
-
Sketch-Guided Scenery Image Outpainting
The outpainting results produced by existing approaches are often too ra...
read it
-
FinBERT: A Pretrained Language Model for Financial Communications
Contextual pretrained language models, such as BERT (Devlin et al., 2019...
read it
-
Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement Maps
Recently, remarkable progress has been made in weakly supervised object ...
read it
-
Person Re-identification in the 3D Space
People live in a 3D world. However, existing works on person re-identifi...
read it
-
Feature Robust Optimal Transport for High-dimensional Data
Optimal transport is a machine learning technique with applications incl...
read it
-
Omni-supervised Facial Expression Recognition: A Simple Baseline
In this paper, we target on advancing the performance in facial expressi...
read it
-
VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification
One fundamental challenge of vehicle re-identification (re-id) is to lea...
read it
-
Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation
We aim at the problem named One-Shot Unsupervised Domain Adaptation. Unl...
read it
-
OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in An Open World
In this paper, we tackle the problem of discovering new classes in unlab...
read it
-
One Model to Recognize Them All: Marginal Distillation from NER Models with Different Tag Sets
Named entity recognition (NER) is a fundamental component in the modern ...
read it
-
Memory Aggregation Networks for Efficient Interactive Video Object Segmentation
Interactive video object segmentation (iVOS) aims at efficiently harvest...
read it
-
Collaborative Video Object Segmentation by Foreground-Background Integration
In this paper, we investigate the principles of embedding learning betwe...
read it
-
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior
Deep neural networks are known to be susceptible to adversarial noise, w...
read it
-
SF-Net: Single-Frame Supervision for Temporal Action Localization
In this paper, we study an intermediate form of supervision, i.e., singl...
read it
-
Rectifying Pseudo Label Learning via Uncertainty Estimation for Domain Adaptive Semantic Segmentation
This paper focuses on the unsupervised domain adaptation of transferring...
read it
-
Angle-Based Cost-Sensitive Multicategory Classification
Many real-world classification problems come with costs which can vary f...
read it
-
Grounded and Controllable Image Completion by Incorporating Lexical Semantics
In this paper, we present an approach, namely Lexical Semantic Image Com...
read it
-
University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization
We consider the problem of cross-view geo-localization. The primary chal...
read it
-
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Egocentric video recognition is a natural testbed for diverse interactio...
read it
-
Lane Detection in Low-light Conditions Using an Efficient Data Enhancement : Light Conditions Style Transfer
Nowadays, deep learning techniques are widely used for lane detection, b...
read it
-
Progressive Local Filter Pruning for Image Retrieval Acceleration
This paper focuses on network pruning for image retrieval acceleration. ...
read it
-
NAS-Bench-102: Extending the Scope of Reproducible Neural Architecture Search
Neural architecture search (NAS) has achieved breakthrough success in a ...
read it
-
Very Long Natural Scenery Image Prediction by Outpainting
Comparing to image inpainting, image outpainting receives less attention...
read it
-
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation
Utterance-level permutation invariant training (uPIT) has achieved promi...
read it
-
Unsupervised Scene Adaptation with Memory Regularization in vivo
We consider the unsupervised scene adaptation problem of learning from b...
read it
-
Instance-Invariant Adaptive Object Detection via Progressive Disentanglement
Most state-of-the-art methods of object detection suffer from poor gener...
read it