
-
Regularizing Meta-Learning via Gradient Dropout
With the growing attention on learning-to-learn new tasks using only a f...
-
Learning to See Through Obstructions
We present a learning-based approach for removing unwanted obstructions,...
-
Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline
Recovering a high dynamic range (HDR) image from a single low dynamic ra...
-
Deep Semantic Matching with Foreground Detection and Cycle-Consistency
Establishing dense semantic correspondences between object instances rem...
-
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge
Real-time semantic video segmentation is a challenging task due to the s...
-
Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective
Object frequency in the real world often follows a power law, leading to...
-
Collaborative Distillation for Ultra-Resolution Universal Style Transfer
Universal style transfer methods typically leverage rich representations...
-
CycleISP: Real Image Restoration via Improved Data Synthesis
The availability of large-scale datasets has helped unleash the true pot...
-
Learning Enriched Features for Real Image Restoration and Enhancement
With the goal of recovering high-quality image content from its degraded...
-
Self-supervised Single-view 3D Reconstruction via Semantic Consistency
We learn a self-supervised, single-view 3D reconstruction model that pre...
-
Gated Fusion Network for Degraded Image Super Resolution
Single image super resolution aims to enhance image quality with respect...
-
Model-Agnostic Structured Sparsification with Learnable Channel Shuffle
Recent advances in convolutional neural networks (CNNs) usually come wit...
-
Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning
Weakly-supervised semantic segmentation is a challenging task as no pixe...
-
Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation
Few-shot classification aims to recognize novel categories with only few...
-
Visual Question Answering on 360° Images
In this work, we introduce VQA 360, a novel task of visual question answ...
-
CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency
Unsupervised domain adaptation algorithms aim to transfer the knowledge ...
-
RC-DARTS: Resource Constrained Differentiable Architecture Search
Recent advances show that Neural Architectural Search (NAS) method is ab...
-
Controllable and Progressive Image Extrapolation
Image extrapolation aims at expanding the narrow field of view of a give...
-
Neural Design Network: Graphic Layout Generation with Constraints
Graphic design is essential for visual communication with layouts being ...
-
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations
Data privacy has emerged as an important issue as data-driven deep learn...
-
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications
Visual events are usually accompanied by sounds in our daily lives. Howe...
-
Dancing to Music
Dancing to music is an instinctive move by humans. Learning to model the...
-
Quadratic video interpolation
Video interpolation is an important problem in computer vision, which he...
-
Progressive Domain Adaptation for Object Detection
Recent deep learning methods for object detection rely on a large amount...
-
Referring Expression Object Segmentation with Caption-Aware Consistency
Referring expressions are natural language descriptions that identify a ...
-
Joint-task Self-supervised Learning for Temporal Correspondence
This paper proposes to learn reliable dense correspondence from videos i...
-
Video Stitching for Linear Camera Arrays
Despite the long history of image and video stitching research, existing...
-
Show, Match and Segment: Joint Learning of Semantic Matching and Object Co-segmentation
We present an approach for jointly matching and segmenting object instan...
-
Detection and Tracking of Multiple Mice Using Part Proposal Networks
The study of mouse social behaviours has been increasingly undertaken in...
-
Self-supervised Audio Spatialization with Correspondence Classifier
Spatial audio is an essential medium to audiences for 3D visual and audi...
-
Weakly-supervised Caricature Face Parsing through Domain Adaptation
A caricature is an artistic form of a person's picture in which certain ...
-
Few-Shot Viewpoint Estimation
Viewpoint estimation for known categories of objects has been improved s...
-
SCOPS: Self-Supervised Co-Part Segmentation
Parts provide a good intermediate representation of objects that is robu...
-
DRIT++: Diverse Image-to-Image Translation via Disentangled Representations
Image-to-image translation aims to learn the mapping between two visual ...
-
Target-Aware Deep Tracking
Existing deep trackers mainly use convolutional neural networks pre-trai...
-
Res2Net: A New Multi-scale Backbone Architecture
Representing features at multiple scales is of great importance for nume...
-
Depth-Aware Video Frame Interpolation
Video frame interpolation aims to synthesize nonexistent frames in-betwe...
-
Im2Pencil: Controllable Pencil Illustration from Photographs
We propose a high-quality photo-to-pencil translation method with fine-g...
-
Inserting Videos into Videos
In this paper, we introduce a new problem of manipulating a given video ...
-
Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments
Affordance modeling plays an important role in visual understanding. In ...
-
Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis
Most conditional generation tasks expect diverse outputs given a single ...
-
Online Multi-Object Tracking with Dual Matching Attention Networks
In this paper, we propose an online Multi-Object Tracking (MOT) approach...
-
Unseen Object Segmentation in Videos via Transferable Representations
In order to learn object segmentation models in videos, conventional met...
-
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection
In saliency detection, every pixel needs contextual information to make ...
-
Context-Aware Synthesis and Placement of Object Instances
Learning to insert an object instance into an image in a semantically co...
-
Joint Face Hallucination and Deblurring via Structure Generation and Detail Enhancement
We address the problem of restoring a high-resolution face image from a ...
-
A Fusion Approach for Multi-Frame Optical Flow Estimation
To date, top-performing optical flow estimation methods only take pairs ...
-
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement
Motion estimation (ME) and motion compensation (MC) have been widely use...
-
Deep Attentive Tracking via Reciprocative Learning
Visual attention, derived from cognitive neuroscience, facilitates human...