
-
Simplifying Reinforced Feature Selection via Restructured Choice Strategy of Single Agent
Feature selection aims to select a subset of features to optimize the pe...
read it
-
Text as Neural Operator: Image Manipulation by Text Instruction
In this paper, we study a new task that allows users to edit an input im...
read it
-
AdvAug: Robust Adversarial Augmentation for Neural Machine Translation
In this paper, we propose a new adversarial augmentation method for Neur...
read it
-
Controllable and Progressive Image Extrapolation
Image extrapolation aims at expanding the narrow field of view of a give...
read it
-
Neural Design Network: Graphic Layout Generation with Constraints
Graphic design is essential for visual communication with layouts being ...
read it
-
The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction
This paper studies the problem of predicting the distribution over multi...
read it
-
Synthetic vs Real: Deep Learning on Controlled Noise
Performing controlled experiments on noisy data is essential in thorough...
read it
-
Confident Learning: Estimating Uncertainty in Dataset Labels
Learning exists in the context of data, yet notions of confidence typica...
read it
-
Feature Partitioning for Efficient Multi-Task Architectures
Multi-task learning holds the promise of less data, parameters, and time...
read it
-
Robust Neural Machine Translation with Doubly Adversarial Inputs
Neural machine translation (NMT) often suffers from the vulnerability to...
read it
-
State-aware Re-identification Feature for Multi-target Multi-camera Tracking
Multi-target Multi-camera Tracking (MTMCT) aims to extract the trajector...
read it
-
Revisiting EmbodiedQA: A Simple Baseline and Beyond
In Embodied Question Answering (EmbodiedQA), an agent interacts with an ...
read it
-
Let's Transfer Transformations of Shared Semantic Representations
With a good image understanding capability, can we manipulate the images...
read it
-
Peeking into the Future: Predicting Future Person Activities and Locations in Videos
Deciphering human behaviors to predict their future paths/trajectories a...
read it
-
Contrastive Adaptation Network for Unsupervised Domain Adaptation
Unsupervised Domain Adaptation (UDA) makes predictions for the target do...
read it
-
Composing Text and Image for Image Retrieval - An Empirical Odyssey
In this paper, we study the task of image retrieval, where the input que...
read it
-
Focal Visual-Text Attention for Visual Question Answering
Recent insights on language and vision with neural networks have been su...
read it
-
Decoupled Novel Object Captioner
Image captioning is a challenging task where the machine automatically d...
read it
-
MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels
Recent studies have discovered that deep networks are capable of memoriz...
read it
-
Graph Distillation for Action Detection with Privileged Information
In this work, we propose a technique that tackles the video understandin...
read it
-
MemexQA: Visual Memex Question Answering
This paper proposes a new task, MemexQA: given a collection of photos or...
read it
-
Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification
We report on CMU Informedia Lab's system used in Google's YouTube 8 Mill...
read it
-
Self-paced Learning for Weakly Supervised Evidence Discovery in Multimedia Event Search
Multimedia event detection has been receiving increasing attention in re...
read it
-
Strategies for Searching Video Content with Text Queries or Video Examples
The large number of user-generated videos uploaded on to the Internet ev...
read it