
-
Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?
Algorithmic approaches to interpreting machine learning models have prol...
-
Optimization-Guided Binary Diversification to Mislead Neural Networks for Malware Detection
Motivated by the transformative impact of deep neural networks (DNNs) on...
-
Automatically Learning Data Augmentation Policies for Dialogue Tasks
Automatic data augmentation (AutoAugment) (Cubuk et al., 2019) searches ...
-
Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading Comprehension
Reading comprehension models often overfit to nuances of training datase...
-
Modality-Balanced Models for Visual Dialogue
The Visual Dialog task requires a model to exploit both image and conver...
-
ManyModalQA: Modality Disambiguation and QA over Diverse Inputs
We present a new multimodal question answering challenge, ManyModalQA, i...
-
n-ML: Mitigating Adversarial Examples via Ensembles of Topologically Manipulated Classifiers
This paper proposes a new defense called n-ML against adversarial exampl...
-
Metric Learning for Image Registration
Image registration is a key technique in medical image analysis to estim...
-
ViewSynth: Learning Local Features from Depth using View Synthesis
We address the problem of jointly detecting keypoints and learning descr...
-
Composition and decomposition of GANs
In this work, we propose a composition/decomposition framework for adver...
-
Flow Models for Arbitrary Conditional Likelihoods
Understanding the dependencies among features of a dataset is at the cor...
-
FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function
Neural Architecture Search (NAS) yields state-of-the-art neural networks...
-
Multi-Source Domain Adaptation for Text Classification via DistanceNet-Bandits
Domain adaptation performance of a learning algorithm on a target domain...
-
Graph Filtration Learning
We propose an approach to learning with graph-structured data in the pro...
-
Expressing Visual Relationships via Language
Describing images with text is a fundamental problem in vision-language ...
-
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
We introduce a new multimodal retrieval task - TV show Retrieval (TVR), ...
-
A Mask-RCNN Baseline for Probabilistic Object Detection
The Probabilistic Object Detection Challenge evaluates object detection ...
-
Deep Multi-View Learning via Task-Optimal CCA
Canonical Correlation Analysis (CCA) is widely used for multimodal data ...
-
Networks for Joint Affine and Non-parametric Image Registration
We introduce an end-to-end deep-learning framework for 3D medical image ...
-
Real-Time Quality Assessment of Pediatric MRI via Semi-Supervised Deep Nonlocal Residual Neural Networks
In this paper, we introduce an image quality assessment (IQA) method for...
-
Diagnosing the Environment Bias in Vision-and-Language Navigation
Vision-and-Language Navigation (VLN) requires an agent to follow natural...
-
Pano Popups: Indoor 3D Reconstruction with a Plane-Aware Network
In this work we present a method to train a plane-aware convolutional ne...
-
FVA: Modeling Perceived Friendliness of Virtual Agents Using Movement Characteristics
We present a new approach for improving the friendliness and warmth of a...
-
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Given a video with aligned dialogue, people can often infer what is more...
-
Interactive Medical Image Segmentation via Point-Based Interaction and Sequential Patch Learning
Due to low tissue contrast, irregular object appearance, and unpredictab...
-
The Domain Transform Solver
We present a framework for edge-aware optimization that is an order of m...
-
Recurrent Neural Network for Learning DenseDepth and Ego-Motion from Video
Learning-based, single-view depth estimation often generalizes poorly to...
-
Commonsense for Generative Multi-Hop Question Answering Tasks
Reading comprehension QA tasks have seen a recent surge in popularity, y...
-
StereoDRNet: Dilated Residual Stereo Net
We propose a system that uses a convolutional neural network (CNN) to esti...
-
Realtime Simulation of Thin-Shell Deformable Materials using CNN-Based Mesh Embedding
We address the problem of accelerating thin-shell deformable object simu...
-
Estimating heterogeneous treatment effects with right-censored data via causal survival forests
There is fast-growing literature on estimating heterogeneous treatment e...
-
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Generating multi-sentence descriptions for videos is one of the most cha...
-
Deep Fiducial Inference
Since the mid-2000s, there has been a resurrection of interest in modern...
-
SSD: Single Shot MultiBox Detector
We present a method for detecting objects in images using a single deep ...
-
Deep Convolutional Ranking for Multilabel Image Annotation
Multilabel image annotation is one of the most important challenges in c...
-
ParseNet: Looking Wider to See Better
We present a technique for adding global context to deep convolutional n...
-
DSSD : Deconvolutional Single Shot Detector
The main contribution of this paper is an approach for introducing addit...
-
Fast Single Shot Detection and Pose Estimation
For applications in navigation and robotics, estimating the 3D pose of o...
-
Smooth Primal-Dual Coordinate Descent Algorithms for Nonsmooth Convex Optimization
We propose a new randomized coordinate descent method for a convex optim...
-
The Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description
Support vector data description (SVDD) is a popular technique for detect...
-
Heat Kernel Smoothing in Irregular Image Domains
We present the discrete version of heat kernel smoothing on graph data s...
-
Hierarchically-Attentive RNN for Album Summarization and Storytelling
We address the problem of end-to-end visual storytelling. Given a photo ...
-
Shortcut-Stacked Sentence Encoders for Multi-Domain Inference
We present a simple sequential sentence encoder for multi-domain natural...
-
Reinforced Video Captioning with Entailment Rewards
Sequence-to-sequence models have shown promising improvements on the tem...
-
Video Highlight Prediction Using Audience Chat Reactions
Sports channel video portals offer an exciting domain for research on mu...
-
Source-Target Inference Models for Spatial Instruction Understanding
Models that can execute natural language instructions for situated robot...
-
Punny Captions: Witty Wordplay in Image Descriptions
Wit is a quintessential form of rich inter-human interaction, and is oft...
-
Multi-Task Video Captioning with Video and Entailment Generation
Video captioning, the task of describing the content of a video, has see...
-
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
Referring expressions are natural language constructions used to identif...
-
Deep Learning with Topological Signatures
Inferring topological and geometrical information from data can offer an...