
Identifying Critical Neurons in ANN Architectures using Mixed Integer Programming
We introduce a novel approach to optimize the architecture of deep neura...
SketchTransfer: A Challenging New Task for Exploring Detail-Invariance and the Abstractions Learned by Deep Networks
Deep networks have achieved excellent results in perceptual tasks, yet t...
Boosting Image Recognition with Non-differentiable Constraints
In this paper, we study the problem of image recognition with non-differ...
Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models
AI Safety is a major concern in many deep learning applications such as ...
Graph Policy Network for Transferable Active Learning on Graphs
Graph neural networks (GNNs) have been attracting increasing popularity ...
Selective Brain Damage: Measuring the Disparate Impact of Model Pruning
Neural network pruning techniques have demonstrated it is possible to re...
GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation
Molecular graph generation is a fundamental problem for drug discovery a...
Few-Shot Single-View 3D Object Reconstruction with Compositional Priors
The impressive performance of deep convolutional neural networks in sing...
Network Moments: Extensions and Sparse-Smooth Attacks
The impressive performance of deep neural networks (DNNs) has immensely ...
Talk the Walk: Navigating New York City through Grounded Dialogue
We introduce "Talk The Walk", the first large-scale dialogue dataset gro...
Applying Knowledge Transfer for Water Body Segmentation in Peru
In this work, we present the application of convolutional neural network...
Supervised Visualization for Data Exploration
Dimensionality reduction is often used as an initial step in data explor...
The effect of task and training on intermediate representations in convolutional neural networks revealed with modified RV similarity analysis
Centered Kernel Alignment (CKA) was recently proposed as a similarity me...
Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates
Recent works have shown that stochastic gradient descent (SGD) achieves ...
Towards Understanding Generalization in Gradient-Based Meta-Learning
In this work we study generalization of neural networks in gradient-base...
Improved Conditional VRNNs for Video Prediction
Predicting future frames for a video sequence is a challenging generativ...
Human Gait Symmetry Assessment using a Depth Camera and Mirrors
This paper proposes a reliable approach for human gait symmetry assessme...
Neural Multisensory Scene Inference
For embodied agents to infer representations of the underlying 3D physic...
RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space
We study the problem of learning representations of entities and relatio...
Perceptual Generative Autoencoders
Modern generative models are usually designed to match target distributi...
Detecting semantic anomalies
We critically appraise the recent interest in out-of-distribution (OOD) ...
Scattering GCN: Overcoming Oversmoothness in Graph Convolutional Networks
Graph convolutional networks (GCNs) have shown promising results in proc...
GraphVite: A High-Performance CPU-GPU Hybrid System for Node Embedding
Learning continuous representations of nodes is attracting growing inter...
Gradient-Based Neural DAG Learning
We propose a novel score-based approach to learning a directed acyclic g...
Straight to the Tree: Constituency Parsing with Neural Syntactic Distance
In this work, we propose a novel constituency parsing scheme. The model ...
U-Net Fixed-Point Quantization for Medical Image Segmentation
Model quantization is leveraged to reduce the memory consumption and the...
Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics
A recent strategy to circumvent the exploding and vanishing gradient pro...
Stochastic Neural Network with Kronecker Flow
Recent advances in variational inference enable the modelling of highly ...
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
We investigate the task of building open domain, conversational dialogue...
Invariant Representations for Noisy Speech Recognition
Modern automatic speech recognition (ASR) systems need to be robust unde...
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues
Sequential data often possesses a hierarchical structure with complex de...
Revisiting Natural Gradient for Deep Networks
We evaluate natural gradient, an algorithm originally proposed in Amari ...
Knowledge Matters: Importance of Prior Information for Optimization
We explore the effect of introducing prior information into the intermed...
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation
We introduce the multiresolution recurrent neural network, which extends...
On Training Deep Boltzmann Machines
The deep Boltzmann machine (DBM) has been an important development in th...
Theano: new features and speed improvements
Theano is a linear algebra compiler that optimizes a user's symbolically...
Iterative Alternating Neural Attention for Machine Reading
We propose a novel neural attention architecture to tackle machine compr...
A Hierarchical Recurrent Encoder-Decoder For Generative Context-Aware Query Suggestion
Users may strive to formulate an adequate textual query for their inform...
On the number of response regions of deep feed forward networks with piecewise linear activations
This paper explores the complexity of deep feedforward networks with lin...
Neural Networks with Few Multiplications
For most deep learning algorithms training is notoriously time consuming...
On the saddle point problem for non-convex optimization
A central challenge to many fields of science and engineering involves m...
A Character-Level Decoder without Explicit Segmentation for Neural Machine Translation
The existing machine translation systems, whether phrase-based or neural...
Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Layer normalization is a recently introduced technique for normalizing t...
Recurrent Neural Networks With Limited Numerical Precision
Recurrent Neural Networks (RNNs) produce state-of-art performance on man...
Pylearn2: a machine learning research library
Pylearn2 is a machine learning research library. This does not just mean...
Context-Dependent Word Representation for Neural Machine Translation
We first observe a potential weakness of continuous vector representatio...
How to Construct Deep Recurrent Neural Networks
In this paper, we explore different ways to extend a recurrent neural ne...
Memory Augmented Neural Networks with Wormhole Connections
Recent empirical results on long-term dependency tasks have shown that n...
Learning to Compute Word Embeddings On the Fly
Words in natural language follow a Zipfian distribution whereby some wor...
Learning to Understand Phrases by Embedding the Dictionary
Distributional models that learn rich semantic word representations are ...
