
-
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Many real-world applications such as robotics provide hard constraints o...
read it
-
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
In the standard Markov decision process formalism, users specify tasks b...
read it
-
Self-supervised Representation Learning with Relative Predictive Coding
This paper introduces Relative Predictive Coding (RPC), a new contrastiv...
read it
-
Instabilities of Offline RL with Pre-Trained Neural Representation
In offline reinforcement learning (RL), we seek to utilize offline data ...
read it
-
On Proximal Policy Optimization's Heavy-tailed Gradients
Modern policy gradient algorithms, notably Proximal Policy Optimization ...
read it
-
Reasoning Over Virtual Knowledge Bases With Open Predicate Relations
We present the Open Predicate Query Language (OPQL); a method for constr...
read it
-
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Although deep reinforcement learning has led to breakthroughs in many di...
read it
-
Understanding the Tradeoffs in Client-Side Privacy for Speech Recognition
Existing approaches to ensuring privacy of user speech data primarily fo...
read it
-
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment
The natural world is abundant with concepts expressed via visual, acoust...
read it
-
C-Learning: Learning to Achieve Goals via Recursive Classification
We study the problem of predicting and controlling the future state dist...
read it
-
Close Category Generalization
Out-of-distribution generalization is a core challenge in machine learni...
read it
-
Unsupervised Domain Adaptation for Visual Navigation
Advances in visual navigation methods have led to intelligent embodied n...
read it
-
Planning with Submodular Objective Functions
We study planning with submodular objective functions, where instead of ...
read it
-
Case Study: Deontological Ethics in NLP
Recent work in natural language processing (NLP) has focused on ethical ...
read it
-
Graph Adversarial Networks: Protecting Information against Adversarial Attacks
We study the problem of protecting information when learning with graph ...
read it
-
Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective Function
In this paper, we study bidirectional LSTM network for the task of text ...
read it
-
Few-Shot Learning with Intra-Class Knowledge Transfer
We consider the few-shot classification task with an unbalanced dataset,...
read it
-
Towards Debiasing Sentence Representations
As natural language processing methods are increasingly deployed in real...
read it
-
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
We propose a simple, practical, and intuitive approach for domain adapta...
read it
-
On Reward-Free Reinforcement Learning with Linear Function Approximation
Reward-free reinforcement learning (RL) is a framework which is suitable...
read it
-
Demystifying Self-Supervised Learning: An Information-Theoretical Framework
Self-supervised representation learning adopts self-defined signals as s...
read it
-
Neural Methods for Point-wise Dependency Estimation
Since its inception, the neural estimation of mutual information (MI) ha...
read it
-
Feature Robust Optimal Transport for High-dimensional Data
Optimal transport is a machine learning technique with applications incl...
read it
-
Provably Efficient Reinforcement Learning with General Value Function Approximation
Value function approximation has demonstrated phenomenal empirical succe...
read it
-
Guaranteeing Reproducibility in Deep Learning Competitions
To encourage the development of methods with reproducible and robust tra...
read it
-
Exploring Controllable Text Generation Techniques
Neural controllable text generation is an important area gaining attenti...
read it
-
Topological Sort for Sentence Ordering
Sentence ordering is the task of arranging the sentences of a given text...
read it
-
Politeness Transfer: A Tag and Generate Approach
This paper introduces a new task of politeness transfer which involves c...
read it
-
Interpretable Multimodal Routing for Human Multimodal Language
The human language has heterogeneous sources of information, including t...
read it
-
Weakly-Supervised Reinforcement Learning for Controllable Behavior
Reinforcement learning (RL) is a powerful framework for learning to take...
read it
-
On Emergent Communication in Competitive Multi-Agent Teams
Several recent works have found the emergence of grounded compositional ...
read it
-
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Multi-task reinforcement learning (RL) aims to simultaneously learn poli...
read it
-
Differentiable Reasoning over a Virtual Knowledge Base
We consider the task of answering complex multi-hop questions using a co...
read it
-
Learning Not to Learn in the Presence of Noisy Labels
Learning in the presence of label noise is a challenging yet important t...
read it
-
Capsules with Inverted Dot-Product Attention Routing
We introduce a new routing algorithm for capsule networks, in which a ch...
read it
-
Think Locally, Act Globally: Federated Learning with Local and Global Representations
Federated learning is an emerging research paradigm to train models on p...
read it
-
Geometric Capsule Autoencoders for 3D Point Clouds
We propose a method to learn object representations from 3D point clouds...
read it
-
Worst Cases Policy Gradients
Recent advances in deep reinforcement learning have demonstrated the cap...
read it
-
Multiple Futures Prediction
Temporal prediction is critical for making intelligent and robust decisi...
read it
-
Enhanced Convolutional Neural Tangent Kernels
Recent research shows that for training with ℓ_2 loss, convolutional neu...
read it
-
Learning Data Manipulation for Augmentation and Weighting
Manipulating data, such as weighting data examples or augmenting with ne...
read it
-
Complex Transformer: A Framework for Modeling Complex-Valued Sequence
While deep learning has received a surge of interest in a variety of fie...
read it
-
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
Recent research shows that the following two models are equivalent: (a) ...
read it
-
On Universal Approximation by Neural Networks with Uniform Guarantees on Approximation of Infinite Dimensional Maps
The study of universal approximation of arbitrary functions f: X→Y by ne...
read it
-
LSMI-Sinkhorn: Semi-supervised Squared-Loss Mutual Information Estimation with Optimal Transport
Estimating mutual information is an important machine learning and stati...
read it
-
Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Transformer is a powerful architecture that achieves superior performanc...
read it
-
MineRL: A Large-Scale Dataset of Minecraft Demonstrations
The sample inefficiency of standard deep reinforcement learning methods ...
read it
-
Learning Neural Networks with Adaptive Regularization
Feed-forward neural networks can be understood as a combination of an in...
read it
-
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization
There has been an increased interest in multimodal language processing i...
read it
-
Deep Gamblers: Learning to Abstain with Portfolio Theory
We deal with the selective classification problem (supervised-learning p...
read it