
-
Knowledge-driven Self-supervision for Zero-shot Commonsense Question Answering
Recent developments in pre-trained neural language modeling have led to ...
-
Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games
In visual guessing games, a Guesser has to identify a target object in a...
-
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Given a simple request (e.g., Put a washed apple in the kitchen fridge),...
-
The Return of Lexical Dependencies: Neural Lexicalized PCFGs
In this paper we demonstrate that context free grammar (CFG) based metho...
-
RMM: A Recursive Mental Model for Dialog Navigation
Fluent communication requires understanding your audience. In the new co...
-
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos
Procedural knowledge, which we define as concrete information about the ...
-
Experience Grounds Language
Successful linguistic communication relies on a shared experience of the...
-
Multi-View Learning for Vision-and-Language Navigation
Learning to navigate in a visual environment following natural language ...
-
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
We present ALFRED (Action Learning From Realistic Environments and Direc...
-
PIQA: Reasoning about Physical Commonsense in Natural Language
To apply eyeshadow without a brush, should I use a cotton swab or a toot...
-
Robust Navigation with Language Pretraining and Stochastic Sampling
Core to the vision-and-language navigation (VLN) challenge is building r...
-
Defending Against Neural Fake News
Recent progress in natural language generation has raised dual-use conce...
-
HellaSwag: Can a Machine Really Finish Your Sentence?
Recent work by Zellers et al. (2018) introduced a new task of commonsens...
-
Improving Robot Success Detection using Static Object Data
We use static object data to improve success detection for stacking obje...
-
Prospection: Interpretable Plans From Language By Predicting the Future
High-level human instructions often correspond to behaviors with multipl...
-
Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation
We present FAST NAVIGATOR, a general framework for action decoding, whic...
-
Character-based Surprisal as a Model of Human Reading in the Presence of Errors
Intuitively, human readers cope easily with errors in text; typos, missp...
-
From Recognition to Cognition: Visual Commonsense Reasoning
Visual understanding goes well beyond object recognition. With one glanc...
-
Early Fusion for Goal Directed Robotic Vision
Increasingly, perceptual systems are being codified as strict pipelines ...
-
Shifting the Baseline: Single Modality Performance on Visual Navigation & QA
Language-and-vision navigation and question answering (QA) are exciting ...
-
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Given a partial description like "she opened the hood of the car," human...
-
Inducing Grammars with and for Neural Machine Translation
Machine translation systems require semantic knowledge and grammatical u...
-
Balancing Shared Autonomy with Human-Robot Communication
Robotic agents that share autonomy with a human should leverage human do...
-
CHALET: Cornell House Agent Learning Environment
We present CHALET, a 3D house simulator with support for navigation and ...
-
Learning Interpretable Spatial Operations in a Rich 3D Blocks World
In this paper, we study the problem of mapping natural language instruct...
-
Synthetic and Natural Noise Both Break Neural Machine Translation
Character-based neural machine translation (NMT) models alleviate out-of...
-
Natural Language Inference from Multiple Premises
We define a novel textual entailment task that requires inference over m...
-
Evaluating Induced CCG Parsers on Grounded Semantic Parsing
We compare the effectiveness of four different syntactic CCG parsers for...