Karthik Narasimhan

research

∙ 09/05/2023

Cognitive Architectures for Language Agents

Recent efforts have incorporated large language models (LLMs) with exter...

0 Theodore Sumers, et al. ∙

research

∙ 07/18/2023

Scaling Laws for Imitation Learning in NetHack

Imitation Learning (IL) is one of the most widely used methods in machin...

0 Jens Tuyls, et al. ∙

research

∙ 07/17/2023

COLLIE: Systematic Construction of Constrained Text Generation Tasks

Text generation under constraints have seen increasing interests in natu...

0 Shunyu Yao, et al. ∙

research

∙ 07/01/2023

InstructEval: Systematic Evaluation of Instruction Selection Methods

In-context learning (ICL) performs tasks by prompting a large language m...

0 Anirudh Ajith, et al. ∙

research

∙ 06/26/2023

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

Humans write code in a fundamentally interactive manner and rely on cons...

0 John Yang, et al. ∙

research

∙ 05/24/2023

Referral Augmentation for Zero-Shot Information Retrieval

We propose Referral-Augmented Retrieval (RAR), a simple technique that c...

0 Michael Tang, et al. ∙

research

∙ 05/24/2023

CSTS: Conditional Semantic Textual Similarity

Semantic textual similarity (STS) has been a cornerstone task in NLP tha...

0 Ameet Deshpande, et al. ∙

research

∙ 05/24/2023

Anthropomorphization of AI: Opportunities and Risks

Anthropomorphization is the tendency to attribute human-like traits to n...

0 Ameet Deshpande, et al. ∙

research

∙ 05/24/2023

PruMUX: Augmenting Data Multiplexing with Model Compression

As language models increase in size by the day, methods for efficient in...

0 Yushan Su, et al. ∙

research

∙ 05/17/2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language models are increasingly being deployed for general problem solv...

0 Shunyu Yao, et al. ∙

research

∙ 04/11/2023

Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

Large language models (LLMs) have shown incredible capabilities and tran...

10 Ameet Deshpande, et al. ∙

research

∙ 02/24/2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing

Data multiplexing is a recently proposed method for improving a model's ...

0 Vishvak Murahari, et al. ∙

research

∙ 01/26/2023

SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

Extreme classification (XC) involves predicting over large numbers of cl...

0 Pranjal Aggarwal, et al. ∙

research

∙ 12/20/2022

Controllable Text Generation with Language Constraints

We consider the task of text generation in language models with constrai...

0 Howard Chen, et al. ∙

research

∙ 11/29/2022

SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

Fine-tuning pre-trained language models (PLMs) achieves impressive perfo...

0 Ameet Deshpande, et al. ∙

research

∙ 11/15/2022

ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training

Multilingual pre-trained models exhibit zero-shot cross-lingual transfer...

0 Henry Tang, et al. ∙

research

∙ 10/06/2022

ReAct: Synergizing Reasoning and Acting in Language Models

While large language models (LLMs) have demonstrated impressive capabili...

7 Shunyu Yao, et al. ∙

research

∙ 07/04/2022

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Existing benchmarks for grounding language in interactive environments e...

0 Shunyu Yao, et al. ∙

research

∙ 06/27/2022

Leveraging Language for Accelerated Learning of Tool Manipulation

Robust and generalized tool manipulation requires an understanding of th...

0 Allen Z. Ren, et al. ∙

research

∙ 05/23/2022

Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

Strong inductive biases are a key component of human intelligence, allow...

1 Sreejan Kumar, et al. ∙

research

∙ 04/25/2022

Can Rationalization Improve Robustness?

A growing line of work has investigated the development of neural NLP mo...

2 Howard Chen, et al. ∙

research

∙ 03/15/2022

CARETS: A Consistency And Robustness Evaluative Test Suite for VQA

We introduce CARETS, a systematic test suite to measure consistency and ...

2 Carlos E. Jimenez, et al. ∙

research

∙ 02/26/2022

Semantic Supervision: Enabling Generalization over Output Spaces

In this paper, we propose Semantic Supervision (SemSup) - a unified para...

0 Austin W. Hanjie, et al. ∙

research

∙ 02/18/2022

DataMUX: Data Multiplexing for Neural Networks

In this paper, we introduce data multiplexing (DataMUX), a technique tha...

9 Vishvak Murahari, et al. ∙

research

∙ 01/10/2022

Multi-query Video Retrieval

Retrieving target videos based on text descriptions is a task of great p...

0 Zeyu Wang, et al. ∙

research

∙ 01/04/2022

Multi-Stage Episodic Control for Strategic Exploration in Text Games

Text adventure games present unique challenges to reinforcement learning...

3 Jens Tuyls, et al. ∙

research

∙ 10/27/2021

When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer

While recent work on multilingual language models has demonstrated their...

0 Ameet Deshpande, et al. ∙

research

∙ 10/20/2021

SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark

Existing work in language grounding typically study single environments....

0 Victor Zhong, et al. ∙

research

∙ 06/28/2021

Revelio: ML-Generated Debugging Queries for Distributed Systems

A major difficulty in debugging distributed systems lies in manually det...

0 Pradeep Dogga, et al. ∙

research

∙ 05/24/2021

Self-Attention Networks Can Process Bounded Hierarchical Languages

Despite their impressive performance in NLP, self-attention networks wer...

12 Shunyu Yao, et al. ∙

research

∙ 03/25/2021

Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents

Text-based games simulate worlds and interact with players using natural...

0 Shunyu Yao, et al. ∙

research

∙ 01/19/2021

Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning

In this paper, we consider the problem of leveraging textual description...

0 H. J. Austin Wang, et al. ∙

research

∙ 11/27/2020

Connecting Context-specific Adaptation in Humans to Meta-learning

Cognitive control, the ability of a system to adapt to the demands of a ...

0 Rachit Dubey, et al. ∙

research

∙ 10/20/2020

Generating Strategic Dialogue for Negotiation with Theory of Mind

We propose a framework to integrate the concept of Theory of Mind (ToM) ...

0 Runzhe Yang, et al. ∙

research

∙ 10/11/2020

Safe Reinforcement Learning with Natural Language Constraints

In this paper, we tackle the problem of learning control policies for ta...

27 Tsung-Yen Yang, et al. ∙

research

∙ 10/07/2020

Projection-Based Constrained Policy Optimization

We consider the problem of learning control policies that optimize a rew...

0 Tsung-Yen Yang, et al. ∙

research

∙ 10/06/2020

Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Text-based games present a unique challenge for autonomous agents to ope...

0 Shunyu Yao, et al. ∙

research

∙ 10/06/2020

Guiding Attention for Self-Supervised Learning with Transformers

In this paper, we propose a simple and effective technique to allow for ...

0 Ameet Deshpande, et al. ∙

research

∙ 09/30/2020

Learning Rewards from Linguistic Feedback

We explore unconstrained natural language feedback as a learning signal ...

0 Theodore R. Sumers, et al. ∙

research

∙ 09/08/2020

Towards Unique and Informative Captioning of Images

Despite considerable progress, state of the art image captioning models ...

7 Zeyu Wang, et al. ∙

research

∙ 07/11/2020

Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation

The ability to perform effective planning is crucial for building an ins...

11 Zhiwei Deng, et al. ∙

research

∙ 06/20/2020

Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies

We consider the problem of reinforcement learning when provided with a b...

0 Tsung-Yen Yang, et al. ∙

research

∙ 05/01/2020

Universal Adversarial Attacks with Natural Triggers for Text Classification

Recent work has demonstrated the vulnerability of modern text classifier...

0 Liwei Song, et al. ∙

research

∙ 03/31/2020

Take the Scenic Route: Improving Generalization in Vision-and-Language Navigation

In the Vision-and-Language Navigation (VLN) task, an agent with egocentr...

0 Felix Yu, et al. ∙

research

∙ 08/21/2019

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

We introduce a new algorithm for multi-objective reinforcement learning ...

0 Runzhe Yang, et al. ∙

research

∙ 06/11/2019

Calibration, Entropy Rates, and Memory in Language Models

Building accurate language models that capture meaningful long-term depe...

1 Mark Braverman, et al. ∙

research

∙ 05/13/2019

Task-Agnostic Dynamics Priors for Deep Reinforcement Learning

While model-based deep reinforcement learning (RL) holds great promise f...

0 Yilun Du, et al. ∙

research

∙ 08/01/2017

Deep Transfer in Reinforcement Learning by Language Grounding

In this paper, we explore the utilization of natural language to drive t...

0 Karthik Narasimhan, et al. ∙

research

∙ 07/13/2017

Representation Learning for Grounded Spatial Reasoning

The interpretation of spatial references is highly contextual, requiring...

0 Michael Janner, et al. ∙

research

∙ 02/22/2017

Unsupervised Learning of Morphological Forests

This paper focuses on unsupervised modeling of morphological families, c...

0 Jiaming Luo, et al. ∙

