b'Tushar Khot'

research

∙ 05/26/2023

Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance

As large language models (LLMs) are continuously being developed, their ...

0 Yao Fu, et al. ∙

research

∙ 05/17/2023

Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback

We study whether multiple large language models (LLMs) can autonomously ...

0 Yao Fu, et al. ∙

research

∙ 01/30/2023

Specializing Smaller Language Models towards Multi-Step Reasoning

The surprising ability of Large Language Models (LLMs) to perform well o...

0 Yao Fu, et al. ∙

research

∙ 12/20/2022

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

Recent work has shown that large language models are capable of generati...

0 Harsh Trivedi, et al. ∙

research

∙ 10/18/2022

The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

How reliably can we trust the scores obtained from social bias benchmark...

0 Nikil Roashan Selvam, et al. ∙

research

∙ 10/05/2022

Decomposed Prompting: A Modular Approach for Solving Complex Tasks

Few-shot prompting is a surprisingly powerful way to use Large Language ...

2 Tushar Khot, et al. ∙

research

∙ 10/03/2022

Complexity-Based Prompting for Multi-Step Reasoning

We study the task of prompting large-scale language models to perform mu...

4 Yao Fu, et al. ∙

research

∙ 05/25/2022

Teaching Broad Reasoning Skills via Decomposition-Guided Contexts

Question-answering datasets require a broad set of reasoning skills. We ...

0 Harsh Trivedi, et al. ∙

research

∙ 05/07/2022

Better Retrieval May Not Lead to Better Question Answering

Considerable progress has been made recently in open-domain question ans...

0 Zhengzhong Liang, et al. ∙

research

∙ 10/16/2021

Learning to Solve Complex Tasks by Talking to Agents

Humans often solve complex problems by interacting (in natural language)...

0 Tushar Khot, et al. ∙

research

∙ 08/02/2021

MuSiQue: Multi-hop Questions via Single-hop Question Composition

To build challenging multi-hop question answering datasets, we propose a...

0 Harsh Trivedi, et al. ∙

research

∙ 06/02/2021

Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?

Is it possible to use natural language to intervene in a model's behavio...

10 Jieyu Zhao, et al. ∙

research

∙ 04/18/2021

GooAQ: Open Question Answering with Diverse Answer Types

While day-to-day questions come with a variety of answer types, the curr...

0 Daniel Khashabi, et al. ∙

research

∙ 02/05/2021

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

We present the ARC-DA dataset, a direct-answer ("open response", "freefo...

0 Sumithra Bhakthavatsalam, et al. ∙

research

∙ 01/06/2021

Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

A key limitation in current datasets for multi-hop reasoning is that the...

0 Mor Geva, et al. ∙

research

∙ 11/13/2020

IIRC: A Dataset of Incomplete Information Reading Comprehension Questions

Humans often have to read multiple documents to address their informatio...

0 James Ferguson, et al. ∙

research

∙ 10/24/2020

ReadOnce Transformers: Reusable Representations of Text for Transformers

While large-scale language models are extremely effective when directly ...

0 Shih-Ting Lin, et al. ∙

research

∙ 10/24/2020

Temporal Reasoning on Implicit Events from Distant Supervision

Existing works on temporal reasoning among events described in text focu...

0 Ben Zhou, et al. ∙

research

∙ 10/06/2020

UNQOVERing Stereotyping Biases via Underspecified Questions

While language embeddings have been shown to have stereotyping biases, h...

0 Tao Li, et al. ∙

research

∙ 09/01/2020

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

A common approach to solve complex tasks is by breaking them down into s...

7 Tushar Khot, et al. ∙

research

∙ 05/02/2020

Measuring and Reducing Non-Multifact Reasoning in Multi-hop Question Answering

The measurement of true progress in multihop question-answering has been...

0 Harsh Trivedi, et al. ∙

research

∙ 05/02/2020

UnifiedQA: Crossing Format Boundaries With a Single QA System

Question answering (QA) tasks have been posed using a variety of formats...

4 Daniel Khashabi, et al. ∙

research

∙ 04/14/2020

A Simple Yet Strong Pipeline for HotpotQA

State-of-the-art models for multi-hop question answering typically augme...

0 Dirk Groeneveld, et al. ∙

research

∙ 04/09/2020

Natural Perturbation for Robust Question Answering

While recent models have achieved human-level scores on many NLP dataset...

1 Daniel Khashabi, et al. ∙

research

∙ 10/25/2019

QASC: A Dataset for Question Answering via Sentence Composition

Composing knowledge from multiple pieces of texts is a key challenge in ...

0 Tushar Khot, et al. ∙

research

∙ 09/19/2019

What's Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering

Multi-hop textual question answering requires combining information from...

0 Tushar Khot, et al. ∙

research

∙ 09/04/2019

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

AI has achieved remarkable mastery over games such as Chess, Go, and Pok...

8 Peter Clark, et al. ∙

research

∙ 06/09/2019

Question Answering as Global Reasoning over Semantic Abstractions

We propose a novel method for exploiting the semantic structure of text ...

3 Daniel Khashabi, et al. ∙

research

∙ 04/20/2019

Repurposing Entailment for Multi-Hop Question Answering Tasks

Question Answering (QA) naturally reduces to an entailment problem, name...

0 Harsh Trivedi, et al. ∙

research

∙ 01/08/2019

On the Capabilities and Limitations of Reasoning for Natural Language Understanding

Recent systems for natural language understanding are strong at overcomi...

12 Daniel Khashabi, et al. ∙

research

∙ 11/02/2018

Exploiting Explicit Paths for Multi-hop Reading Comprehension

We focus on the task of multi-hop reading comprehension where a system i...

0 Souvik Kundu, et al. ∙

research

∙ 09/08/2018

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering

We present a new kind of question answering dataset, OpenBookQA, modeled...

0 Todor Mihaylov, et al. ∙

research

∙ 08/28/2018

Bridging Knowledge Gaps in Neural Entailment via Symbolic Models

Most textual entailment models focus on lexical gaps between the premise...

0 Dongyeop Kang, et al. ∙

research

∙ 08/06/2018

Structure Learning for Relational Logistic Regression: An Ensemble Approach

We consider the problem of learning Relational Logistic Regression (RLR)...

14 Nandini Ramanan, et al. ∙

research

∙ 05/12/2018

AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples

We consider the problem of learning textual entailment models with limit...

4 Dongyeop Kang, et al. ∙

research

∙ 03/14/2018

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

We present a new question set, text corpus, and baselines assembled to e...

0 Peter Clark, et al. ∙

research

∙ 04/19/2017

Answering Complex Questions Using Open Information Extraction

While there has been substantial progress in factoid question-answering ...

0 Tushar Khot, et al. ∙

research

∙ 04/20/2016

Question Answering via Integer Programming over Semi-Structured Knowledge

Answering science questions posed in natural language is an important AI...

0 Daniel Khashabi, et al. ∙

research

∙ 07/10/2015

Markov Logic Networks for Natural Language Question Answering

Our goal is to answer elementary-level science questions using knowledge...

0 Tushar Khot, et al. ∙

Tushar Khot

Featured Co-authors

Sign in with Google

Consider DeepAI Pro