Recent work has shown that fine-tuning large pre-trained language models...
Large language models (LLMs) have exhibited remarkable capabilities in l...
Mixture of Experts layers (MoEs) enable efficient scaling of language mo...
Do language models have beliefs about the world? Dennett (1995) famously...
Current abstractive summarization systems outperform their extractive co...
Natural language (NL) explanations of model predictions are gaining popu...
While research on explaining predictions of open-domain QA systems (ODQA...
State-of-the-art Machine Reading Comprehension (MRC) models for Open-dom...
We present ELQ, a fast end-to-end entity linking model for questions, wh...
We introduce a very deep and light-weight transformer, DeLighT, that del...
Interactive programming with interleaved code snippet cells and natural ...
Programmers typically organize executable source code using high-level c...
Source code is rarely written in isolation. It depends significantly on ...
We propose a context-dependent model to map utterances within an interac...
We present an approach to rapidly and easily build natural language inte...
Sequence-to-sequence models have shown strong performance across a broad...