Kush Bhatia

research

∙ 07/26/2023

Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models

The quality of training data impacts the performance of pre-trained larg...

0 Mayee F. Chen, et al. ∙

research

∙ 07/20/2023

Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification

Recent work has shown that language models' (LMs) prompt-based learning ...

0 Neel Guha, et al. ∙

research

∙ 06/13/2023

TART: A plug-and-play Transformer module for task-agnostic reasoning

Large language models (LLMs) exhibit in-context learning abilities which...

26 Kush Bhatia, et al. ∙

research

∙ 02/23/2023

Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

Specifying reward functions for complex tasks like object manipulation o...

0 Kush Bhatia, et al. ∙

research

∙ 01/23/2023

Congested Bandits: Optimal Routing via Short-term Resets

For traffic routing platforms, the choice of which route to recommend to...

0 Pranjal Awasthi, et al. ∙

research

∙ 12/09/2022

On the Sensitivity of Reward Inference to Misspecified Human Models

Inferring reward functions from human behavior is at the center of value...

0 Joey Hong, et al. ∙

research

∙ 10/05/2022

Ask Me Anything: A simple strategy for prompting language models

Large language models (LLMs) transfer well to new tasks out-of-the-box s...

10 Simran Arora, et al. ∙

research

∙ 01/10/2022

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models

Reward hacking – where RL agents exploit gaps in misspecified reward fun...

0 Alexander Pan, et al. ∙

research

∙ 05/05/2021

Preference learning along multiple criteria: A game-theoretic perspective

The literature on ranking from ordinal data is vast, and there are sever...

0 Kush Bhatia, et al. ∙

research

∙ 04/17/2021

Agnostic learning with unknown utilities

Traditional learning approaches for classification implicitly assume tha...

0 Kush Bhatia, et al. ∙

research

∙ 12/03/2020

Online learning with dynamics: A minimax perspective

We study the problem of online learning with dynamics, where a learner i...

0 Kush Bhatia, et al. ∙

research

∙ 07/27/2019

Bayesian Robustness: A Nonasymptotic Viewpoint

We study the problem of robustly estimating the posterior distribution f...

18 Kush Bhatia, et al. ∙

research

∙ 03/19/2019

Adaptive Hard Thresholding for Near-optimal Consistent Robust Regression

We study the problem of robust linear regression with response variable ...

0 Arun Sai Suggala, et al. ∙

research

∙ 01/08/2019

FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network

This paper develops the FastRNN and FastGRNN algorithms to address the t...

0 Aditya Kusupati, et al. ∙

research

∙ 12/20/2018

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

We study derivative-free methods for policy optimization over the class ...

0 Dhruv Malik, et al. ∙

research

∙ 11/20/2018

Gen-Oja: A Simple and Efficient Algorithm for Streaming Generalized Eigenvector Computation

In this paper, we study the problems of principal Generalized Eigenvecto...

0 Kush Bhatia, et al. ∙

research

∙ 10/18/2018

Establishing Appropriate Trust via Critical States

In order to effectively interact with or supervise a robot, humans need ...

0 Sandy H. Huang, et al. ∙

research

∙ 07/01/2016

Efficient and Consistent Robust Time Series Analysis

We study the problem of robust time series analysis under the standard a...

0 Kush Bhatia, et al. ∙

research

∙ 06/08/2015

Robust Regression via Hard Thresholding

We study the problem of Robust Least Squares Regression (RLSR) where sev...

0 Kush Bhatia, et al. ∙

Kush Bhatia

Featured Co-authors

Sign in with Google

Consider DeepAI Pro