
Learning Zerosum Stochastic Games with Posterior Sampling
In this paper, we propose Posterior Sampling Reinforcement Learning for ...
read it

Online Learning for Cooperative MultiPlayer MultiArmed Bandits
We introduce a framework for decentralized online learning for multiarm...
read it

Quantum secure nonmalleableextractors
We construct several explicit quantum secure nonmalleableextractors. A...
read it

Dynamic Metatheorems for Distance and Matching
Reachability, distance, and matching are some of the most fundamental gr...
read it

On relating oneway classical and quantum communication complexities
Let f: X × Y →{0,1,} be a partial function and μ be a distribution with ...
read it

Implicit FiniteHorizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
We introduce a generic template for developing regret minimization algor...
read it

Online Learning for Stochastic Shortest Path Model via Posterior Sampling
We consider the problem of online reinforcement learning for the Stochas...
read it

A direct product theorem for quantum communication complexity with applications to deviceindependent QKD
We give a direct product theorem for the entanglementassisted interacti...
read it

Quantum Measurement Adversary
Multisourceextractors are functions that extract uniform randomness fr...
read it

Optimal communication and control strategies in a multiagent MDP problem
The problem of controlling multiagent systems under different models of...
read it

Oneshot quantum state redistribution and quantum Markov chains
We revisit the task of quantum state redistribution in the oneshot sett...
read it

Reachability and Matching in Single Crossing Minor Free Graphs
We construct in Logspace nonzero circulations for Hminor free graphs w...
read it

Online Learning for Unknown Partially Observable MDPs
Solving Partially Observable Markov Decision Processes (POMDPs) is hard....
read it

SpaceEfficient Algorithms for Reachability in Geometric Graphs
The problem of graph Reachability is to decide whether there is a path f...
read it

Designing Interpretable Approximations to Deep Reinforcement Learning with Soft Decision Trees
In an ever expanding set of research and application areas, deep neural ...
read it

A SampleEfficient Algorithm for Episodic FiniteHorizon MDP with Constraints
Constrained Markov Decision Processes (CMDPs) formalize sequential decis...
read it

A Direct Product Theorem for OneWay Quantum Communication
We prove a direct product theorem for the oneway entanglementassisted ...
read it

A nearoptimal directsum theorem for communication complexity
We show a near optimal directsum theorem for the twoparty randomized c...
read it

Learning Infinitehorizon Averagereward MDPs with Linear Function Approximation
We develop several new algorithms for learning Markov Decision Processes...
read it

A Modelfree Learning Algorithm for Infinitehorizon Averagereward MDPs with Nearoptimal Regret
Recently, modelfree reinforcement learning has attracted research atten...
read it

Randomized Policy Learning for Continuous State and Action MDPs
Deep reinforcement learning methods have achieved stateoftheart resul...
read it

Multiple Source Replacement Path Problem
One of the classical line of work in graph algorithms has been the Repla...
read it

Time Space Optimal Algorithm for Computing Separators in Bounded Genus Graphs
A graph separator is a subset of vertices of a graph whose removal divid...
read it

Modelfree Reinforcement Learning in Infinitehorizon Averagereward Markov Decision Processes
Modelfree reinforcement learning is known to be memory and computation ...
read it

A TwoStage Market Mechanism for Electricity with Renewable Generation
We consider a two stage market mechanism for trading electricity includi...
read it

Grid Graph Reachability
The reachability problem is to determine if there exists a path from one...
read it

Reachability in High Treewidth Graphs
Reachability is the problem of deciding whether there is a path from one...
read it

Visionbased Obstacle Removal System for Autonomous Ground Vehicles Using a Robotic Arm
Over the past few years, the use of cameraequipped robotic platforms fo...
read it

A Two Stage Mechanism For Selling Random Power
We present a two stage auction mechanism that renewable generators (or a...
read it

Oneshot Capacity bounds on the Simultaneous Transmission of Classical and Quantum Information
We study the communication capabilities of a quantum channel under the m...
read it

Partially smoothed information measures
Smooth entropies are a tool for quantifying resource tradeoffs in (quan...
read it

A Fixed Point Theorem for Iterative Random Contraction Operators over Banach Spaces
Consider a contraction operator T over a Banach space X with a fixed po...
read it

Quadratically Tight Relations for Randomized Query Complexity
Let f:{0,1}^n →{0,1} be a Boolean function. The certificate complexity C...
read it

On RegretOptimal Learning in Decentralized Multiplayer Multiarmed Bandits
We consider the problem of learning in singleplayer and multiplayer mul...
read it
Rahul Jain
is this you? claim profile