Lior Shani

research

∙ 05/31/2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

Despite the seeming success of contemporary grounded text generation sys...

Paul Roit, et al.

research

∙ 02/04/2023

Reinforcement Learning with History-Dependent Dynamic Contexts

We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a no...

Guy Tennenholtz, et al.

research

∙ 05/30/2022

Reinforcement Learning with a Terminator

We present the problem of reinforcement learning with exogenous terminat...

Guy Tennenholtz, et al.

research

∙ 02/13/2021

Online Apprenticeship Learning

In Apprenticeship Learning (AL), we are given a Markov Decision Process ...

Lior Shani, et al.

research

∙ 05/20/2020

Mirror Descent Policy Optimization

We propose deep Reinforcement Learning (RL) algorithms inspired by mirro...

Manan Tomar, et al.

research

∙ 02/19/2020

Optimistic Policy Optimization with Bandit Feedback

Policy optimization methods are one of the most widely used classes of R...

Yonathan Efroni, et al.

research

∙ 09/06/2019

Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs

Trust region policy optimization (TRPO) is a popular and empirically suc...

Lior Shani, et al.

research

∙ 12/17/2018

Multi Instance Learning For Unbalanced Data

In the context of Multi Instance Learning, we analyze the Single Instanc...

Mark Kozdoba, et al.

research

∙ 12/13/2018

Revisiting Exploration-Conscious Reinforcement Learning

The objective of Reinforcement Learning is to learn an optimal policy by...

Lior Shani, et al.
