Rating-based Reinforcement Learning

07/30/2023
by   Devin White, et al.
0

This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individual trajectories without relative comparisons between sample pairs. The rating-based reinforcement learning approach builds on a new prediction model for human ratings and a novel multi-class loss function. We conduct several experimental studies based on synthetic ratings and real human ratings to evaluate the effectiveness and benefits of the new rating-based reinforcement learning approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2019

Elo ratings applied to TopCoder SRM

We present an Elo-based rating method for ranked multi-player games. We ...
research
03/12/2022

Discrete, recurrent, and scalable patterns in human judgement underlie affective picture ratings

Operant keypress tasks, where each action has a consequence, have been a...
research
05/22/2023

Dean Oliver's Four Factors Revisited

This paper studies the relationship between basketball teams' four facto...
research
12/14/2021

An Empirical Study of Least Squares Ratings for USA Ultimate Frisbee

Ultimate frisbee is one of the fastest-growing sports in the world. In t...
research
06/11/2020

Surveys without Questions: A Reinforcement Learning Approach

The 'old world' instrument, survey, remains a tool of choice for firms t...
research
08/27/2022

SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning

We propose a recommendation system that suggests treatment strategies to...
research
09/02/2019

All You Need is Ratings: A Clustering Approach to Synthetic Rating Datasets Generation

The public availability of collections containing user preferences is of...

Please sign up or login with your details

Forgot password? Click here to reset