Unifying Ensemble Methods for Q-learning via Social Choice Theory

02/27/2019
by Rishav Chourasia, et al.

Ensemble methods have been widely applied in Reinforcement Learning (RL) to enhance stability, increase convergence speed, and improve exploration. These methods typically work by aggregating the actions proposed by different RL algorithms. We show that a variety of these methods can be unified by drawing parallels with committee voting rules in Social Choice Theory. We map the problem of designing an action aggregation mechanism in an ensemble method to a voting problem which, under different voting rules, yields popular ensemble-based RL algorithms such as Majority Voting Q-learning or Bootstrapped Q-learning. Our unification framework, in turn, allows us to design new ensemble-RL algorithms with better performance. For instance, we map two diversity-centered committee voting rules, namely the Single Non-Transferable Voting Rule and the Chamberlin-Courant Rule, into new RL algorithms that demonstrate excellent exploratory behavior in our experiments.
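To make the voting analogy concrete, here is a minimal sketch of action aggregation treated as an election: each ensemble member is a voter, each action is a candidate, and a member votes for its greedy action. This is an illustration under stated assumptions, not the authors' implementation. The function names, the tie-breaking rule (lowest action index), and the toy Q-values are all hypothetical. The second function shows only how the Single Non-Transferable Vote rule selects a committee of k actions; how the paper turns that committee into a learning algorithm is not covered by the abstract and is not attempted here.

```python
from collections import Counter


def greedy_action(q_row):
    """Index of the largest Q-value; ties go to the lowest index."""
    return max(range(len(q_row)), key=lambda a: q_row[a])


def plurality_action(q_values):
    """Each ensemble member casts one vote for its greedy action.

    q_values is a list of rows, one row of per-action Q-values per member.
    The action with the most votes wins; ties go to the lowest index
    (an assumption made for this sketch).
    """
    votes = Counter(greedy_action(row) for row in q_values)
    return min(votes, key=lambda a: (-votes[a], a))


def sntv_committee(q_values, k):
    """Single Non-Transferable Vote: every voter names one candidate and
    the k candidates with the most votes form the committee."""
    votes = Counter(greedy_action(row) for row in q_values)
    n_actions = len(q_values[0])
    # Counter returns 0 for actions that received no votes.
    ranked = sorted(range(n_actions), key=lambda a: (-votes[a], a))
    return ranked[:k]


# Toy ensemble: 4 members, 3 actions (values are made up).
q_values = [
    [0.1, 0.9, 0.3],
    [0.2, 0.8, 0.1],
    [0.7, 0.1, 0.2],
    [0.0, 0.2, 0.6],
]

print(plurality_action(q_values))   # single winning action
print(sntv_committee(q_values, 2))  # committee of two actions
```

In this toy ensemble two members prefer action 1, so plurality voting selects it, while a committee of size two also retains a minority-preferred action. That retention is the kind of diversity the abstract attributes to committee rules.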

