Learning in Random Utility Models Via Online Decision Problems

12/21/2021
by   Emerson Melo, et al.
0

This paper studies the Random Utility Model (RUM) in environments where the decision maker is imperfectly informed about the payoffs associated to each of the alternatives he faces. By embedding the RUM into an online decision problem, we make four contributions. First, we propose a gradient-based learning algorithm and show that a large class of RUMs are Hannan consistent (<cit.>); that is, the average difference between the expected payoffs generated by a RUM and that of the best fixed policy in hindsight goes to zero as the number of periods increase. Second, we show that the class of Generalized Extreme Value (GEV) models can be implemented with our learning algorithm. Examples in the GEV class include the Nested Logit, Ordered, and Product Differentiation models among many others. Third, we show that our gradient-based algorithm is the dual, in a convex analysis sense, of the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the Machine Learning literature. Finally, we discuss how our approach can incorporate recency bias and be used to implement prediction markets in general environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/26/2010

A New Understanding of Prediction Markets Via No-Regret Learning

We explore the striking mathematical connections that exist between mark...
research
04/16/2018

On the Convergence of Competitive, Multi-Agent Gradient-Based Learning

As learning algorithms are increasingly deployed in markets and other co...
research
09/05/2023

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

In this paper, we consider an infinite horizon average reward Markov Dec...
research
07/17/2021

Tâtonnement Beyond Constant Elasticity of Substitution

In this paper, we bring consumer theory to bear in the analysis of Fishe...
research
01/16/2015

Stochastic Gradient Based Extreme Learning Machines For Online Learning of Advanced Combustion Engines

In this article, a stochastic gradient based online learning algorithm f...
research
06/11/2020

Optimally Deceiving a Learning Leader in Stackelberg Games

Recent results in the ML community have revealed that learning algorithm...
research
07/14/2018

Generalization in quasi-periodic environments

By and large the behavior of stochastic gradient is regarded as a challe...

Please sign up or login with your details

Forgot password? Click here to reset