Generalization Analysis for Game-Theoretic Machine Learning

10/09/2014
by   Haifang Li, et al.
0

For Internet applications like sponsored search, cautions need to be taken when using machine learning to optimize their mechanisms (e.g., auction) since self-interested agents in these applications may change their behaviors (and thus the data distribution) in response to the mechanisms. To tackle this problem, a framework called game-theoretic machine learning (GTML) was recently proposed, which first learns a Markov behavior model to characterize agents' behaviors, and then learns the optimal mechanism by simulating agents' behavior changes in response to the mechanism. While GTML has demonstrated practical success, its generalization analysis is challenging because the behavior data are non-i.i.d. and dependent on the mechanism. To address this challenge, first, we decompose the generalization error for GTML into the behavior learning error and the mechanism learning error; second, for the behavior learning error, we obtain novel non-asymptotic error bounds for both parametric and non-parametric behavior learning methods; third, for the mechanism learning error, we derive a uniform convergence bound based on a new concept called nested covering number of the mechanism space and the generalization analysis techniques developed for mixing sequences. To the best of our knowledge, this is the first work on the generalization analysis of GTML, and we believe it has general implications to the theoretical analysis of other complicated machine learning problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2014

A Game-theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search

Sponsored search is an important monetization channel for search engines...
research
02/03/2021

Information-Theoretic Bounds on the Moments of the Generalization Error of Learning Algorithms

Generalization error bounds are critical to understanding the performanc...
research
02/09/2022

An Exploration of Multicalibration Uniform Convergence Bounds

Recent works have investigated the sample complexity necessary for fair ...
research
08/27/2019

Infochain: A Decentralized System for Truthful Information Elicitation

Incentive mechanisms play a pivotal role in collecting correct and relia...
research
07/08/2022

Generalization-Memorization Machines

Classifying the training data correctly without over-fitting is one of t...
research
05/13/2022

Modeling Human Behavior Part I – Learning and Belief Approaches

There is a clear desire to model and comprehend human behavior. Trends i...
research
05/15/2023

Robust Auction Design with Support Information

A seller wants to sell an item to n buyers. The buyer valuations are dra...

Please sign up or login with your details

Forgot password? Click here to reset