Proximal Bellman mappings for reinforcement learning and their application to robust adaptive filtering

09/14/2023
by Yuki Akiyama, et al.

This paper addresses the algorithmic and theoretical core of reinforcement learning (RL) by introducing a novel class of proximal Bellman mappings. These mappings are defined in reproducing kernel Hilbert spaces (RKHSs) to benefit from the rich approximation properties and the inner product of RKHSs. They are shown to belong to the powerful Hilbertian family of (firmly) nonexpansive mappings, regardless of the values of their discount factors, and they offer ample design freedom, both to reproduce attributes of the classical Bellman mappings and to pave the way for novel RL designs. An approximate policy-iteration scheme is built on the proposed class of mappings to solve the problem of selecting online, at every time instance, the "optimal" exponent p in a p-norm loss to combat outliers in linear adaptive filtering, without training data and without any knowledge of the statistical properties of the outliers. Numerical tests on synthetic data showcase the superior performance of the proposed framework over several non-RL and kernel-based RL schemes.
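To make the role of the exponent p concrete, the following is a minimal sketch of a linear adaptive filter trained by stochastic gradient descent on the loss |e|^p with a fixed p, commonly known as the least-mean-p-power (LMP) update. It is not the paper's method: the proposed RL scheme for choosing p online is not shown, and the function names, step size, filter length, noise levels, and outlier model below are all assumptions made for illustration. The sketch assumes 1 <= p <= 2.

```python
import random


def lmp_step(w, x, d, p, mu):
    """One stochastic-gradient step on the loss |d - w.x|**p (hypothetical helper)."""
    e = d - sum(wi * xi for wi, xi in zip(w, x))
    if e == 0.0:
        # Zero error: nothing to correct.
        return list(w), e
    # d/dw |e|**p = -p * |e|**(p-1) * sign(e) * x, so descend along +g * x.
    g = p * abs(e) ** (p - 1) * (1.0 if e > 0 else -1.0)
    return [wi + mu * g * xi for wi, xi in zip(w, x)], e


def run_lmp(p, n_steps=5000, mu=0.01, outlier_prob=0.05, seed=0):
    """Identify an assumed 3-tap system; return the squared weight error."""
    rng = random.Random(seed)
    w_true = [1.0, -0.5, 0.25]  # assumed ground-truth system
    w = [0.0] * len(w_true)
    for _ in range(n_steps):
        x = [rng.gauss(0.0, 1.0) for _ in w_true]
        noise = rng.gauss(0.0, 0.01)
        if rng.random() < outlier_prob:
            # Assumed outlier model: occasional large-variance impulses.
            noise += rng.gauss(0.0, 50.0)
        d = sum(a * b for a, b in zip(w_true, x)) + noise
        w, _ = lmp_step(w, x, d, p, mu)
    return sum((a - b) ** 2 for a, b in zip(w, w_true))


if __name__ == "__main__":
    for p in (1.0, 1.5, 2.0):
        print(p, run_lmp(p))
```

For p = 2 the update reduces to the least-mean-squares rule, whose correction grows linearly with the error, while for p = 1 it becomes a sign-error update whose correction magnitude is bounded. This trade-off between fast convergence and insensitivity to large errors is why the choice of p matters when outliers are present.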

research
10/21/2022

online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering

This paper introduces a solution to the problem of selecting dynamically...
research
10/20/2022

Dynamic selection of p-norm in linear adaptive filtering via online kernel-based reinforcement learning

This study addresses the problem of selecting dynamically, at each time ...
research
01/01/2020

Fast Estimation of Information Theoretic Learning Descriptors using Explicit Inner Product Spaces

Kernel methods form a theoretically-grounded, powerful and versatile fra...
research
12/03/2021

Reinforcement Learning-Based Automatic Berthing System

Previous studies on automatic berthing systems based on artificial neura...
research
06/16/2022

Reinforcement Learning in Macroeconomic Policy Design: A New Frontier?

Agent-based computational macroeconomics is a field with a rich academic...
research
09/13/2019

Towards an Adaptive Robot for Sports and Rehabilitation Coaching

The work presented in this paper aims to explore how, and to what extent...
research
05/08/2018

Phoneme-to-viseme mappings: the good, the bad, and the ugly

Visemes are the visual equivalent of phonemes. Although not precisely de...
