Learning with little mixing

06/16/2022
by Ingvar Ziemann, et al.

We study square loss in a realizable time-series framework with martingale difference noise. Our main result is a fast-rate excess risk bound which shows that, whenever a trajectory hypercontractivity condition holds, the risk of the least-squares estimator on dependent data matches the iid rate order-wise after a burn-in time. In comparison, many existing results in learning from dependent data have rates where the effective sample size is deflated by a factor of the mixing time of the underlying process, even after the burn-in time. Furthermore, our results allow the covariate process to exhibit long-range correlations which are substantially weaker than geometric ergodicity. We call this phenomenon learning with little mixing, and present several examples of when it occurs: bounded function classes for which the L^2 and L^{2+ϵ} norms are equivalent, ergodic finite-state Markov chains, various parametric models, and a broad family of infinite-dimensional ℓ^2(ℕ) ellipsoids. By instantiating our main result to system identification of nonlinear dynamics with generalized linear model transitions, we obtain a nearly minimax optimal excess risk bound after only a polynomial burn-in time.
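As a rough illustration of the setting described above, the following sketch fits a least-squares estimator to a single dependent trajectory. It is not taken from the paper: it assumes a scalar linear autoregression x_{t+1} = a x_t + w_t with i.i.d. Gaussian noise (a simple special case of martingale difference noise), and the names `simulate_trajectory` and `least_squares_estimate` are hypothetical helpers introduced only for this example.

```python
import numpy as np


def simulate_trajectory(a_true, T, noise_std, rng):
    """Simulate x_{t+1} = a_true * x_t + w_t, starting from x_0 = 0.

    The noise w_t is i.i.d. Gaussian here (an assumption for illustration),
    which is one particular instance of martingale difference noise.
    """
    x = np.zeros(T + 1)
    w = rng.normal(0.0, noise_std, size=T)
    for t in range(T):
        x[t + 1] = a_true * x[t] + w[t]
    return x


def least_squares_estimate(x):
    """Least-squares fit of a in x_{t+1} ~ a * x_t from one trajectory."""
    covariates, targets = x[:-1], x[1:]
    return float(covariates @ targets / (covariates @ covariates))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a_true = 0.9
    # The covariates x_t are correlated across time, yet the estimation
    # error still shrinks as the trajectory length T grows.
    for T in (100, 1_000, 10_000):
        x = simulate_trajectory(a_true, T, noise_std=1.0, rng=rng)
        a_hat = least_squares_estimate(x)
        print(f"T={T:6d}  |a_hat - a_true| = {abs(a_hat - a_true):.4f}")
```

Running the script prints the absolute estimation error for increasing trajectory lengths. The linear model is chosen only because it is the simplest parametric case; the abstract's results concern far more general function classes.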


