Hoeffding's lemma for Markov Chains and its applications to statistical learning

Qiang Sun

Hoeffding's lemma for Markov Chains and its applications to statistical learning

Abstract

We extend Hoeffding's lemma to general-state-space and not necessarily reversible Markov chains. Let \Xi\i 1 be a stationary Markov chain with invariant measure π and absolute spectral gap 1-λ, where λ is defined as the operator norm of the transition kernel acting on mean zero and square-integrable functions with respect to π. Then, for any bounded functions fi: x [ai,bi], the sum of fi(Xi) is sub-Gaussian with variance proxy 1+λ1-λ · Σi (bi-ai)24. This result differs from the classical Hoeffding's lemma by a multiplicative coefficient of (1+λ)/(1-λ), and simplifies to the latter when λ = 0. The counterpart of Hoeffding's inequality for Markov chains immediately follows. Our results assume none of countable state space, reversibility and time-homogeneity of Markov chains and cover time-dependent functions with various ranges. We illustrate the utility of these results by applying them to six problems in statistics and machine learning.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…