On additive averaging kernels for finite Markov chains

Michael C. H. Choi

On additive averaging kernels for finite Markov chains

Abstract

We study additive mixtures of Markov kernels of the form Aα = α P + (1-α)G, where α ∈ [0,1], P is a baseline sampler and G is a Gibbs kernel induced by a partition of the state space. We first motivate the study of Aα, which can be interpreted as the projection of a lifted Markov chain. We then consider the minimisation of distance to stationarity under two objectives: the squared Frobenius norm and the Kullback-Leibler (KL) divergence. For the Frobenius objective, we derive explicit trace formulas and identify a Cheeger-type functional that characterises optimal two-block partitions. This yields a structured combinatorial optimisation problem admitting a difference-of-submodular decomposition, enabling efficient approximation via majorisation-minimisation. We also obtain geometric decay rates governed by the absolute spectral gap of P. For the KL divergence, we establish convexity-based bounds showing that the divergence of Aα is controlled by those of both P and G, thereby reducing partition selection to the Gibbs component. Numerical experiments on the Curie-Weiss model demonstrate that suitable choice of both the partition and the parameter α can significantly accelerate convergence in total variation distance. We observe a consistent trade-off between local exploration and global averaging, with intermediate values of α achieving the best performance across regimes.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…