Selection principles for quasi-stationary distributions and reinforcement processes
Abstract
Let \(P\) be a sub-Markov matrix on a finite set \(S\), representing the transition probabilities of a Markov chain on \(S\) absorbed at a cemetery point \(∂ S\). We consider a reinforced process \((Xn,μn)\) defined as follows: \((Xn)\) behaves like a chain with kernel \(P\) until it dies, and when it dies at time \(n\), it is instantaneously ``resurrected'' at a point sampled according to its weighted past occupation measure \[ μn = 1Wn ( w0μ0+Σk=1n wkδXk ), Wn=Σk=0n wk, \] where the positive weights wk satisfy certain technical assumptions, a typical example being given by wk = kq, with q≥ -1. When \(P\) is irreducible, the behaviour of \((μn)\) is well understood AFP, bansaye2022non: it converges almost surely toward the unique quasi-stationary distribution (QSD) of \(P\). The purpose of this paper is to investigate the general situation where \(P\) is not irreducible. Under generic assumptions on \(P\), there are finitely many QSDs. We prove that the asymptotic selection depends on the summability of the inverse cumulative weights \(1/Wn\). If \[ Σn≥01Wn=∞, \] then \((μn)\) almost surely converges toward the QSD associated with the largest Perron value. If instead \[ Σn≥01Wn<∞, \] then each QSD is selected with positive probability. In particular, for polynomial weights \(w0=1\) and \(wk=kq\), \(k≥1\), this gives almost sure selection of the QSD with largest Perron value for \(-1≤ q≤0\), whereas each quasi-stationary distribution is selected with positive probability for \(q>0\).
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.