Consistent model selection in the spiked Wigner model via AIC-type criteria

Abstract

Consider the spiked Wigner model \[ X = Σi = 1k λi ui ui + σ G, \] where G is an N × N GOE random matrix, and the eigenvalues λi are all spiked, i.e. above the Baik-Ben Arous-P\'ech\'e (BBP) threshold σ. We consider AIC-type model selection criteria of the form \[ -2 \, (maximised log-likelihood) + γ \, (number of parameters) \] for estimating the number k of spikes. For γ > 2, the above criterion is strongly consistent provided λk > λγ, where λγ is a threshold strictly above the BBP threshold, whereas for γ < 2, it almost surely overestimates k. Although AIC (which corresponds to γ = 2) is not strongly consistent, we show that taking γ = 2 + δN, where δN 0 and δN N-2/3, results in a weakly consistent estimator of k. We further show that a soft minimiser of AIC, where one chooses the least complex model whose AIC score is close to the minimum AIC score, is strongly consistent. Based on a spiked (generalised) Wigner representation, we also develop similar model selection criteria for consistently estimating the number of communities in a balanced stochastic block model under some sparsity restrictions.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…