Accurate Community Detection in the Stochastic Block Model via Spectral Algorithms

Abstract

We consider the problem of community detection in the Stochastic Block Model with a finite number K of communities whose sizes grow linearly with the network size n. This model is a random graph in which each pair of vertices is connected independently, with probability p within communities and q across communities. One observes a realization of this random graph, and the objective is to reconstruct the communities from this observation. We show that under spectral algorithms, the number of misclassified vertices does not exceed s with high probability as n grows large, whenever $pn=\omega(1)$, $s=o(n)$ and

\begin{equation*}
\liminf_{n\to\infty} \frac{n\left(\alpha_1 p+\alpha_2 q-(\alpha_1+\alpha_2)\,p^{\frac{\alpha_1}{\alpha_1+\alpha_2}}q^{\frac{\alpha_2}{\alpha_1+\alpha_2}}\right)}{\log\left(\frac{n}{s}\right)} > 1, \tag{1}
\end{equation*}

where $\alpha_1$ and $\alpha_2$ denote the (fixed) proportions of vertices in the two smallest communities. In view of recent work by Abbe et al. and Mossel et al., this establishes that the proposed spectral algorithms are able to exactly recover communities whenever this is at all possible in the case of networks with two communities of equal sizes. We conjecture that condition (1) is actually necessary to obtain fewer than s misclassified vertices asymptotically, which would establish the optimality of spectral methods in more general scenarios.
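To make the model and condition (1) concrete, the following is a minimal Python sketch, not the paper's algorithm. The function names `sample_sbm` and `condition_ratio` are hypothetical, chosen here for illustration. The first draws a graph as the abstract describes the model; the second evaluates the ratio inside the lim inf of (1) at a finite n, which can only suggest, not prove, whether the asymptotic condition holds. The scaling p = a log(n)/n, q = b log(n)/n and the choice s = 1 in the usage example are assumptions made for the example.

```python
import math
import random


def sample_sbm(sizes, p, q, seed=None):
    """Draw one graph from the model described in the abstract.

    sizes: list of community sizes (hypothetical input format).
    Returns (labels, edges), where labels[i] is the community of vertex i
    and edges is a list of pairs (i, j) with i < j.
    """
    rng = random.Random(seed)
    labels = [k for k, size in enumerate(sizes) for _ in range(size)]
    n = len(labels)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            # Probability p within a community, q across communities.
            prob = p if labels[i] == labels[j] else q
            if rng.random() < prob:
                edges.append((i, j))
    return labels, edges


def condition_ratio(n, p, q, alpha1, alpha2, s):
    """Finite-n value of the ratio inside the lim inf of condition (1)."""
    total = alpha1 + alpha2
    # Weighted geometric mean term p^(a1/(a1+a2)) * q^(a2/(a1+a2)).
    geometric = p ** (alpha1 / total) * q ** (alpha2 / total)
    numerator = n * (alpha1 * p + alpha2 * q - total * geometric)
    return numerator / math.log(n / s)


if __name__ == "__main__":
    # Assumed example: two equal communities, logarithmic degree scaling.
    n = 10_000
    a, b = 9.0, 1.0
    p = a * math.log(n) / n
    q = b * math.log(n) / n
    print(condition_ratio(n, p, q, 0.5, 0.5, s=1))
```

With two equal communities (both proportions 1/2) and s = 1, the ratio simplifies algebraically to (a + b)/2 - sqrt(ab) = (sqrt(a) - sqrt(b))^2 / 2, so for a = 9 and b = 1 the printed value is close to 2, up to floating-point error, and exceeds the threshold of 1.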
