Many-to-many Correspondences between Partitions: Introducing a Cut-based Approach
Abstract
Let P and P' be finite partitions of the set V. Finding good correspondences between the parts of P and those of P' is helpful in classification, pattern recognition, and network analysis. Unlike common similarity measures for partitions that yield only a single value, we provide specifics on how P and P' correspond to each other. To this end, we first define natural collections of best correspondences under three constraints , , and . In case of , the best correspondences form a minimum cut basis of a certain bipartite graph, whereas the other two lead to minimum cut bases of P P'. We also introduce a constraint, , which tightens ; both are useful for finding consensus partitions. We then develop branch-and-bound algorithms for finding minimum Ps-Pt cuts of P and thus P -1 best correspondences under , , and , respectively. In a case study, we use the correspondences to gain insight into a community detection algorithm. The results suggest, among others, that only very minor losses in the quality of the correspondences occur if the branch-and-bound algorithm is restricted to its greedy core. Thus, even for graphs with more than half a million nodes and hundreds of communities, we can find hundreds of best or almost best correspondences in less than a minute.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.