Counting HyperGraphlets via Color Coding: a Quadratic Barrier and How to Break It
Abstract
We study the problem of counting k-hypergraphlets, an interesting but surprisingly ignored primitive, with the aim of understanding whether efficient algorithms exist. To this end, we consider color coding, a well-known technique for approximately counting k-graphlets in graphs. Our first result is that, on hypergraphs, color coding encounters a quadratic barrier: under the Orthogonal Vector Conjecture, no implementation can run in sub-quadratic time in the input size. We then introduce a simple property, (α,β)-niceness, that hypergraphs from real-world datasets appear to satisfy for small values of α and β. Intuitively, an (α,β)-nice hypergraph can be split into two sub-hypergraphs having respectively rank at most α and degree at most β. By applying different techniques to each sub-hypergraph and carefully combining the outputs, we show how to run color coding in time 2O(k) · (2β |V| + αk |E| + α2 β \|H\|), where H=(V,E) is the input hypergraph. Afterwards, we can sample colorful k-hypergraphlets uniformly in expected kO(k) · (β2 + |V|) time per sample. Experiments on real-world hypergraphs show that our algorithm significantly outperforms the naive quadratic algorithm, sometimes by more than an order of magnitude.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.