Hyperedge Estimation using Polylogarithmic Subset Queries
Abstract
In this work, we estimate the number of hyperedges in a hypergraph H(U( H), F( H)), where U( H) denotes the set of vertices and F( H)) denotes the set of hyperedges. We assume a query oracle access to the hypergraph H. Estimating the number of edges, triangles or small subgraphs in a graph is a well studied problem. Beame ηl~and Bhattacharya ηl~gave algorithms to estimate the number of edges and triangles in a graph using queries to the Bipartite Independent Set ( BIS) and the Tripartite Independent Set ( TIS) oracles, respectively. We generalize the earlier works by estimating the number of hyperedges using a query oracle, known as the Generalized d-partite independent set oracle ( GPIS), that takes d (non-empty) pairwise disjoint subsets of vertices A1,…,Ad ⊂eq U( H) as input, and answers whether there exists a hyperedge in H having (exactly) one vertex in each Ai, i ∈ \1,2,…,d\. We give a randomized algorithm for the hyperedge estimation problem using the GPIS query oracle to output m for m( H) satisfying (1-ε) · m( H) ≤ m ≤ (1+ε) · m( H). The number of queries made by our algorithm, assuming d to be a constant, is polylogarithmic in the number of vertices of the hypergraph.