Shortest Path Centrality and the APSP problem via VC-dimension and Rademacher Averages

Abstract

In this paper we are interested in a version of the All-pairs Shortest Paths problem (APSP) that fits neither in the exact nor in the approximate case. We define a measure of centrality of a shortest path, related to the ``importance'' of such shortest path in the graph, and propose an algorithm based on the idea of progressive sampling that, for any fixed constants 0 < ε, δ < 1, given an undirected graph G with non-negative edge weights, outputs with probability 1 - δ a data structure of size n · DiamV(G), where DiamV(G) is the vertex diameter of G, in expected time O( n (m + n n, n · DiamV(G))) containing the (exact) distance and the shortest path between every pair of vertices (u,v) that has centrality at least ε. The progressive sampling technique is sensitive to the probability distribution of the input (if we assume that G is chosen from a prescribed random distribution), but even in the case where we take no assumption about such distribution, we show an upper bound for the sample size using VC-dimension theory that is tighter than the bound given by standard Hoeffding and union bounds, since VC-dimension theory captures the combinatorial structure of the input graph.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…