PathDB: A system for evaluating regular path queries

Abstract

Regular Path Queries (RPQs) are a core mechanism for expressing recursion and reachability in graph databases. However, most systems evaluate RPQs with traversal-based algorithms that repeatedly explore overlapping subpaths and offer limited control over path semantics. We present PathDB, an algebraic query engine for RPQs based on a closed path algebra over multisets of paths with five operators: selection, join, union, recursive join, and projection. PathDB provides (i) a GQL-inspired declarative language that supports RPQs with multiple semantics (walk, trail, simple, and acyclic), (ii) an operator-at-a-time execution procedure analogous to relational query processing, and (iii) result formats that include full paths, not only endpoint pairs. The experimental evaluation, based on four LDBC Social Network Benchmark property graphs and a workload of 142 queries derived from 26 path patterns, showed that PathDB outperforms two automaton-guided traversal baselines (DFS and BFS), often by more than an order of magnitude.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…