Linear Run Time of Persistent Homology Computation with GPU Parallelization

Michael G. Rawson

Linear Run Time of Persistent Homology Computation with GPU Parallelization

Abstract

Persistent homology is a crucial invariant that is used in many areas to understand data. The O(N4) run time is a hindrance to its use on most large datasets. We give a parallelization method to utilize multi-core machines and clusters. We implement the computation of the 0th persistent homology with OpenMP parallelization and observe a 1.75 fold performance increase by using 2 threads on a dual core machine. We also benchmark the computation using larger numbers of threads and show that the thread computational overhead decreases performance. With GPU parallelization, we analytically and empirically decrease the run time scaling from O(N4) to O(N3) and even O(N2) where N is the number of data points, for a large enough GPU. Next, we analytically show run time scaling O(N) for an even larger GPU.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…