ALBATROSS: Cheap Filtration Based Geometry via Stochastic Sub-Sampling
Abstract
Topological data analysis (TDA) detects geometric structure in biological data. However, many TDA algorithms are memory-intensive and impractical for massive datasets. Here, we introduce a statistical protocol that reduces TDA's memory requirements and makes it accessible to scientists with modest computing resources. We validate this protocol against two empirical datasets, showing that it replicates previous findings with much lower memory requirements. Finally, we demonstrate the power of the protocol by mapping the topology of functional correlations for the human cortex at high spatial resolution, which was previously infeasible.