A Hash Table Without Hash Functions, and How to Get the Most Out of Your Random Bits
Abstract
This paper considers the basic question of how strong of a probabilistic guarantee can a hash table, storing n (1 + (1)) n-bit key/value pairs, offer? Past work on this question has been bottlenecked by limitations of the known families of hash functions: The only hash tables to achieve failure probabilities less than 1 / 2 n require access to fully-random hash functions -- if the same hash tables are implemented using the known explicit families of hash functions, their failure probabilities become 1 / (n). To get around these obstacles, we show how to construct a randomized data structure that has the same guarantees as a hash table, but that avoids the direct use of hash functions. Building on this, we are able to construct a hash table using O(n) random bits that achieves failure probability 1 / nn1 - ε for an arbitrary positive constant ε. In fact, we show that this guarantee can even be achieved by a succinct dictionary, that is, by a dictionary that uses space within a 1 + o(1) factor of the information-theoretic optimum. Finally we also construct a succinct hash table whose probabilistic guarantees fall on a different extreme, offering a failure probability of 1 / (n) while using only O( n) random bits. This latter result matches (up to low-order terms) a guarantee previously achieved by Dietzfelbinger et al., but with increased space efficiency and with several surprising technical components.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.