Private Multi-File Retrieval From Distributed Databases

Abstract

Suppose there are N distributed databases each storing a full set of M independent files. A user wants to retrieve r out of the M files without revealing the identity of the r files. When r=1 it is the classic problem of private information retrieval (PIR). In this paper we study the problem of private multi-file retrieval (PMFR) which covers the case of general r. We first prove an upper bound on the capacity of PMFR schemes which indicates the minimum possible download size per unit of retrieved files. Then we design a general PMFR scheme which happens to attain the upper bound when r≥M2, thus achieving the optimal communication cost. As r goes down we show the trivial approach of executing r independent PIR instances achieves the near optimal communication cost. Comparing with the capacity-achieving PIR schemes, our PMFR scheme reduces the number of subpackages needed for each file from NM to N2, which implies a great reduction of implementation complexity.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…