Constant-Factor Approximation for the Uniform Decision Tree
Abstract
We resolve a long-standing open question, about the existence of a constant-factor approximation algorithm for the average-case Decision Tree problem with uniform probability distribution over the hypotheses. We answer the question in the affirmative by providing a simple polynomial-time algorithm with approximation ratio of 21-(e+1)/(2e)+ε <11.57. This improves upon the currently best-known, greedy algorithm which achieves O( n/ n)-approximation. The first key ingredient in our analysis is the usage of a decomposition technique known from problems related to Hierarchical Clustering [SODA '17, WALCOM '26], which allows us to decompose the optimal decision tree into a series of objects called separating subfamilies. The second crucial idea is to reduce the subproblem of finding a Separating Subfamily to an instance of the Maximum Coverage problem. To do so, we analyze the properties of cutting cliques into small pieces, which represent pairs of hypotheses to be separated. This allows us to obtain a good approximation for the Separating Subfamily problem, which then enables the design of the approximation algorithm for the original problem.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.