Tree density estimation
Abstract
We study the problem of estimating the density f( x) of a random vector X in Rd. For a spanning tree T defined on the vertex set \1,… ,d\, the tree density fT is a product of bivariate conditional densities. An optimal spanning tree minimizes the Kullback-Leibler divergence between f and fT. From i.i.d. data we identify an optimal tree T* and efficiently construct a tree density estimate fn such that, without any regularity conditions on the density f, one has n ∞ ∫ |fn( x)-fT*( x)|d x=0 a.s. For Lipschitz f with bounded support, E \ ∫ |fn( x)-fT*( x)|d x\=O(n-1/4), a dimension-free rate.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.