Robust density estimation over star-shaped density classes
Abstract
We establish a novel criterion for comparing the performance of two densities, g1 and g2, within the context of corrupted data. Utilizing this criterion, we propose an algorithm to construct a density estimator within a star-shaped density class, F, under conditions of data corruption. We proceed to derive the minimax upper and lower bounds for density estimation across this star-shaped density class, characterized by densities that are uniformly bounded above and below (in the sup norm), in the presence of adversarially corrupted data. Specifically, we assume that a fraction ε ≤ 13 of the N observations are arbitrarily corrupted. We obtain the minimax upper bound \ τJ2, ε \ d2. Under certain conditions, we obtain the minimax risk, up to proportionality constants, under the squared L2 loss as \ τ*2 d2, ε d2 \, where τ* := \ τ : Nτ2 ≤ MFloc(τ, c) \ for a sufficiently large constant c. Here, MFloc(τ, c) denotes the local entropy of the set F, and d is the L2 diameter of F.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.