Minimax fast rates for discriminant analysis with errors in variables
Abstract
The effect of measurement errors in discriminant analysis is investigated. Given observations Z=X+ε, where ε denotes random noise, the goal is to predict the density of X among two possible candidates f and g. We assume that two learning samples are available. The aim is to approach the best possible decision rule G, defined as a minimizer of the Bayes risk. In the noise-free case (ε=0), minimax fast rates of convergence are well known under the margin assumption in discriminant analysis (see mammen) or in the more general classification framework (see tsybakov2004,AT). In this paper we intend to establish similar results in the noisy case, i.e. when dealing with errors in variables. We prove minimax lower bounds for this problem and explain how these rates can be attained, using in particular an Empirical Risk Minimizer (ERM) method based on deconvolution kernel estimators.
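To make the notion of a deconvolution kernel estimator concrete, here is a minimal sketch in one dimension. It rests on assumptions that are not taken from the paper: the noise ε is Laplace with known scale b, the kernel is Gaussian, and the bandwidth h is fixed by hand. Under these assumptions the deconvolution kernel has the closed form K(u)(1 + (b/h)²(1 − u²)). The decision set is then built by a simple plug-in comparison of the two estimated densities, which is a stand-in for the ERM procedure studied in the paper, not an implementation of it. All function names are hypothetical.

```python
import math
import random


def deconv_kernel(u, b, h):
    """Gaussian kernel corrected for Laplace(0, b) noise at bandwidth h.

    Assumption: the noise characteristic function is 1 / (1 + b^2 t^2),
    which gives the closed form K(u) - (b/h)^2 K''(u).
    """
    gauss = math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)
    return gauss * (1.0 + (b / h) ** 2 * (1.0 - u * u))


def deconv_density(x, sample, b, h):
    """Estimate the density of X at x from noisy observations Z = X + eps."""
    n = len(sample)
    return sum(deconv_kernel((x - z) / h, b, h) for z in sample) / (n * h)


def laplace_noise(rng, b):
    # The difference of two independent exponentials with mean b is Laplace(0, b).
    return rng.expovariate(1.0 / b) - rng.expovariate(1.0 / b)


def noisy_sample(rng, mean, n, b):
    """Draw n observations Z = X + eps with X ~ N(mean, 1) (illustrative choice)."""
    return [rng.gauss(mean, 1.0) + laplace_noise(rng, b) for _ in range(n)]


rng = random.Random(0)
b, h = 0.5, 0.6                          # noise scale and bandwidth (assumed)
z_f = noisy_sample(rng, -1.0, 500, b)    # learning sample for candidate f
z_g = noisy_sample(rng, 1.0, 500, b)     # learning sample for candidate g


def in_G(x):
    """Plug-in decision rule: assign x to f when the estimate of f dominates."""
    return deconv_density(x, z_f, b, h) >= deconv_density(x, z_g, b, h)
```

The correction term (b/h)²(1 − u²) grows as h shrinks, which illustrates why the variance of a deconvolution estimator is larger than in the noise-free setting and why the bandwidth has to be chosen with the noise in mind.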