Robust estimation of a regression function in exponential families

Abstract

We observe n pairs of independent (but not necessarily i.i.d.) random variables X1=(W1,Y1),…,Xn=(Wn,Yn) and tackle the problem of estimating the conditional distributions Qi(wi) of Yi given Wi=wi for all i∈\1,…,n\. Even though these might not be true, we base our estimator on the assumptions that the data are i.i.d.\ and the conditional distributions of Yi given Wi=wi belong to a one parameter exponential family Q with parameter space given by an interval I. More precisely, we pretend that these conditional distributions take the form Qθ(wi)∈ Q for some θ that belongs to a VC-class of functions with values in I. For each i∈\1,…,n\, we estimate Qi(wi) by a distribution of the same form, i.e.\ Qθ(wi)∈ Q, where θ= θ(X1,…,Xn) is a well-chosen estimator with values in . We show that our estimation strategy is robust to model misspecification, contamination and the presence of outliers. Besides, we provide an algorithm for calculating θ when is a VC-class of functions of low or moderate dimension and we carry out a simulation study to compare the performance of θ to that of the MLE and median-based estimators.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…