Scale matrix estimation under data-based loss in high and low dimensions
Abstract
We consider the problem of estimating the scale matrix of the additif model Yp× n = M + E, under a theoretical decision point of view. Here, p is the number of variables, n is the number of observations, M is a matrix of unknown parameters with rank q<p and E is a random noise, whose distribution is elliptically symmetric with covariance matrix proportional to In \,. We deal with a canonical form of this model where Y is decomposed in two matrices, namely, Zq× p which summarizes the information contained in M , and Um× p, where m=n-q, which summarizes the sufficient information to estimate . As the natural estimators of the form a=a\, S (where S=UT\,U and a is a positive constant) perform poorly when p >m (S non-invertible), we propose estimators of the form a, G = a( S+ S \, S+\,G(Z,S)) where S+ is the Moore-Penrose inverse of S (which coincides with S-1 when S is invertible). We provide conditions on the correction matrix SS+G(Z,S) such that a, G improves over a under the data-based loss L S( , ) = tr ( S+\,( \, - 1 - I p ) 2) . We adopt a unified approach of the two cases where S is invertible (p ≤ m) and S is non-invertible (p>m).