Consistent Model Selection of Discrete Bayesian Networks from Incomplete Data

Abstract

A maximum likelihood based model selection of discrete Bayesian networks is considered. The model selection is performed through scoring function S, which, for a given network G and n-sample Dn, is defined to be the maximum log-likelihood l minus a penalization term λn h proportional to network complexity h(G), S(G|Dn) = l(G|Dn) - λn h(G). The data is allowed to have missing values at random that has prompted, to improve the efficiency of estimation, a replacement of the standard log-likelihood with the sum of sample average node log-likelihoods. The latter avoids the exclusion of most partially missing data records and allows the comparison of models fitted to different samples. Provided that a discrete Bayesian network is identifiable for a given missing data distribution, we show that if the sequence λn converges to zero at a slower rate than n-1/2 then the estimation is consistent. Moreover, we establish that BIC model selection (λn=0.5(n)/n) applied to the node-average log-likelihood is in general not consistent. This is in contrast to the complete data case where BIC is known to be consistent. The conclusions are confirmed by numerical examples.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…