Empirical Risk Minimization with f-Divergence Regularization

Abstract

In this paper, the solution to the empirical risk minimization problem with f-divergence regularization (ERM-fDR) is presented and conditions under which the solution also serves as the solution to the minimization of the expected empirical risk subject to an f-divergence constraint are established. The proposed approach extends applicability to a broader class of f-divergences than previously reported and yields theoretical results that recover previously known results. Additionally, the difference between the expected empirical risk of the ERM-fDR solution and that of its reference measure is characterized, providing insights into previously studied cases of f-divergences. A central contribution is the introduction of the normalization function, a mathematical object that is critical in both the dual formulation and practical computation of the ERM-fDR solution. This work presents an implicit characterization of the normalization function as a nonlinear ordinary differential equation (ODE), establishes its key properties, and subsequently leverages them to construct a numerical algorithm for approximating the normalization factor under mild assumptions. Further analysis demonstrates structural equivalences between ERM-fDR problems with different f-divergences via transformations of the empirical risk. Finally, the proposed algorithm is used to compute the training and test risks of ERM-fDR solutions under different f-divergence regularizers. This numerical example highlights the practical implications of choosing different functions f in ERM-fDR problems.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…