The Sample Complexity of Multi-Distribution Learning for VC Classes

Abstract

Multi-distribution learning is a natural generalization of PAC learning to settings with multiple data distributions. There remains a significant gap between the known upper and lower bounds for PAC-learnable classes. In particular, though we understand the sample complexity of learning a VC dimension d class on k distributions to be O(ε-2 (k)(d + k) + \ε-1 dk, ε-4 (k) d\), the best lower bound is (ε-2(d + k (k))). We discuss recent progress on this problem and some hurdles that are fundamental to the use of game dynamics in statistical learning.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…