Group Distributionally Robust Optimization with Flexible Sample Queries
Abstract
Group distributionally robust optimization (GDRO) aims to develop models that perform well across m distributions simultaneously. Existing GDRO algorithms can only process a fixed number of samples per iteration, either 1 or m, and therefore can not support scenarios where the sample size varies dynamically. To address this limitation, we investigate GDRO with flexible sample queries and cast it as a two-player game: one player solves an online convex optimization problem, while the other tackles a prediction with limited advice (PLA) problem. Within such a game, we propose a novel PLA algorithm, constructing appropriate loss estimators for cases where the sample size is either 1 or not, and updating the decision using follow-the-regularized-leader. Then, we establish the first high-probability regret bound for non-oblivious PLA. Building upon the above approach, we develop a GDRO algorithm that allows an arbitrary and varying sample size per round, achieving a high-probability optimization error bound of O(1tΣj=1t mrj m), where rt denotes the sample size at round t. This result demonstrates that the optimization error decreases as the number of samples increases and implies a consistent sample complexity of O(m (m)/ε2) for any fixed sample size r∈[m], aligning with existing bounds for cases of r=1 or m. We validate our approach on synthetic binary and real-world multi-class datasets.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.