Stochastic Compositional Optimization via Hybrid Momentum Frank--Wolfe

Abstract

Stochastic compositional optimization minimizes objectives of the form x ∈ X F(f(x), x), where f is accessible only through noisy stochastic queries. Existing methods for this problem assume that the outer function F is continuously differentiable, which excludes many practically important applications such as robust max-of-losses, Conditional Value-at-Risk, and norm regularizers. We propose the Hybrid Momentum Stochastic Frank--Wolfe algorithm, which drops the smoothness assumption on F. By combining a momentum-based Jacobian tracker with a Taylor-corrected function tracker, the algorithm feeds an entire stochastic linearization -- rather than a single gradient -- into a generalized linear minimization oracle. We establish an O(K-1/4) convergence rate in the generalized Frank--Wolfe gap for non-convex objectives with LF-Lipschitz outer functions, matching the optimal complexity for projection-free single-sample stochastic methods under expected smoothness. The analysis extends to heavy-tailed noise oracles with bounded r-th moments for r ∈ (1, 2] and recovers the deterministic rates of Vladarean et al (2023) as the noise vanishes.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…