Faster Rates for training Max-Margin Markov Networks

Abstract

Structured output prediction is an important machine learning problem both in theory and practice, and the max-margin Markov network () is an effective approach. All state-of-the-art algorithms for optimizing \ objectives take at least O(1/ε) number of iterations to find an ε accurate solution. Recent results in structured optimization suggest that faster rates are possible by exploiting the structure of the objective function. Towards this end Nesterov05 proposed an excessive gap reduction technique based on Euclidean projections which converges in O(1/ε) iterations on strongly convex functions. Unfortunately when applied to s, this approach does not admit graphical model factorization which, as in many existing algorithms, is crucial for keeping the cost per iteration tractable. In this paper, we present a new excessive gap reduction technique based on Bregman projections which admits graphical model factorization naturally, and converges in O(1/ε) iterations. Compared with existing algorithms, the convergence rate of our method has better dependence on ε and other parameters of the problem, and can be easily kernelized.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…