Contractivity of neural ODEs: an eigenvalue optimization problem
Abstract
We propose a novel methodology to solve a key eigenvalue optimization problem which arises in the contractivity analysis of neural ODEs. When looking at contractivity properties of a one layer weight-tied neural ODE u(t)=σ(Au(t)+b) (with u,b ∈ Rn, A is a given n × n matrix, σ : R R denotes an activation function and for a vector z ∈ Rn, σ(z) ∈ Rn has to be interpreted entry-wise), we are led to study the logarithmic norm of a set of products of type D A, where D is a diagonal matrix such that diag(D) ∈ σ'( Rn). Specifically, given a real number c (usually c=0), the problem consists in finding the largest positive interval I⊂eq [0,∞) such that the logarithmic norm μ(DA) c for all diagonal matrices D with Dii∈ I. We propose a two-level nested methodology: an inner level where, for a given I, we compute an optimizer D(I) by a gradient system approach, and an outer level where we tune I so that the value c is reached by μ(D(I)A). We extend the proposed two-level approach to the general multilayer, and possibly time-dependent, case u(t) = σ( Ak(t) … σ ( A1(t) u(t) + b1(t) ) … + bk(t) ) and we propose several numerical examples to illustrate its behaviour, including its stabilizing performance on a one-layer neural ODE applied to the classification of the MNIST handwritten digits dataset.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.