A novel Krylov subspace method for approximating Fr\'echet derivatives of large-scale matrix functions

Abstract

We present a novel Krylov subspace method for approximating Lf(A, E) b, the matrix-vector product of the Fr\'echet derivative Lf(A, E) of a large-scale matrix function f(A) in direction E, a task that arises naturally in the sensitivity analysis of quantities involving matrix functions, such as centrality measures for networks. It also arises in the context of gradient-based methods for optimization problems that feature matrix functions, e.g., when fitting an evolution equation to an observed solution trajectory. In principle, the well-known identity \[ f( bmatrix A & E \\ 0 & A bmatrix ) bmatrix 0 \\ b bmatrix = bmatrix Lf(A, E) b \\ f(A) b bmatrix, \] allows one to directly apply any standard Krylov subspace method, such as the Arnoldi algorithm, to address this task. However, this comes with the major disadvantage that the involved block triangular matrix has unfavorable spectral properties, which impede the convergence analysis and, to a certain extent, also the observed convergence. To avoid these difficulties, we propose a novel modification of the Arnoldi algorithm that aims at better preserving the block triangular structure. In turn, this allows one to bound the convergence of the modified method by the best polynomial approximation of the derivative f on the numerical range of A. Several numerical experiments illustrate our findings.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…