Learning Transfers: Kan Extensions for Neural Invariants
Abstract
Transfer learning presumes that a representation learned on source tasks carries structure that remains usable on related target tasks. Standard evaluations probe this through target accuracy or distributional discrepancy, yet leave unspecified which structural invariant is meant to transfer. We supply that invariant categorically. A source task category A, a target task category B, and a task-change functor J: A B determine, for every invariant-valued source representation F: A V, the universal transferred invariant LanJ F. Given a target invariant G: B V, we define the transfer discrepancy CompJ(F,G)=b∈Ob( B) d V((LanJ F)(b),G(b)), evaluating transfer not by an objectwise comparison of source and target, but by comparing the target invariant against the one forced by the prescribed task transformation. We prove finite cokernel formulas for (LanJ F)(b) in chain complexes and persistence modules, indexed by the comma category J b. For persistence-valued finite-type one-parameter invariants, the discrepancy is computed exactly by bottleneck distances between barcodes. Controlled experiments on neural latent point clouds then test whether the score recovers the correct task functor and flags representation collapses that preserve classification accuracy while destroying transfer-relevant topology.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.