Group-Algebraic Tensors: Provably-optimal Equivariant Learning and Physical Symmetry Discovery
Abstract
We introduce the G tensor algebra, in which any finite group G defines the multiplication rule, making equivariance an intrinsic algebraic property rather than an architectural constraint. The framework rests on three machine-verified theoretical pillars: (i)~an Eckart-Young optimality guarantee for the G-SVD: the first such result for symmetry-preserving tensor approximation, exact and polynomial-time; (ii)~a Kronecker factorization that composes multiple symmetries by replacing FG with FG1 FG2 with no architectural redesign; and (iii)~a 600-line Lean~4 formalization of the G algebra. The framework provides capabilities that equivariant neural networks (ENNs) structurally cannot: a closed-form per-irreducible-representation decomposition of every prediction, and data-driven discovery of the symmetry group that best fits a dataset. As a non-trivial empirical demonstration, decomposing QM9 molecular geometry over the chiral octahedral subgroup of SO(3) recovers the Wigner--Eckart selection rules of angular momentum from data alone, with no quantum mechanical input: scalar properties are A1-dominated, dipole components are T1-dominated, the isotropic polarizability is uniquely insensitive to l\!=\!1 as the rank-2-trace decomposition l\!=\!0 l\!=\!2 requires, and the T1/A1 predictive-power ratio separates vector observables from scalar observables by a factor of five. On full QM9 (130,831 molecules), G-SVD with ridge regression provides closed form predictions at 50-90× fewer parameters than parameter-matched MLPs. Algebraic equivariance thus complements architectural equivariance not as a faster-better-cheaper alternative but as a different mathematical affordance: provably-optimal symmetry-preserving compression, per-irrep interpretability, and data-driven physical discovery.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.