A Unified Subject Map for 130 Years of Physics
Abstract
More than a century of physics is recorded in the American Physical Society (APS) archive, but the corpus cannot be analyzed as a single, time-resolved object because its subject metadata are fragmented across eras with no shared vocabulary. We close this gap by using a frontier large language model to retrospectively assign the modern Physics Subject Headings (PhySH) to the historical archive, yielding a unified subject map for every APS paper from 1893 to 2025. The resulting map not only reproduces century-scale disciplinary arcs but also resolves the fine-grained lifecycles of individual ideas, materials, techniques, and discoveries across a vocabulary of over 3,000 PhySH Concepts. The map turns a fragmented archive into a quantitative substrate for systematic search and for data-driven studies of how physics evolves.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.