Topic Modeling in New Physics Detection
Abstract
In this work, we apply topic modeling to detect new physics in proton-proton collisions at the LHC in an unsupervised way. We investigate three new physics scenarios where fully leptonic tt bb+-_ is the main source of background without relying on jet substructure variables. We demonstrate that the algorithm remains effective even in this low-particle multiplicity framework, complementing jet tagging studies, where it is typically employed. Moreover, we demonstrate that the performance of topic modeling is competitive or even better than well-known outlier detectors, such as isolation forest and variational autoencoders, with moderate and high background pollution in almost all new physics scenarios considered.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.