I Dropped a Neural Net
Abstract
A recent Dwarkesh Patel podcast with John Collison and Elon Musk featured an interesting puzzle from Jane Street: they trained a neural net, shuffled all 96 layers, and asked to put them back in order. Given unlabelled layers of a Residual Network and its training dataset, we recover the exact ordering of the layers. The problem decomposes into pairing each block's input and output projections (48! possibilities) and ordering the reassembled blocks (48! possibilities), for a combined search space of (48!)2 ≈ 10122, which is more than the atoms in the observable universe. We show that stability conditions during training like dynamic isometry leave the product Wout Win for correctly paired layers with a negative diagonal structure, allowing us to use diagonal dominance ratio as a signal for pairing. For ordering, we seed-initialize with a rough proxy such as delta-norm or \|Wout\|F then hill-climb to zero mean squared error.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.