Acquiring Human-Like Data-Efficient Mechanics Prediction from Deep Reinforcement Learning
Abstract
Humans can infer mechanical outcomes by learning from a few observations. This capacity for mechanics intuition is acquired in a data-efficient manner. Here, we propose a reinforcement learning framework to mimic this process, in which an agent encodes continuous physical observation parameters into its state and is trained via episodic switching across closely related observations. With merely two or three similar observations, the agent acquires robust mechanics intuition that generalizes over wide parameter ranges beyond the training data. Our method is demonstrated on the brachistochrone, a large-deformation elastic plate, and the quantum harmonic oscillator. We explain this generalization through a unified theoretical view: it is associated with cross-parameter Bellman consistency encouraged by episodic switching across neighboring task parameters, promoting approximate stationarity of the Bellman residual with respect to physical variations. This is consistent with a smooth policy that tracks a low-dimensional solution manifold underlying the continuum of tasks. Our work identifies episodic switching as a practically effective and theoretically motivated route to artificial mechanics intuition and suggests a computational analogy to data-efficient generalization in biological learners.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.