Utilizing Priors in Sampling-based Cost Minimization
Abstract
We consider an autonomous vehicle (AV) agent performing a long-term cost-minimization problem in the elapsed time T over sequences of states s1:T and actions a1:T for some fixed, known (though potentially learned) cost function C(st,at), approximate system dynamics P, and distribution over initial states d0. The goal is to minimize the expected cost-to-go of the driving trajectory τ = s1, a1, ..., sT, aT from the initial state.
0
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.