Utilizing Priors in Sampling-based Cost Minimization

Abstract

We consider an autonomous vehicle (AV) agent performing a long-term cost-minimization problem in the elapsed time T over sequences of states s1:T and actions a1:T for some fixed, known (though potentially learned) cost function C(st,at), approximate system dynamics P, and distribution over initial states d0. The goal is to minimize the expected cost-to-go of the driving trajectory τ = s1, a1, ..., sT, aT from the initial state.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…