Non-Bernoulli Perturbation Distributions for Small Samples in Simultaneous Perturbation Stochastic Approximation
Abstract
Simultaneous perturbation stochastic approximation (SPSA) has proven to be efficient for recursive optimization. SPSA uses a centered difference approximation to the gradient based on two function evaluations regardless of the dimension of the problem. Typically, the Bernoulli +-1 distribution is used for perturbation vectors and theory has been established to prove the asymptotic optimality of this distribution. However, optimality of the Bernoulli distribution may not hold for small-sample stochastic approximation (SA) runs. In this paper, we investigate the performance of the segmented uniform as a perturbation distribution for small-sample SPSA. In particular, we conduct a theoretical analysis for one iteration of SA, which is a reasonable starting point and can be used as a basis for generalization to other small-sample SPSA settings with more than one iteration. In this work, we show that the Bernoulli distribution may not be the best choice for perturbation vectors under certain choices of parameters in small-sample SPSA
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.