VASR: Variance-Aware Systematic Resampling for Reward-Guided Diffusion
Abstract
Sequential Monte Carlo (SMC) samplers for reward-guided diffusion models often suffer from rapid lineage collapse: a few high-reward particles dominate the population within a handful of resampling steps, destroying diversity and degrading sample quality. We propose a variance-decomposition framework for reward-guided diffusion SMC that separates continuation variance Vtcont from residual variance Vtres, revealing that high offspring-count variance under the commonly used multinomial resampling drives this collapse. This motivates VASR (Variance-Aware Systematic Resampling), which addresses both variance terms via variance-optimal mass allocation mt wt ert (minimizing Vtcont) and systematic resampling (controlling Vtres). For latent diffusion models where intermediate rewards are noisy due to stochastic continuations, we propose VASR-Max, a deliberately biased high-selection variant for variance-sensitive reward optimization. Both methods are training-free, fully parallelizable, and add only linear overhead. On MNIST and CIFAR-10, VASR achieves as high as 26\% better FID than prior SMC methods while remaining 66 times faster than MCTS-based value methods at matched compute. On text-to-image generation, VASR-Max consistently outperforms the strongest SMC baseline across compute budgets and matches MCTS-based methods within 2.5--3% reward at high budgets while being approximately times faster.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.