U-Turn Diffusion

Abstract

We investigate diffusion models generating synthetic samples from the probability distribution represented by the Ground Truth (GT) samples. We focus on how GT sample information is encoded in the Score Function (SF), computed (not simulated) from the Wiener-Ito (WI) linear forward process in the artifical time t∈ [0 ∞], and then used as a nonlinear drift in the simulated WI reverse process with t∈ [∞ 0]. We propose U-Turn diffusion, an augmentation of a pre-trained diffusion model, which shortens the forward and reverse processes to t∈ [0 Tu] and t∈ [Tu 0]. The U-Turn reverse process is initialized at Tu with a sample from the probability distribution of the forward process (initialized at t=0 with a GT sample) ensuring a detailed balance relation between the shorten forward and reverse processes. Our experiments on the class-conditioned SF of the ImageNet dataset and the multi-class, single SF of the CIFAR-10 dataset reveal a critical Memorization Time Tm , beyond which generated samples diverge from the GT sample used to initialize the U-Turn scheme, and a Speciation Time Ts , where for Tu > Ts > Tm , samples begin representing different classes. We further examine the role of SF non-linearity through a Gaussian Test, comparing empirical and Gaussian-approximated U-Turn auto-correlation functions, and showing that the SF becomes effectively affine for t > Ts , and approximately affine for t∈ [Tm,Ts].

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…