Dual-Select FMA Butterfly for FFT: Eliminating Twiddle Factor Singularities with Bounded Precomputed Ratios

Abstract

The fused multiply-add (FMA) instruction enables the radix-2 FFT butterfly to be computed in 6~FMA operations -- the proven minimum. The classical factorization by Linzer and Feig~linzer1993 precomputes the ratio θ = θ/θ, which is singular when the twiddle factor is W0 = 1 (i.e., θ = 0). Standard practice clamps θ to a small epsilon, degrading numerical precision. We observe that an alternative factorization using θ as the outer multiplier (precomputing θ) avoids this particular singularity but introduces a new one at WN/4. We then propose a dual-select strategy that chooses, per twiddle factor, whichever factorization yields |ratio| ≤ 1. This eliminates all singularities, requires no epsilon clamping, and bounds the precomputed ratio to unity for all twiddle factors. For N = 1024, the worst-case ratio drops from 163 (Linzer-Feig) to exactly~1.0 (dual-select), yielding a 235× tighter error bound in FP16 arithmetic over 10~FFT passes. The strategy adds zero computational overhead -- only the precomputed twiddle table changes.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…