Revisiting local regression: shape regularity, uniform rates, and the limits of random splits

Abstract

Considering pointwise and sup-norm estimation, we analyze the non-asymptotic behavior of local averaging estimators for Lipschitz regression functions. Building on a general deviation bound for estimators based on a VC family of localizing sets, we introduce the notion of shape-regular local maps, where averaging is performed over sets with an almost isotropic geometry. Our main message is a characterization: shape regularity is both necessary and sufficient to attain optimal rates, up to logarithmic factors. Necessity is established non-asymptotically through an explicit anisotropic example, sharpening a phenomenon previously understood only heuristically in asymptotic theory. We then draw two consequences. First, the simple k-nearest neighbor rule is shape-regular by construction and attains the optimal rate, even on unbounded supports. Second, and perhaps surprisingly, the popular random-split condition for trees -- known to ensure consistency and vanishing cell diameters -- does not guarantee optimal rates: for blind tree constructions, the cell aspect ratio diverges exponentially with depth, so that shape regularity fails with positive probability. This identifies the absence of a geometric correction mechanism, rather than a slowly shrinking diameter, as the obstruction to optimality. Motivated by this gap, we propose a tree construction that enforces shape regularity through a simple constraint on admissible splits, and prove a uniform deviation inequality showing that it restores the optimal rate for Lipschitz functions.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…