Rerandomization for covariate balance mitigates p-hacking in regression adjustment
Abstract
Rerandomization enforces covariate balance across treatment groups in the design stage of experiments. Despite its intuitive appeal, its theoretical justification remains unsatisfying because its benefits of improving efficiency for estimating the average treatment effect diminish if we use regression adjustment in the analysis stage. To strengthen the theory of rerandomization, we show that it mitigates false discoveries resulting from p-hacking, the practice of strategically selecting covariates to get more significant p-values. Moreover, we show that rerandomization with a sufficiently stringent threshold can resolve p-hacking. As a byproduct, our theory offers guidance for choosing the threshold in rerandomization in practice.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.