K-Anonymous A/B Testing

Abstract

A core principle of Privacy by Design (PbD) is minimizing the data that is stored or shared about each individual respondent. PbD principles are mandated by the GDPR (see Article 5c and Article 25), as well as informing aspects of California Privacy Rights Act (CPRA). This paper describes a simple and effective approach that can be used in many a/b testing and similar contexts to help meet these PbD goals. Specifically, the method presented describes an approach to run OLS regression on k-anonymized data. To help illustrate the general utility of this approach, descriptions of two important use cases are offered: 1) calculating partial f-tests as a simple way to both check for a/b test interactions and to test for heterogeneity of treatment effects; and 2) regression adjustment using an approach similar to the popular CUPED method, as a variance reduction method for a/b tests. Using this method has advantages for privacy and compliance, as well as often reducing data storage and processing costs, by storing, sharing, or analyzing only aggregate level rather than individual level data.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…