Validating Behavioral Proxies for Disease Risk Monitoring via Large-Scale E-commerce Data
Abstract
Digital traces of daily activities, such as e-commerce (EC) purchase histories, provide scalable signals for public health surveillance, yet their epidemiological validity remains unclear. This study validates a behavioral proxy for disease onset, defined as transitions from regular to therapeutic diets, by comparing large-scale EC data (N=55,645) against independent insurance-derived clinical records. Using feline lower urinary tract disease (FLUTD) as a case study, the proxy showed strong agreement with clinical data for ingredient-level risk patterns (r=0.74) and seasonal dynamics (r=0.82). Furthermore, analysis using EC data alone reproduced the established protective association of wet food consumption. These results demonstrate that validated behavioral signals from EC data can serve as cost-effective complements to traditional surveillance, with potential applicability to monitoring lifestyle-related diseases in human populations.
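The validation logic summarized above (flag a regular-to-therapeutic diet transition as a proxy onset, aggregate onsets over time, then correlate with clinical counts) can be illustrated with a minimal sketch. The abstract does not specify how transitions are operationalized, so the rule used here (first purchase of a therapeutic diet immediately after a regular one), the record layout, and the function names `proxy_onsets`, `monthly_counts`, and `pearson_r` are all assumptions for illustration, not the authors' method.

```python
from collections import Counter
from math import sqrt


def proxy_onsets(purchases):
    """Map each customer to the month of their first regular -> therapeutic switch.

    `purchases` is an iterable of (customer_id, month, diet_type) tuples,
    assumed to be in chronological order for each customer (hypothetical layout).
    """
    last_diet = {}
    onsets = {}
    for customer, month, diet in purchases:
        switched = last_diet.get(customer) == "regular" and diet == "therapeutic"
        if switched and customer not in onsets:
            onsets[customer] = month  # keep only the first transition
        last_diet[customer] = diet
    return onsets


def monthly_counts(onsets, months):
    """Count proxy onsets per month, in the order given by `months`."""
    counts = Counter(onsets.values())
    return [counts.get(m, 0) for m in months]


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sx * sy)


# Toy data only; not taken from the study.
toy_purchases = [
    ("a", 1, "regular"), ("a", 2, "therapeutic"),
    ("b", 1, "therapeutic"), ("b", 2, "regular"), ("b", 3, "therapeutic"),
    ("c", 1, "regular"), ("c", 3, "therapeutic"),
]
toy_proxy = monthly_counts(proxy_onsets(toy_purchases), months=[1, 2, 3])
toy_clinical = [1, 3, 5]  # hypothetical clinical onset counts per month
toy_r = pearson_r(toy_proxy, toy_clinical)
```

In this sketch a customer whose first recorded purchase is already therapeutic contributes no onset until a regular purchase precedes a therapeutic one, which is one possible reading of "transition"; a real analysis would also need to normalize counts by the population at risk.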