Data-Driven Monitoring and Deterrence in a Changing Environment

Abstract

We study a dynamic model in which a principal monitors agents based on historical data of infractions. This data informs when and at what intensity to monitor; the monitoring decision, in turn, selects the collected data, shaping the principal's future learning. We analyze this feedback loop using a bandit model in which the underlying monitoring environment evolves according to a hidden Markov process. Because data collection is endogenous, how the principal uses this information is critical: surprisingly, a myopic approach renders historical data completely valueless. By endogenizing the agent's incentives, we demonstrate that the principal's purely informational motive to explore serves as an endogenous commitment device. This inherent drive to gather data compels persistent vigilance, strictly lowering the equilibrium infraction rate and restoring the power of deterrence.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…