When Can Human-AI Teams Outperform Individuals? Tight Bounds with Impossibility Guarantees

Abstract

Human-AI teams fail to outperform their best member in 70% of studies, yet no theory specifies when complementarity is achievable. We derive tight bounds for the broad class of confidence-based aggregation rules by integrating signal detection theory with information-theoretic analysis, yielding four results: (1) a complementarity theorem (teams outperform individuals iff error correlation HM < *, with * ≈ a in the symmetric near-chance regime); (2) minimax bounds showing gains scale as ( d) with metacognitive sensitivity difference; (3) an impossibility result proving no confidence-based aggregation rule achieves complementarity when HM ≥ *; and (4) multi-class generalization *K ≈ */K-1. Predictions match observed team accuracy (R = 0.94 on ImageNet-16H, R = 0.91 on CIFAR-10H) and the multi-class threshold scaling holds on human data (R = 0.93, K = 16), with robustness under non-Gaussian distributions. The framework explains why complementarity is rare and provides actionable design formulas; results apply to aggregation, not to interactive deliberation that generates novel answers.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…