Augmented Quantization: Mixture Models for Risk-Oriented Sensitivity Analysis
Abstract
A central question in risk analysis is to identify the factors that drive the system toward a specific hazardous outcome, such as the exceedance of a given threshold. When relying on numerical simulators, we propose to study the distribution of the inputs, transformed into uniform variables via their cumulative distributions, conditionally on the occurrence of the hazardous event. To represent this multivariate conditional distribution for sensitivity analysis, we introduce an original quantization approach based on estimating a mixture of Dirac and local uniform distributions. For each marginal of this mixture, a Dirac component indicates a strong influence of the corresponding variable, whereas a uniform component with wide support reflects weak influence. A notable advantage of this method is its ability to identify the regions of the input space that most strongly influence the occurrence of the risk event, while also capturing the joint effects of multiple variables. However, learning mixture models typically relies on likelihood-based methods, which are not well suited to mixtures involving singular or Dirac components. To address this, we propose an Augmented Quantization method, a reformulation of the classical quantization problem based on the p-Wasserstein distance, which can be computed in very general distribution spaces. The performance of Augmented Quantization in estimating such mixture models is first demonstrated on analytical toy problems, and then applied to sensitivity analysis of both an analytical function and a practical flooding case study on a section of the Loire River.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.