Robustness against data loss with Algebraic Statistics
Abstract
The paper describes an algorithm that, given an initial design Fn of size n and a linear model with p parameters, provides a sequence Fn ⊃ … ⊃ Fn-k ⊃ … ⊃ Fp of nested robust designs. The sequence is obtained by the removal, one by one, of the runs of Fn till a p-run saturated design Fp is obtained. The potential impact of the algorithm on real applications is high. The initial fraction Fn can be of any type and the output sequence can be used to organize the experimental activity. The experiments can start with the runs corresponding to Fp and continue adding one run after the other (from Fn-k to Fn-k+1) till the initial design Fn is obtained. In this way, if for some unexpected reasons the experimental activity must be stopped before the end when only n-k runs are completed, the corresponding Fn-k has a high value of robustness for k ∈ \1, …, n-p\. The algorithm uses the circuit basis, a special representation of the kernel of a matrix with integer entries. The effectiveness of the algorithm is demonstrated through the use of simulations.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.