Critical issues with the Pearson's chi-square test

Abstract

Pearson's chi-square tests are among the most commonly applied statistical tools across a wide range of scientific disciplines, including medicine, engineering, biology, sociology, marketing and business. However, its usage in some areas is not correct. For example, the chi-square test for homogeneity of proportions (that is, comparing proportions across groups in a contingency table) is frequently used to verify if the rows of a given nonnegative m × n (contingency) matrix A are proportional. The null-hypothesis H0: ``m rows are proportional'' (for the whole population) is rejected with confidence level 1 - α if and only if 2stat > 2crit, where the first term is given by Pearson's formula, while the second one depends only on m, n, and α, but not on the entries of A. It is immediate to notice that the Pearson's formula is not invariant. More precisely, whenever we multiply all entries of A by a constant c, the value 2stat(A) is multiplied by c, too, 2stat(cA) = c 2stat (A). Thus, if all rows of A are exactly proportional then 2stat(cA) = c 2stat(A) = 0 for any c and any α. Otherwise, 2stat (cA) becomes arbitrary large or small, as positive c is increasing or decreasing. Hence, at any fixed significance level α, the null hypothesis H0 will be rejected with confidence 1 - α, when c is sufficiently large and not rejected when c is sufficiently small, Yet, obviously, the rows of cA should be proportional or not for all c simultaneously. Thus, any reasonable formula for the test statistic must be invariant, that is, take the same value for matrices cA for all real positive c. KEY WORDS: Pearson chi-square test, difference between two proportions, goodness of fit, contingency tables.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…