Derivation of the Chi-Square Test Statistic for Goodness-of-Fit and Independence

Q: Why is the denominator $ E_i $ and not the variance $ np_i(1-p_i) $?

While the variance of a single Binomial component is $ np_i(1-p_i) $, the derivation uses the properties of the Multivariate Normal distribution. The $ E_i $ denominator emerges naturally when simplifying the quadratic form of the inverse covariance matrix of the Multinomial distribution.

Q: What happens if $ E_i < 5 $?

The Chi-Square distribution is an asymptotic result (Limit Theorem). When expected counts are small, the discrete nature of the data is not sufficiently 'smoothed' into a Gaussian shape, making the $ p $-values derived from the continuous Chi-Square curve unreliable.

Q: How does the 'Independence' test relate to the 'Goodness-of-Fit' derivation?

Independence is a specific case of Goodness-of-Fit where the hypothesized probabilities $ p_{ij} $ are products of marginals $ p_{i.} \times p_{.j} $. The derivation remains the same, but the degrees of freedom are adjusted for estimated parameters.

Analytical Intuition.

Imagine a high-dimensional stage where the Multinomial distribution describes the movement of data points across various categories. Under the Null Hypothesis, each category has a 'target' weight defined by

E_i

. As the sample size

n

scales toward infinity, the discrete jumps of the Poisson-like counts smooth out into a continuous Multivariate Normal distribution. However, because all counts must sum to

n

, the variables are not independent; they are locked in a geometric dance upon a hyper-plane of dimension

k-1

. The Chi-Square statistic is essentially the squared Mahalanobis distance from the observed data to the expected center, normalized by the variance. By squaring these standardized deviations, we transform a complex Gaussian vector into a singular scalar value representing the 'total tension' or discrepancy in the system. It is the mathematical lens that focuses the chaotic vibrations of random sampling into a clear signal of whether our model holds or fractures under the weight of evidence.

Institutional Warning.

Students often conflate the Degrees of Freedom for Goodness-of-Fit with those for Independence. In Independence tests, we estimate marginal probabilities from the data, which imposes additional linear constraints, reducing the dimensions from

k-1

(r-1)(c-1)

via the subtraction of estimated parameters.

Academic Inquiries.

Why is the denominator $E_i$ and not the variance $np_i(1-p_i)$ ?

While the variance of a single Binomial component is $np_i(1-p_i)$ , the derivation uses the properties of the Multivariate Normal distribution. The $E_i$ denominator emerges naturally when simplifying the quadratic form of the inverse covariance matrix of the Multinomial distribution.

What happens if $E_i < 5$ ?

The Chi-Square distribution is an asymptotic result (Limit Theorem). When expected counts are small, the discrete nature of the data is not sufficiently 'smoothed' into a Gaussian shape, making the $p$ -values derived from the continuous Chi-Square curve unreliable.

How does the 'Independence' test relate to the 'Goodness-of-Fit' derivation?

Independence is a specific case of Goodness-of-Fit where the hypothesized probabilities $p_{ij}$ are products of marginals $p_{i.} \times p_{.j}$ . The derivation remains the same, but the degrees of freedom are adjusted for estimated parameters.

NICEFA Visual Mathematics. (2026). Derivation of the Chi-Square Test Statistic for Goodness-of-Fit and Independence: Visual Proof & Intuition. Retrieved from https://nicefa.org/library/applied-statistics/derivation-of-the-chi-square-test-statistic-for-goodness-of-fit-and-independence

Visualizing...

The Formal Theorem

Analytical Intuition.

Institutional Warning.

Academic Inquiries.

Why is the denominator $E_i$ and not the variance $np_i(1-p_i)$ ?

What happens if $E_i < 5$ ?

How does the 'Independence' test relate to the 'Goodness-of-Fit' derivation?

Standardized References.

Proof of Chebyshev's Inequality

Derivation of the Mean and Variance of the Binomial Distribution

Derivation of the Mean and Variance of the Poisson Distribution

The Conceptual Proof of the Central Limit Theorem (CLT)

Institutional Citation

Dominate the Logic.

Visualizing...

The Formal Theorem

Analytical Intuition.

Institutional Warning.

Academic Inquiries.

Why is the denominator Ei E_i Ei​ and not the variance npi(1−pi) np_i(1-p_i) npi​(1−pi​)?

What happens if Ei<5 E_i < 5 Ei​<5?

How does the 'Independence' test relate to the 'Goodness-of-Fit' derivation?

Standardized References.

Related Proofs Cluster.

Proof of Chebyshev's Inequality

Derivation of the Mean and Variance of the Binomial Distribution

Derivation of the Mean and Variance of the Poisson Distribution

The Conceptual Proof of the Central Limit Theorem (CLT)

Institutional Citation

Dominate the Logic.

Why is the denominator $E_i$ and not the variance $np_i(1-p_i)$ ?

What happens if $E_i < 5$ ?