Q: Does the Gauss-Markov Theorem require the error terms $ \epsilon $ to be normally distributed?

No, the Gauss-Markov Theorem does not require the errors $ \epsilon $ to be normally distributed. It only relies on the first and second moments of the errors (mean and variance/covariance). Normality is often assumed for hypothesis testing and confidence intervals, or for OLS to be the Maximum Likelihood Estimator (MLE), but it is not necessary for OLS to be BLUE.

Q: What does it mean for an estimator to be 'linear' in the context of OLS?

An estimator $ \tilde{\beta} $ is 'linear' if it can be expressed as a linear function of the observed dependent variable $ y $. For OLS, $ \hat{\beta}_{OLS} = (X^T X)^{-1} X^T y $. If we let $ C = (X^T X)^{-1} X^T $, then $ \hat{\beta}_{OLS} = Cy $, which clearly shows its linearity in $ y $. This linearity simplifies analysis and makes the estimator analytically tractable.

Q: Why is the positive semi-definite condition for $ \text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS}) $ important?

The condition that $ \text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS}) $ is a positive semi-definite matrix means that for any non-zero vector $ a $, $ a^T (\text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS})) a \ge 0 $. This implies that $ \text{Var}(a^T \tilde{\beta}) \ge \text{Var}(a^T \hat{\beta}_{OLS}) $ for any linear combination of the elements of $ \tilde{\beta} $. In simpler terms, it means that the variance of any single coefficient estimate, or any linear combination of coefficients, from $ \hat{\beta}_{OLS} $ will be less than or equal to that from any other linear unbiased estimator $ \tilde{\beta} $.

Question 1

Does the Gauss-Markov Theorem require the error terms $ \epsilon $ to be normally distributed?

Accepted Answer

No, the Gauss-Markov Theorem does not require the errors $ \epsilon $ to be normally distributed. It only relies on the first and second moments of the errors (mean and variance/covariance). Normality is often assumed for hypothesis testing and confidence intervals, or for OLS to be the Maximum Likelihood Estimator (MLE), but it is not necessary for OLS to be BLUE.

Question 2

If homoscedasticity or no autocorrelation is violated, is OLS still unbiased?

Accepted Answer

Yes, OLS remains unbiased even in the presence of heteroscedasticity or autocorrelation, provided the strict exogeneity assumption $ E[\epsilon | X] = 0 $ holds. However, its variance-covariance matrix will be incorrectly estimated by the standard formula, and OLS will no longer be the 'Best' (most efficient) linear unbiased estimator. Other methods like Weighted Least Squares (WLS) or Generalized Least Squares (GLS) would be more efficient.

Question 3

What does it mean for an estimator to be 'linear' in the context of OLS?

Accepted Answer

An estimator $ \tilde{\beta} $ is 'linear' if it can be expressed as a linear function of the observed dependent variable $ y $. For OLS, $ \hat{\beta}_{OLS} = (X^T X)^{-1} X^T y $. If we let $ C = (X^T X)^{-1} X^T $, then $ \hat{\beta}_{OLS} = Cy $, which clearly shows its linearity in $ y $. This linearity simplifies analysis and makes the estimator analytically tractable.

Question 4

Why is the positive semi-definite condition for $ \text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS}) $ important?

Accepted Answer

The condition that $ \text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS}) $ is a positive semi-definite matrix means that for any non-zero vector $ a $, $ a^T (\text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS})) a \ge 0 $. This implies that $ \text{Var}(a^T \tilde{\beta}) \ge \text{Var}(a^T \hat{\beta}_{OLS}) $ for any linear combination of the elements of $ \tilde{\beta} $. In simpler terms, it means that the variance of any single coefficient estimate, or any linear combination of coefficients, from $ \hat{\beta}_{OLS} $ will be less than or equal to that from any other linear unbiased estimator $ \tilde{\beta} $.

The Gauss-Markov Theorem: Proof that OLS is the Best Linear Unbiased Estimator (BLUE)

Visualizing...

The Formal Theorem

Analytical Intuition.

Institutional Warning.

Academic Inquiries.

Does the Gauss-Markov Theorem require the error terms $\epsilon$ to be normally distributed?

If homoscedasticity or no autocorrelation is violated, is OLS still unbiased?

What does it mean for an estimator to be 'linear' in the context of OLS?

Why is the positive semi-definite condition for $\text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS})$ important?

Standardized References.

The Matrix Formulation of the General Linear Model: Y = Xβ + ϵ and its Fundamental Assumptions

Derivation of the Ordinary Least Squares (OLS) Estimator: β̂ = (X'X)⁻¹X'Y

Proof of Unbiasedness of the OLS Estimator: E(β̂) = β

Derivation of the Variance-Covariance Matrix of the OLS Estimator: Var(β̂) = σ²(X'X)⁻¹

Institutional Citation

Dominate the Logic.

Visualizing...

The Formal Theorem

Analytical Intuition.

Institutional Warning.

Academic Inquiries.

Does the Gauss-Markov Theorem require the error terms ϵ \epsilon ϵ to be normally distributed?

If homoscedasticity or no autocorrelation is violated, is OLS still unbiased?

What does it mean for an estimator to be 'linear' in the context of OLS?

Why is the positive semi-definite condition for Var(β~)−Var(β^OLS) \text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS}) Var(β~​)−Var(β^​OLS​) important?

Standardized References.

Related Proofs Cluster.

The Matrix Formulation of the General Linear Model: Y = Xβ + ϵ and its Fundamental Assumptions

Derivation of the Ordinary Least Squares (OLS) Estimator: β̂ = (X'X)⁻¹X'Y

Proof of Unbiasedness of the OLS Estimator: E(β̂) = β

Derivation of the Variance-Covariance Matrix of the OLS Estimator: Var(β̂) = σ²(X'X)⁻¹

Institutional Citation

Dominate the Logic.

Does the Gauss-Markov Theorem require the error terms $\epsilon$ to be normally distributed?

Why is the positive semi-definite condition for $\text{Var}(\tilde{\beta}) - \text{Var}(\hat{\beta}_{OLS})$ important?