Question 1

Does the unbiasedness property hold if the regressors $ \mathbf{X} $ are stochastic (random variables) rather than fixed?

Accepted Answer

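Yes, provided the strict exogeneity assumption $ E(\boldsymbol{\epsilon} | \mathbf{X}) = \mathbf{0} $ holds and $ \mathbf{X}^T \mathbf{X} $ is invertible. Conditioning on $ \mathbf{X} $ lets us treat it as fixed: since $ \hat{\boldsymbol{\beta}} = \boldsymbol{\beta} + (\mathbf{X}^T \mathbf{X})^{-1} \mathbf{X}^T \boldsymbol{\epsilon} $, we have $ E(\hat{\boldsymbol{\beta}} | \mathbf{X}) = \boldsymbol{\beta} + (\mathbf{X}^T \mathbf{X})^{-1} \mathbf{X}^T E(\boldsymbol{\epsilon} | \mathbf{X}) = \boldsymbol{\beta} $ for every realization of $ \mathbf{X} $. The law of iterated expectations then gives $ E(\hat{\boldsymbol{\beta}}) = E[E(\hat{\boldsymbol{\beta}} | \mathbf{X})] = \boldsymbol{\beta} $.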
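A small Monte Carlo check can make this concrete. The sketch below is illustrative only: the coefficient values, sample size, number of replications, and the function name `simulate_ols_mean` are assumptions chosen for the example, not part of the original answer. The regressors are redrawn in every replication, and the errors are drawn independently of them so that $ E(\boldsymbol{\epsilon} | \mathbf{X}) = \mathbf{0} $ holds by construction.

```python
import numpy as np

def simulate_ols_mean(beta, n=50, reps=5000, seed=0):
    """Average the OLS estimate over samples in which X is redrawn each time."""
    rng = np.random.default_rng(seed)
    beta = np.asarray(beta, dtype=float)
    k = beta.size
    estimates = np.empty((reps, k))
    for r in range(reps):
        # Stochastic regressors: an intercept plus k-1 random columns.
        X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])
        # Errors are independent of X, so E(eps | X) = 0 by construction.
        eps = rng.normal(size=n)
        y = X @ beta + eps
        # Least-squares solution of y = X b (the OLS estimate).
        estimates[r] = np.linalg.lstsq(X, y, rcond=None)[0]
    return estimates.mean(axis=0)

beta_true = np.array([1.0, 2.0, -0.5])  # hypothetical true coefficients
mean_estimate = simulate_ols_mean(beta_true)
print("true beta:        ", beta_true)
print("mean OLS estimate:", mean_estimate)
```

Under these assumptions the averaged estimate should sit close to the true coefficient vector, with the gap shrinking as the number of replications grows.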

Question 2

Is an unbiased estimator always preferred over a biased one?

Accepted Answer

Not necessarily. Unbiasedness says nothing about an estimator's variance. Mean squared error (MSE) accounts for both: for a scalar estimator, $ \text{MSE}(\hat{\theta}) = \text{Var}(\hat{\theta}) + \text{Bias}(\hat{\theta})^2 $. A slightly biased estimator with much lower variance can therefore have a smaller MSE than an unbiased one and may be preferred on that criterion. This is the bias-variance trade-off.
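As a rough illustration of the trade-off, the sketch below compares the sample mean (unbiased) with a deliberately shrunk version of it (biased, lower variance). The setup is an assumption made for the example: the values of `mu`, `sigma`, `n`, and the shrinkage factor `c`, and the function name `mse_of_scaled_mean`, do not come from the original answer. For these values the theoretical MSEs are $ \sigma^2/n = 0.9 $ for the sample mean and $ c^2 \sigma^2/n + (1-c)^2 \mu^2 = 0.475 $ for the shrunk estimator.

```python
import numpy as np

def mse_of_scaled_mean(c, mu=1.0, sigma=3.0, n=10, reps=100_000, seed=0):
    """Simulated MSE of the estimator c * (sample mean) for the true mean mu."""
    rng = np.random.default_rng(seed)
    # One row per simulated sample of size n.
    samples = rng.normal(mu, sigma, size=(reps, n))
    estimates = c * samples.mean(axis=1)
    return np.mean((estimates - mu) ** 2)

mse_unbiased = mse_of_scaled_mean(1.0)  # c = 1: the ordinary sample mean
mse_shrunk = mse_of_scaled_mean(0.5)    # c = 0.5: biased toward zero
print("MSE, unbiased sample mean:", mse_unbiased)
print("MSE, shrunk estimator:    ", mse_shrunk)
```

The shrunk estimator wins here only because the true mean is small relative to the noise; with a large `mu` the squared bias would dominate and the unbiased estimator would have the lower MSE.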

Question 3

What happens to unbiasedness if there is perfect multicollinearity among the regressors?

Accepted Answer

Perfect multicollinearity means the design matrix $ \mathbf{X} $ does not have full column rank, so $ \mathbf{X}^T \mathbf{X} $ is singular and its inverse $ (\mathbf{X}^T \mathbf{X})^{-1} $ does not exist. The normal equations $ \mathbf{X}^T \mathbf{X} \mathbf{b} = \mathbf{X}^T \mathbf{Y} $ then have infinitely many solutions, so the OLS estimator $ \hat{\boldsymbol{\beta}} $ is not uniquely defined and the question of its unbiasedness is moot.
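The non-uniqueness can be seen in a tiny example. In the sketch below, the data and the two coefficient vectors are made up for illustration: the third column of the design matrix is exactly twice the second, so the matrix has three columns but rank two, and two different coefficient vectors produce identical fitted values.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
# Columns: intercept, x, and 2*x (an exact linear function of the second column).
X = np.column_stack([np.ones_like(x), x, 2.0 * x])

rank = np.linalg.matrix_rank(X)
print("columns:", X.shape[1], "rank:", rank)

# Two different coefficient vectors that both imply fitted values 1 + 2*x.
b_a = np.array([1.0, 2.0, 0.0])
b_b = np.array([1.0, 0.0, 1.0])
same_fit = np.allclose(X @ b_a, X @ b_b)
print("identical fitted values:", same_fit)
```

Because the data cannot distinguish `b_a` from `b_b`, no estimator based on this design can pin down the individual coefficients on the two collinear columns.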

Question 4

How does omitted variable bias specifically break the unbiasedness of OLS?

Accepted Answer

Omitted variable bias arises when a relevant variable, one that affects the dependent variable and is correlated with an included regressor, is left out of the model. Its effect is absorbed into the error term $ \boldsymbol{\epsilon} $, which makes $ \boldsymbol{\epsilon} $ correlated with the included $ \mathbf{X} $. This violates $ E(\boldsymbol{\epsilon} | \mathbf{X}) = \mathbf{0} $, so $ E((\mathbf{X}^T \mathbf{X})^{-1} \mathbf{X}^T \boldsymbol{\epsilon}) $ is no longer zero and $ \hat{\boldsymbol{\beta}} $ is biased.
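A simulation sketch of this mechanism follows. Everything in it is an assumption for illustration: the two-regressor model without an intercept, the parameter values, and the function name `simulate_ovb`. The omitted regressor is generated as `x2 = delta * x1 + noise`, so in this setup the short regression of `y` on `x1` alone has expected slope `beta1 + beta2 * delta` (here 1 + 2 × 0.5 = 2), while the regression that includes both regressors stays centered on `beta1`.

```python
import numpy as np

def simulate_ovb(beta1=1.0, beta2=2.0, delta=0.5, n=200, reps=2000, seed=0):
    """Mean slope on x1 when x2 is omitted versus when it is included."""
    rng = np.random.default_rng(seed)
    short = np.empty(reps)
    full = np.empty(reps)
    for r in range(reps):
        x1 = rng.normal(size=n)
        # x2 is correlated with x1 through delta.
        x2 = delta * x1 + rng.normal(size=n)
        y = beta1 * x1 + beta2 * x2 + rng.normal(size=n)
        # Short regression: y on x1 only, so x2's effect lands in the error.
        short[r] = (x1 @ y) / (x1 @ x1)
        # Full regression: y on both x1 and x2; keep the coefficient on x1.
        X = np.column_stack([x1, x2])
        full[r] = np.linalg.lstsq(X, y, rcond=None)[0][0]
    return short.mean(), full.mean()

mean_short, mean_full = simulate_ovb()
print("mean slope on x1, x2 omitted: ", mean_short)
print("mean slope on x1, x2 included:", mean_full)
```

Setting `delta=0.0` or `beta2=0.0` in this sketch removes the bias, which matches the two conditions in the answer: the omitted variable must both matter for the dependent variable and be correlated with an included regressor.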

Proof of Unbiasedness of the OLS Estimator: E(β̂) = β


