Model Selection Criteria: Derivation and Comparison of AIC and BIC
Master the rigorous derivation and institutional intuition behind AIC and BIC. Learn how to navigate the bias-variance trade-off in General Linear Models.
Visualizing...
Our institutional research engineers are currently mapping the formal proof for Model Selection Criteria: Derivation and Comparison of AIC and BIC.
Apply for Institutional Early Access →The Formal Theorem
Analytical Intuition.
Institutional Warning.
Students often confuse the 'truth' assumption: AIC does not assume the true model is in the set, prioritizing predictive performance. BIC assumes a true model exists, prioritizing consistent identification of that model in the limit as .
Academic Inquiries.
Why does BIC penalize complexity more harshly than AIC?
Because the penalty term grows with sample size , whereas AIC's remains constant. Once , or , BIC's penalty exceeds AIC's.
Can I compare models with different transformations of the dependent variable using AIC?
No. AIC values depend on the Jacobian of the transformation. You must use the same likelihood function definition for all models being compared.
What happens if I use AIC or BIC when my model is misspecified?
AIC remains a valid tool for comparing predictive performance under misspecification. BIC's theoretical justification as an approximation of the posterior probability of the 'true' model becomes ambiguous if the true model is absent.
Standardized References.
- Definitive Institutional SourceBurnham, K. P., & Anderson, D. R., Model Selection and Multimodel Inference.
Related Proofs Cluster.
The Matrix Formulation of the General Linear Model: Y = Xβ + ϵ and its Fundamental Assumptions
Master the matrix formulation of the General Linear Model, \( Y = X\beta + \epsilon \), and its fundamental assumptions. Rigorous yet intuitive content for BSc Math/Stats students.
Derivation of the Ordinary Least Squares (OLS) Estimator: β̂ = (X'X)⁻¹X'Y
Master the OLS estimator derivation: \( \hat{\beta} = (X'X)^{-1}X'Y \). Explore the geometric orthogonality, matrix calculus, and Gauss-Markov foundations.
Proof of Unbiasedness of the OLS Estimator: E(β̂) = β
Master the rigorous proof of OLS estimator unbiasedness, \( E(\hat{\boldsymbol{\beta}}) = \boldsymbol{\beta} \). Understand critical assumptions, geometric intuition, and common pitfalls for robust linear modeling.
Derivation of the Variance-Covariance Matrix of the OLS Estimator: Var(β̂) = σ²(X'X)⁻¹
A rigorous derivation of the Variance-Covariance matrix for the OLS estimator, exploring the geometric impact of data configuration on statistical precision.
Institutional Citation
Reference this proof in your academic research or publications.
NICEFA Visual Mathematics. (2026). Model Selection Criteria: Derivation and Comparison of AIC and BIC: Visual Proof & Intuition. Retrieved from https://www.nicefa.org/library/general-linear-models-/model-selection-criteria--derivation-and-comparison-of-aic-and-bic
Dominate the Logic.
"Abstract theory is just a movement we haven't seen yet."