Funding: This work was supported by Leidos, National Cancer Institute, Nebraska Program of Excellence in Computational Science and National Science Foundation (2007418).

Read the full text

About

PDF

Tools

Share a link

Email
Wechat
Bluesky

ABSTRACT

We give a decomposition of the predictive variance based on the law of total variance by making the response variable dependent on a finite dimensional discrete random variable representing our modeling assumptions. Then, we test which terms in this decomposition are small enough to ignore. This allows us to identify which of the discrete random variables, that is, aspects of modeling, are most important to prediction variance. The terms in the decomposition admit interpretations based on conditional means and variances and are analogous to the terms in a Cochran's theorem decomposition of squared error often used in analysis of variance. Thus, the modeling features are treated as factors in completely randomized design.

Conflicts of Interest

The authors declare no conflicts of interest.

Open Research

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

1P. Smyth and D. Wolpert, “Linearly Combining Density Estiamtors via Stacking,” Machine Learning 36 (1999): 59–83.
10.1023/A:1007511322260
Google Scholar
2D. Draper, “Assessment and Propagation of Model Uncertainty,” Journal of the Royal Statistical Society: Series B: Methodological 57, no. 1 (1995): 45–97.
10.1111/j.2517-6161.1995.tb02015.x
Web of Science® Google Scholar
3D. Dustin and B. Clarke, “Testing for the Important Components of Posterior Predictive Variance,” arXiv:2209.00636, 2022.
Google Scholar
4E. George, “ Dilution Priors: Compensating for Model Space Redundancy,” in Borrowing Strength: Theory Powering Applications—A Festschrift for Lawrence D. Brown. IMS Collections, vol. 6, ed. J. O. Berger, T. T. Cai, and I. M. Johnstone (Institute of Mathematical Statistics, 2010), 158–165.
10.1214/10-IMSCOLL611
Google Scholar
5W. Wang, S. Mukherjee, S. Richardson, and S. Hill, “High Dimensional Regression in Practice: An Empirical Study of Finite-Sample Prediction, Variable Selection, and Ranking,” Statistics and Computing 30 (2020): 697–719.
10.1007/s11222-019-09914-9
PubMed Web of Science® Google Scholar
6D. Dustin, J. Clarke, and B. Clarke, “Predictive Stability Criteria for Penalty Selection in Linear Models,” Computational Statistics 39 (2024): 1241–1280.
10.1007/s00180-023-01342-8
Web of Science® Google Scholar
7X. Zhang and C.-A. Liu, “Model Averaging Prediction by k-Fold Cross-Validation,” Journal of Econometrics 235, no. 1 (2023): 280–301.
10.1016/j.jeconom.2022.04.007
Web of Science® Google Scholar
8N. J. Higham, “Computing a Nearest Symmetric Positive Semidefinite Matrix,” Linear Algebra and Its Applications 103 (1988): 103–118.
10.1016/0024-3795(88)90223-6
Web of Science® Google Scholar
9S. Zhao, D. Witten, and A. Shojaie, “In Defense of the Indefensible: A Very Naive Approach to High-Dimensional Inference,” Statistical Science 36 (2021): 562–577.
10.1214/20-STS815
PubMed Web of Science® Google Scholar
10H. Scheffé, The Analysis of Variance (John Wiley and Sons, 1959).
Web of Science® Google Scholar
11P. Gustafson and B. Clarke, “Decomposing Posterior Variance,” Journal of Statistical Planning and Inference 119, no. 2 (2004): 311–327.
10.1016/S0378-3758(02)00491-3
Web of Science® Google Scholar
12N. Meinshausen, M. Mathiuus, and P. Bühlmann, “Asymptotic Optimality of the Westfall-Young Permutation Procedure for Multiple Testing Under Dependence,” Annals of Statistics 39 (2011): 3369–3391.
10.1214/11-AOS946
Web of Science® Google Scholar
13K. Hamidieh, “A Data-Driven Statistical Model for Predicting the Critical Temperature of a Superconductor,” Computational Materials Science 154 (2018): 346–354.
10.1016/j.commatsci.2018.07.052
CAS Web of Science® Google Scholar
14K. Kira and L. A. Rendell, “ A Practical Approach to Feature Selection,” in Machine Learning Proceedings 1992, ed. D. Sleeman and P. Edwards (Morgan Kaufmann, 1992), 249–256.
10.1016/B978-1-55860-247-2.50037-1
Google Scholar
15I. Kononenko, “Estimating Attributes: Analysis and Extensions of RELIEF,” in European Conference on Machine Learning (Springer, 1994), 171–182.
Google Scholar
16M. Robnik-Šikonja and I. Kononenko, “An Adaptation of Relief for Attribute Estimation in Regression,” Machine Learning: Proceedings of the Fourteenth International Conference (ICML'97) (Vol. 5, Citeseer, 1997), 296–304.
Google Scholar
17J. Chen, C. Richard, H. Lantéri, C. Theys, and P. Honeine, “A Gradient Based Method for Fully Constrained Least-Squares Unmixing of Hyperspectral Images,” in 2011 IEEE Statistical Signal Processing Workshop (SSP) (IEEE, 2011), 301–304.
Google Scholar
18H. Toutenberg and Shalabh, Statistical Analysis of Designed Experiments (Springer, 2009).
Google Scholar
19G. Box, “Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems. I. Effect of Inequality of Variance in the One Way Classification,” Annals of Mathematical Statistics 25 (1954): 290–302.
10.1214/aoms/1177728786
Google Scholar

Volume18, Issue4

August 2025

e70029

Testing for the Important Components of Predictive Variance

ABSTRACT

Conflicts of Interest

Open Research

Data Availability Statement

References

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Testing for the Important Components of Predictive Variance

ABSTRACT

Conflicts of Interest

Open Research

Data Availability Statement

References

References

Related

Information