Volume 31, Issue 10 pp. 741-751
Full Paper

A Kernel-Based Method for Assessing Uncertainty on Individual QSAR Predictions

Cleo Tebby

Corresponding Author

Cleo Tebby

Unité Modèles pour l'Ecotoxicologie et la Toxicologie (METO), INERIS, Parc Technologique Alata, BP2, 60550 Verneuil-en-Halatte, France

Unité Modèles pour l'Ecotoxicologie et la Toxicologie (METO), INERIS, Parc Technologique Alata, BP2, 60550 Verneuil-en-Halatte, FranceSearch for more papers by this author
Enrico Mombelli

Enrico Mombelli

Unité Modèles pour l'Ecotoxicologie et la Toxicologie (METO), INERIS, Parc Technologique Alata, BP2, 60550 Verneuil-en-Halatte, France

Search for more papers by this author
First published: 25 September 2012
Citations: 5

Graphical Abstract

Abstract

The assessment of uncertainty attached to individual predictions is now a priority for sound decision-making in risk assessment. QSAR predictive uncertainty is affected by a variety of factors related to the quality of the training set data, the adopted statistical models, and the distance between the query chemical and the training set. We developed a method to quantify uncertainty associated with individual linear QSAR predictions that integrates both model and experimental error uncertainty and that defines an applicability domain based on the density of training set data. Our method is based on chemical spaces defined by latent variables identified by Partial Least Squares (PLS) regressions. The method provides a kernel regression estimate of the activity of interest as well as a measure of predictive uncertainty based on a mathematical estimation of the domain of applicability and on local propagation of uncertainty associated with training set data.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.