We report a novel method called ADAN (Applicability Domain ANalysis) for assessing the reliability of drug property predictions obtained by in silico methods. The assessment provided by ADAN is based on the comparison of the query compound with the training set, using six diverse similarity criteria. For every criterion, the query compound is considered out of range when the similarity value obtained is larger than the 95th percentile of the values obtained for the training set. The final outcome is a number in the range 0 to 6 that expresses the number of unmeet similarity criteria and allows classifying the query compound within seven reliability categories. Such categories can be further exploited to assign simpler reliability classes using a traffic light schema, to assign approximate confidence intervals or to mark the predictions as unreliable. The whole methodology has been validated simulating realistic conditions, where query compounds are structurally diverse from those in the training set. The validation exercise involved the construction of more than one thousand models. These models were built using a combination of training set, molecular descriptors and modeling methods representative of the real predictive tasks performed in the eTOX project (a project aiming to predict in vivo toxicological endpoints in drug development). Validation results confirm the robustness of the proposed assessment methodology, which compares favorably with other classical methods based solely on the structural similarity of the compounds. ADAN characteristics make the method well suited for estimate the quality of drug predictions obtained in extremely unfavorable conditions, like the prediction of drug toxicity endpoints.
# install the package with # install.packages( "/PATH/TO/adan_X.X.tar.gz" , repos = NULL , type = "source" ) > library(adan) > vignette("adan")