The inconclusive category in forensic reporting is the appropriate response in many cases, but it poses challenges in estimating an “error rate”. We discuss a class of information-theoretic measures related to cross entropy as an alternative set of metrics that allows performance evaluation of results presented using multi-category reporting scales. This paper shows how this class of performance metrics, and in particular the log likelihood ratio cost, which is already in use with likelihood ratio forensic reporting methods and in machine learning communities, can be readily adapted for use with the widely used multiple-category conclusion scales. Bayesian credible intervals on these metrics can be estimated using numerical methods. The application of these metrics to published test results is shown. These test results demonstrate that reducing the number of categories used in a proficiency test from five or six to three increases the cross entropy, indicating that the higher number of categories was justified, as it increased the level of agreement with ground truth.
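As a rough illustration of the idea in the abstract, the sketch below computes the log likelihood ratio cost (Cllr, in bits) for a categorical conclusion scale and shows that merging categories cannot lower it. It is not the authors' procedure: it assumes each category's likelihood ratio is estimated directly from the same counts (a plug-in estimate), it uses the standard two-term Cllr definition with equal weight on same-source and different-source trials, and it omits the Bayesian credible intervals entirely. The function names and the counts are hypothetical and chosen only for demonstration.

```python
import math


def cllr_from_counts(same_counts, diff_counts):
    """Cllr in bits for a categorical scale.

    Assumption: the LR of each category is the plug-in estimate
    P(category | same source) / P(category | different source),
    taken from the counts passed in.
    """
    n_same = sum(same_counts)
    n_diff = sum(diff_counts)
    total = 0.0
    for s, d in zip(same_counts, diff_counts):
        p = s / n_same  # P(category | same source)
        q = d / n_diff  # P(category | different source)
        if p > 0:
            # same-source penalty: log2(1 + 1/LR) = log2((p + q) / p)
            total += 0.5 * p * math.log2((p + q) / p)
        if q > 0:
            # different-source penalty: log2(1 + LR) = log2((p + q) / q)
            total += 0.5 * q * math.log2((p + q) / q)
    return total


def merge(counts, groups):
    """Collapse categories; each group is a tuple of original indices."""
    return [sum(counts[i] for i in g) for g in groups]


# Hypothetical five-category counts, NOT taken from the paper.
same5 = [60, 20, 15, 4, 1]   # same-source trials per category
diff5 = [1, 4, 15, 20, 60]   # different-source trials per category

# Collapse to three categories: support / inconclusive / opposing support.
groups = [(0, 1), (2,), (3, 4)]

cllr5 = cllr_from_counts(same5, diff5)
cllr3 = cllr_from_counts(merge(same5, groups), merge(diff5, groups))

print(f"Cllr, five categories:  {cllr5:.4f} bits")
print(f"Cllr, three categories: {cllr3:.4f} bits")
```

Under these assumptions, a scale that carries no information gives Cllr = 1 bit, and lower values indicate better agreement with ground truth. Merging categories that have different likelihood ratios discards information, so the three-category value comes out higher than the five-category value.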

CONFLICT OF INTEREST STATEMENT

The authors have no conflicts of interest to declare.

Supporting Information

Filename: jfo15686-sup-0001-Figure_S1.png (PNG image, 399.8 KB)
Description: Figure S1.

Filename: jfo15686-sup-0002-Supplemental Information.docx (Word 2007 document, 81.2 KB)
Description: Tables S1 to S17.


Volume 70, Issue 2, March 2025, Pages 589-606

Cross entropy and log likelihood ratio cost as performance measures for multi-conclusion categorical outcomes scales
