On Agreement Tables with Constant Kappa Values
Abstract
Kappa coefficients are standard tools for summarizing the information in cross-classifications of two categorical variables with identical categories, here called agreement tables. When two categories are combined the kappa value usually either increases or decreases. There is a class of agreement tables for which the value of Cohen’s kappa remains constant when two categories are combined. It is shown that for this class of tables all special cases of symmetric kappa coincide and that the value of symmetric kappa is not affected by any partitioning of the categories.
1. Introduction
In behavioral and biomedical science researchers are often interested in measuring the intensity of a behavior or a disease. Examples are psychologists that assess how anxious a speech-anxious subject appears while giving a talk, pathologists that rate the severity of lesions from scans, or competing diagnostic devices that classify the extent of a disease in patients into categories. These phenomena are typically classified using a categorical rating system, for example, with categories (A) slight, (B) moderate, and (C) extreme. Because ratings usually entail a certain degree of subjective judgment, researchers frequently want to assess the reliability of the categorical rating system that is used. One way to do this is to assign two observers to rate independently the same set of subjects. The reliability of the rating system can then be assessed by analyzing the agreement between the observers. High agreement between the ratings can be seen as a good indication of consensus in the diagnosis and interchangeability of the ratings of the observers.
Various statistical methodologies have been developed for analyzing agreement of a categorical rating system [1, 2]. For instance, loglinear models can be used for studying the patterns of agreement and sources of disagreement [3, 4]. However, in practice researchers often want to express the agreement between the raters in a single number. In this context, standard tools for summarizing agreement between observers are coefficients Cohen’s kappa in the case of nominal categories [5–7] and weighted kappa in the case of ordinal categories [8–11]. With ordinal categories one may expect more disagreement or confusion on adjacent categories than on categories that are further apart. Weighted kappa allows the user to specify weights to describe the closeness between categories [12]. Both Cohen’s kappa and weighted kappa are corrected for agreement due to chance. The coefficients were originally proposed in the context of agreement studies, but nowadays they are used for summarizing all kinds of cross-classifications of two variables with the same categories [11, 12].
The number of categories used in various rating systems usually varies from the minimum number of two to five in many practical applications. It is sometimes desirable to combine some of the categories [7]. For example, when two categories are easily confused, combining the categories usually improves the reliability of the rating system [13]. By collapsing categories the number of categories of the rating system is reduced. If there is a lot of disagreement between two categories, we expect the kappa value to increase if we combine the categories. This is usually the case. However, Schouten [13] showed that there is a class of agreement tables for which the value of Cohen’s kappa remains constant when categories are merged. This is not what one expects from an agreement coefficient like Cohen’s kappa. The question, then, arises: do other (weighted) kappa coefficients exhibit the same property for these tables? If the answer is negative, it would make sense to replace Cohen’s kappa by a weighted kappa with more favorable properties with regard to these agreement tables.
In this paper we present several properties of kappa coefficients with symmetric weighting schemes with respect to this particular class of agreement tables. The paper is organized as follows. In the next section we introduce notation, define weighted kappa, and discuss some of its special cases, including Cohen’s kappa. The results are presented in Section 3. Section 4 contains a conclusion.
2. Kappa Coefficients
In this section we introduce notation and define the kappa coefficients. For notational convenience weighted kappa is here defined in terms of dissimilarity scaling [8]. If the weights are dissimilarities, pairs of categories that are further apart are assigned higher weights.
First observer | Second observer | |||||||
---|---|---|---|---|---|---|---|---|
A | B | C | Total | A | B | C | Total | |
A | 22 | 2 | 0 | 24 | 16 | 4 | 0 | 20 |
B | 4 | 10 | 0 | 14 | 0 | 2 | 1 | 3 |
C | 4 | 2 | 6 | 12 | 4 | 0 | 2 | 6 |
Total | 30 | 14 | 6 | 50 | 20 | 6 | 3 | 29 |
Identity | Quadratic | |||||||
---|---|---|---|---|---|---|---|---|
A | B | C | D | A | B | C | D | |
A | 0 | 1 | 1 | 1 | 0 | 1 | 4 | 9 |
B | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 4 |
C | 1 | 1 | 0 | 1 | 4 | 1 | 0 | 1 |
D | 1 | 1 | 1 | 0 | 9 | 4 | 1 | 0 |
3. A Class of Agreement Tables
For Cohen’s kappa in (6) Schouten [13] showed that if (9) holds, then the kappa value cannot be increased or decreased by combing categories. In this section we present various additional results for other special cases of symmetric kappa in (4). Theorem 1 shows that all special cases of symmetric kappa coincide if (9) holds.
Theorem 1. If (9) holds, then κs = 1 − d.
Proof. If (9) holds, we have the particular case
A converse version of Theorem 1 also holds. Lemma 2 is used in the proof of Theorem 3.
Lemma 2. Let a, b ≥ 0 and c, d > 0 be real numbers. One has
Proof. Since c and d are positive numbers, we have a/c = b/d or ad = bc. Adding ac to both sides we obtain a(c + d) = c(a + b) or a/c = (a + b)/(c + d).
Theorem 3. If all special cases of symmetric kappa are equal, then (9) holds.
Proof. Let r, r′ ∈ {1,2, …, c} with r ≠ r′ be arbitrary categories. Let denote the value of the special case of symmetric kappa with and all other off-diagonal weights equal to 1. Since all special cases of symmetric kappa are equal, we have in particular for some real number d ≥ 0. Using (6), the identity is equivalent to
Note that in the proof of Theorem 3 certain special cases of coefficient (4) are used. Condition (9) will not necessarily hold if two arbitrary special cases of symmetric kappa are equal. We have the following consequences of Theorems 1 and 3.
Corollary 4. It holds that κs = 1⇔πij = 0 for i ≠ j and 1 ≤ i, j ≤ c.
Corollary 5. It holds that
Theorem 6 shows that if (9) holds, then the value of coefficient (4) remains constant when we combine two categories.
Theorem 6. Let κs denote the value of symmetric kappa of an agreement table with c ≥ 3 categories and the value of the table that is obtained by combining categories r′ and r′′. If condition (9) holds, then one has .
Proof. Since (9) holds, it follows from Theorem 1 that κs = 1 − d for some d ≥ 0. Let r denote the category that is obtained by merging r′ and r′′. Let i with 1 ≤ i ≤ c and i ≠ r′, r′′ be an arbitrary category. We have the four relations
Theorem 6 shows that if the value of Cohen’s kappa in (6) remains constant when categories are combined, then the value of symmetric kappa in (4) also remains constant when categories are combined. By repeatedly applying Theorem 6 we obtain the following consequence.
Corollary 7. Let κs denote the value of symmetric kappa of an agreement table with c ≥ 3 categories and the value of the collapsed table corresponding to any partitioning of the categories. If (9) holds, then one has .
4. Conclusion
Kappa coefficients are standard tools for summarizing agreement between two observers on a categorical rating scale. The coefficients are nowadays used for summarizing the information in all types of cross-classifications of two variables with the same categories. In the case of nominal categories Cohen’s kappa is a standard tool. In this paper we considered a class of agreement tables for which the value of Cohen’s kappa remains constant when two categories are combined. It was shown that for this class of agreement tables all special cases of symmetric kappa, that is, all kappa coefficients with a symmetric weighting scheme, coincide (Theorem 1). Furthermore, for this class of agreement tables the value of symmetric kappa remains constant when categories are merged (Theorem 6 and Corollary 7).
Conflict of Interests
The author declares that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
The author thanks an anonymous reviewer for several helpful comments and valuable suggestions on a previous version of the paper. The comments have improved the presentation of the paper. This research is part of Veni project 451-11-026 funded by the Netherlands Organisation for Scientific Research.