A New Correlation Coefficient for T-Spherical Fuzzy Sets and Its Application in Multicriteria Decision-Making and Pattern Recognition
Abstract
The goal of this paper is to design a new correlation coefficient for T-spherical fuzzy sets (TSFSs), which can accurately measure the nature of correlation (i.e., positive and negative) as well as the degree of relationship between TSFS. In order to formulate our proposed idea, we had taken inspiration from the statistical concept of the correlation coefficient. While doing so, we firstly introduce the variance and covariance of two TSFS and then constructed our scheme using these two newly defined notions. The numerical value of our proposed correlation coefficient lies within the interval [−1, +1], as it should be from a statistical point of view, whereas the existing methods cannot measure the negative correlation between TSFS, as their numerical value falls within the interval [0, 1], which is not reasonable both statistically and intuitively. This aspect has also been thoroughly demonstrated using some numerical examples. The comparison results witnessed the dominance and upper hand of our proposed method over the existing definitions, with reliable and better results. In order to demonstrate the feasibility, usefulness, and practical application, we applied our proposed scheme to solve technical and scientific problems of multicriteria decision-making and pattern recognition. The numerical results show that our proposed scheme is practically suitable, technically applicable, and intuitively reasonable.
1. Introduction
The notion of fuzzy sets (FS) was initially proposed by Zadeh [1] to deal with imprecise and vague situations where the membership MA(x) is assigned to each element [2, 3], and it is a generalization of the crisp set [4]. In crisp set theory, we consider only deterministic and precise situations that could not handle imprecise and vague situations. Alternatively, fuzzy sets provide better solutions for real-life problems. Various researchers have developed certain novel techniques related to fuzzy sets and demonstrated their application in other fields such as [5–14] that are some practical examples. In a fuzzy set, however, only the degree of membership MA(x) is of relevance to research, and nonmembership NA(x) is not an option. In light of this, Atanassov [15] developed the notion of intuitionistic fuzzy sets (IFS), which takes into account both membership and nonmembership functions under a particular condition: 0 ≤ MA(x) + NA(x) ≤ 1, where 0 ≤ MA(x) ≤ 1 and 0 ≤ NA(x) ≤ 1, and the term τA(x) = 1 − MA(x) − NA(x) is called degree of hesitancy. IFS has a wide range of application in many fields, e.g., decision-making [16], logic programming [17], pattern recognition [18–20] medical diagnosis [21], information retrieval [22] and cluster analysis [23], and communication [24–26]. Although the theory of IFS has been successfully applied in different fields, there are some real-life situations where human opinions involve more than two independent statements, like yes, no, abstain, and refusal. Therefore, to deal with these types of situations, Cuong and Kreinovich [27] extended Zadeh FS and Atanassov’s IFS into picture fuzzy sets (PFS) with the following condition (MA(x) + IA(x) + NA(x) ≤ 1), where MA(x)(∈[0, 1]) is the degree of positive membership, IA(x)(∈[0, 1]) which represents the degree of neutral membership, and NA(x)(∈[0, 1]) is a degree of negative membership. The proposed idea of Cuong and Kreinovich [27] has great importance and application to deal with human opinion efficiently as we have discussed earlier but in some particular scenarios, the sum of membership degrees exceeds 1, i.e., (MA(x) + IA(x) + NA(x) > 1). In these situations, IFS is insufficient to provide a satisfactory result. To address this problem, Mahmood et al. [28] developed the spherical fuzzy set (SFS) by adding new constraints: and which is a generalization of FS, IFS, and PFS, which satisfies the sum of membership degrees is less than or equal to 1. The concept of TSFS is a recent development in fuzzy set theory and successfully applied to medical diagnosis and multiattribute decision-making. In addition, the extensions of IFS method are also found in literature abundantly such as fuzzy multisets [29], type-2 fuzzy sets [30], hesitant fuzzy sets [31], and Pythagorean fuzzy sets, [32].
earson proposed the concept of correlation in 1895, and it has since become one of the most widely used indices in statistics. Correlation is a statistical term that describes how two variables are related in a linear pattern. Murthy et al. [33] introduced the concept of fuzzy set correlation for the first time in 1985. The correlation coefficient for IFS was proposed by Gerstenkorn and Manko [34] who introduced the correlation coefficient for IFS. Later on, Hong and Hwang [35] went on to extend intuitionistic fuzzy sets’ association in probability space. Mitchell [36] illustrated new formula for intuitionistic fuzzy sets and interpreted two intuitionistic fuzzy sets as ensembles of ordinary fuzzy sets. Several novel expansions, such as [23, 37–41], have also been proposed in the literature. Ullah et al. [42] recently discovered correlation coefficients for T-spherical fuzzy sets, which they used in clustering and multiattribute decision-making. Guleria and Bajaj [43] also presented a similar idea of the correlation coefficient for TSFSs and illustrated its application in pattern recognition and medical diagnosis. These newly developed ideas are applicable in many areas but their proposed method only measures [0, 1] ranging correlations and is unable to deal with negative correlation. Keeping this drawback in mind, we propose a new correlation coefficient for TSFSs that can easily address the highlighted weaknesses in the existing methods. The numerical value of our proposed correlation coefficient lies within the interval [−1, +1] as it should be from a statistical point of view, whereas the existing methods cannot measure the negative correlation between TSFSs, as their numerical value falls within the interval [0, 1], which is not reasonable both statistically and intuitively.
The organization of this manuscript has been furnished as follows. Section 2 entails basic concepts of IFS, PFS, SFS, and TSFS. It also reviews some existing ideas of the CC of TSFS. Section 3 details our proposed scheme (CC of TSFS), and it also validates and verifies our method with the help of some theorems and propositions. In Section 4, we demonstrate numerical comparisons regarding the performance of our method and the other existing methods to show the strengths and superiority of the proposed idea. Section 5 illustrates the feasibility and usefulness of our idea in the thematic areas of MCDM and PR with the help of some real-world problems. In Section 6, we finalize this manuscript with concluding remarks.
2. Preliminaries
This section has been dedicated to recalling some essential theories, functions, and characteristics of TSFSs as well as the coefficient of correlation to facilitate the understanding of our research study. In the entire paper, we use M(x), N(x), I(x), and Λ(x) to represent the degree of membership, nonmembership, abstinence, and refusal in a unit interval [0, 1], respectively.
2.1. Generalizations of Fuzzy Sets
Definition 1. (see [15].)Let F be an intuitionistic fuzzy set (IFS) on a universe of discourse X = {x1, x2, x3, ⋯, xm}, and then it can be algebraically defined as F = {〈x, MF(x), NF(x)〉/x ∈ X}, with a constraint of 0 ≤ MF(x) + NF(x) ≤ 1, whereas H = 1 − {MF(x) + NF(x)} is known as the degree of hesitancy for an element x ∈ X to be belonging toF.
Definition 2. (see [44].)A mathematical expression for a PFS P defined over a universe of discourse X = {x1, x2, x3, ⋯, xm} can be presented as J = {〈x, MJ(x), IJ(x), NJ(x)〉/x ∈ X}, with a constraint of 0 ≤ MJ(x) + IJ(x) + NJ(x) ≤ 1, where the degree of refusal for an element x ∈ X to be in J is represented as Λ(x) = 1 − {MJ(x) + IJ(x) + NJ(x)}.
Definition 3. (see [45].)The standard negation set of the PFS J can be expressed as N(J) = {〈x, NJ(x), ΛJ(x), MJ(x)〉/x ∈ X}.
Definition 4. (see [28].)Let S be an SFS on a universe of discourse X = {x1, x2, x3, ⋯, xm}, and then it can be mathematically defined as S = {〈x, MS(x), IS(x), NS(x)〉/x ∈ X}, with a constraint of , while the degree of refusal for an element x ∈ X to be belonging to S can be expressed as .
Definition 5. (see [28].)For a TSFS T on a universe of discourse X = {x1, x2, x3, ⋯, xm}, its mathematical representation can be expressed as T = {〈x, MT(x), IT(x), NT(x)〉/x ∈ X}, with a constraint of . The degree of refusal for an element x ∈ X to be in TSFS T is defined as .
2.2. Existing Methods (CC of TSFSs)
Definition 6. (see [42].)Let S1 and S2 be two TSFSs in a universe of discourse X = {x1, x2, x3, ⋯, xm}, and the coefficient of correlation between TSFSs S1 and S2 is
while the correlation coefficient for weighted TSFSs is
Both of the above definitions (1) and (2) satisfy the following features:
- (a)
R(S1, S2) = R(S2, S1)
- (b)
0 ≤ R(S1, S2) ≤ 1
- (c)
R(S1, S2) = 1, iff S1 = S2
Definition 7. (see [42].)Another form of correlation coefficient between two TSFSs S1 and S2 in X = {x1, x2, x3, ⋯, xm} is
whereas its weighted form can be defined as
Equations (3) and (4) also satisfy the following properties:
- (a)
R∗(S1, S2) = R∗(S2, S1)
- (b)
0 ≤ R∗(S1, S2) ≤ 1
- (c)
R∗(S1, S2) = 1, iff S1 = S2
Definition 8. (see [43].)Coefficient of correlation between TSFSs S1 and S2 in X = {x1, x2, x3, ⋯, xm} is defined as
while the correlation coefficient for weighted TSFSs is
3. Proposed New Correlation Coefficient of TSFSs
We propose a new correlation coefficient(s) of TSFSs, including both the weighted and unweighted environments, to overcome the drawbacks of the existing definitions [37, 38]. It has been formulated using a statistical theory of correlation coefficient between two phenomena, that is, the mathematical ratio between the covariance of two TSFSs and the geometric mean of their individual variances. Initially, we introduce the variances and covariance of two TSFSs and use those new definitions to construct our proposed correlation coefficient.
Motivated from [46], we designed the structure of our proposed new correlation coefficient of TSFSs, demonstrated as follows.
Definition 10. For two TSFSs and , variances in terms of new variables can be defined as
while their covariance is mathematically expressed as
Proposition 11. For Cov(T1, T2) to be the covariance of two TSFSs T1 and T2, it should satisfy the following conditions:
Proof. (i) and (ii) are straight forward
(iii) Using Cauchy-Schwarz inequality, we have
Definition 12. Let T1 and T2 be two TSFSs on a universe of discourse X = {x1, x2, x3, ⋯, xm}. The correlation coefficient between T1 and T2 is defined as
Theorem 13. We have two TSFSs T1 and T2, defined over the universe of discourse X = {x1, x2, x3, ⋯, xm}, and then the correlation coefficient between them must fulfill the following conditions:
- (a)
ρ(T1, T2) = ρ(T2, T1)
- (b)
−1 ≤ ρ(T1, T2) ≤ +1
- (c)
ρ(T1, T2) = 1, if T1 = cT2 for any constant c > 0, where T1 = cT2 means, , , , and
- (d)
ρ(T1, T2) = −1, if T1 = cT2, for any constant c < 0
Proof.
By using these two inequalities,
- (c)
Since for any constant c > 0, , , , and , hence
Therefore, .
Cov(T1, T2) = cVar(T2),
Hence, for any constant c > 0,
- (d)
Since A = kB, so for any constant c < 0, , , , and , therefore,
Hence,
Cov(T1, T2) = −cVar(T2), and
So, for c < 0,
3.1. Special Cases
- (a)
When n = 2, equation (15) reduces to the correlation coefficient of SFSs
- (b)
When n = 1, equation (15) reduces to the correlation coefficient of PFSs
- (c)
When n = q and IA(xi) = IB(xi) = 0, equation (15) reduces to the correlation coefficient of Q-ROFSs
- (d)
When n = 2 and , equation (15) reduces to the correlation coefficient of PyFSs
- (e)
When n = 1 and , equation (15) reduces to the correlation coefficient of IFSs
We also propose a weighted correlation coefficient between two TSFSs in the following, to deal with the practical problems and situations, with different objects having unequal importance, in which we consider their weights keeping in view their relative importance.
Definition 14. Suppose we have two TSFSs T1 and T2 defined over a universe of discourse X = {x1, x2, x3, ⋯, xm}, having a weight vector w = {w1, w2, w3, ⋯, wm} for its elements xi ∈ X, where wi ≥ 0 and , then their weighted correlation coefficient can be defined as
If all of the elements xi ∈ X have equal weights (i.e.w = {(1/m), (1/m), (1/m), ⋯, (1/m)}), then equation (26) can be reduced to equation (15).
From the above definitions, we derive the following proposition and theorem.
Proposition 15. If we have two SFSs and in a universe of discourse ρw(A, B) = 1, then their weighted covariance satisfies the following conditions:
- (i)
Covw(A, B) = Covw(B, A)
- (ii)
Covw(A, A) = Varw(A)
- (iii)
Proof. It can be proved like Proposition 11.
Theorem 16. For two TSFSs T1 and T2 defined over X = {x1, x2, x3, ⋯, xm}, then the weighted correlation coefficient between them must satisfy the following properties:
- (a)
ρw(T1, T2) = ρw(T2, T1)
- (b)
−1 ≤ ρw(T1, T2) ≤ +1
- (c)
ρw(T1, T2) = 1, if T1 = cT2 for any constant c > 0
- (d)
ρw(T1, T2) = −1, if T1 = cT2, for any constant c < 0
Definition 17. Let T be a TSFS defined over a universe of discourse X = {x1, x2, x3, ⋯, xm}, and then standard negation set for it be characterized as MN(T)(xi) = NT(xi), IN(T)(xi) = ΛT(xi) and NN(T)(xi) = MT(xi) for all and can be mathematically expressed as N(T) = {〈x, NT(x), ΛT(x), MT(x)〉/x ∈ X}, where and where is the degree of refusal for an element x ∈ X to be in SFSs T.
4. Comparison with Existing Methods
This section has been dedicated to establishing a full comparative study of the results acquired from our suggested scheme and other current approaches [42, 43] using numerical examples to demonstrate the advantage and novelty of our concept. Using these examples and numerical results, we also intuitively showed the advantages of our proposed method to address the shortcomings of the existing methods. From a statistical as well as intuitive point of view, if two phenomena or datasets are moving in the same direction, then they have a positive relationship, while if their movement is in the opposite direction, then their relationship would be inverse. The correlation coefficient is an important instrument in statistical theory for determining the degree and nature of a link between two processes or variables.
So, our definitions of correlation coefficient should accurately measure both the degree and the nature of relationship between two TSFSs.
We present some numerical examples in the following to show those characteristics of our proposed scheme and its comparison with existing methods.
Example 18. Let T1 and T2 be two TSFSs in a universe of discourse X = {x1, x2, x3}, characterized as
By using our new definition, the standard negation set of a TSFS T1 is SN(T1) = {〈x1, 0.7, 0., 0.9〉, 〈x2, 0.8, 0., 0.7〉, 〈x3, 0.9, 0., 0.5〉}.
The following Table 1 represents the comparative results of our proposed method and the existing definitions [37, 38] of the correlation coefficient when applied to the above TSFSs.
It is intuitively justifiable that the relationship between a TSFS and its negation set should have an inverse relation, with a negative correlation coefficient. It can be observed from the above Table 1 that none of the definitions [42, 43] can accurately measure the negative relationship between TSFS T1 and its standard negation set SN(T)1, rather reflecting a positive relation (0.3798, 0.3505) which cannot be possible, while our proposed correlation coefficient shows that there is a negatively strong correlation (-0.9865) between TSFS T1 and its standard negation set SN(T)1. Similarly, it can be seen that the corresponding membership grades of TSFSs T1 and T2 are moving in an opposite direction over the various point of the universe of discourse X = {x1, x2, x3}. So, they should have an inverse relationship between them, with a negative correlation coefficient. It can be clearly seen from the above Table 1 that the value of our proposed correlation coefficient (-0.9859) portrays that there exists a strongly negative correlation between TSFSs T1 and T2, as it should be statistically as well as intuitively. On the other and the existing methods show a positive correlation (0.4304., 0.4051) between them and overlook their inverse relationship. Hence, it shows the superiority and upper hand of our proposed method over the existing definitions of correlation coefficient for TSFSs.
Example 19. Let w = {0.22, 0.33, 0.45} be the weight vector for the elements of TSFSs in Example 18. After applying our proposed weighted correlation coefficient and the existing methods for weighted TSFSs [42, 43], we present the comparative performance results in the following Table 2.
It can also be clearly seen from Table 2 that our proposed method accurately measures the negative relationship between TSFSs T1 and its standard negation set SN(T1) having a weighted correlation coefficient -0.9862, rationally endorsing the strongly negative correlation between them. Contrarily, the existing definitions [42, 43] show a positive correlation (0.3153, 0.2850) between TSFSs T1 and its standard negation set SN(T1), which cannot be logically possible. Moreover, there should be a negative correlation between TSFSs T1 and T1 (as explained in example 18). Table 2 shows that our method demonstrates a negative correlation (-0.9859) between T1 and T1 while the existing methods cannot measure it accurately, as they show a positive correlation coefficient (0.3774, 0.3473), which cannot be intuitively possible, and these shortcomings reduce the reliability and accuracy of the existing definitions. Furthermore, the competency of our proposed method to address these drawbacks shows its effectiveness and advantages.
5. Application in Multicriteria Decision-Making and Pattern Recognition
This section has been designated to illustrate the application of our proposed idea of the correlation coefficient for TSFSs in the practical domain of MCDM and pattern recognition to show its suitability and usefulness. Pattern recognition and machine learning are versatile practices that have widely discussed in different disciplines such as [19, 20, 24–26, 47, 48]: these are some examples.
5.1. Application in MCDM
This subsection has been planned to show the application of our proposed scheme in the field of multicriteria decision-making by using a real MCDM problem, taken from [28]. To do so, we present here the implementation procedure of our proposed scheme for dealing with an MCDM problem. Suppose there is a set of “k” alternatives defined as TSFSs A = {A1, A2, ⋯, Ak} and a set of “m” evaluation features F = {F1, F2, ⋯, Fm}, having weight vector , 0 ≤ w ≤ 1, and , we define the ideal alternative A∗ as a TSFS by taking maximum membership value, minimum nonmembership, and minimum indeterminacy value for each evaluation criteria across all of the given alternatives, like an ideal positive solution in TOPSIS. . By applying our proposed correlation coefficient, we calculate the relationship between ideal choice A∗ and the given alternatives. Furthermore, by using the principle of the maximum correlation coefficient, we select an alternative as the best option, which has the highest correlation coefficient with an ideal alternative A∗.
Example 20. In order to strengthen the sales and purchase department, a firm had advertised the position of manager and selected 3 candidates after conducting an initial screening of the prospective candidates, who had applied. For the selection of the best candidate, the top management of the firm had considered the following three features to evaluate the aforementioned initially short-listed candidates. (i) F1: communication skills, (ii) F2: sense of responsibility, and (iii) F3: creativity. The candidates, evaluation criteria, and their weights have been presented in the form of sets as a set of alternatives, set of evaluation criteria, and weights vector, respectively, as follows.
A = {A1, A2, A3},F = {F1, F2, F3}, and w = {0.20, 0.35, 0.45}T. After a comprehensive evaluation of the candidates, their decision information has been presented in the following Table 3 as a T-spherical fuzzy decision matrix (TSFDM).
Alternatives | F1 | F2 | F3 |
---|---|---|---|
A1 | <0.8, 0.5, 0.4> | <0.7, 0.4, 0.4> | <0.3, 0.5, 0.4> |
A2 | <0.9, 0.2, 0.4> | <0.6, 0.3, 0.2> | <0.4, 0.1, 0.7> |
A3 | <0.5, 0.5, 0.5> | <0.8, 0.2, 0.3> | <0.6, 0.4, 0.3> |
It can be clearly seen that the decision information in the above Table 3 is TSFI n = 3. So, we apply our proposed scheme to find a correlation coefficient between the ideal candidate A∗ = {〈0.9, 0.2, 0.4〉, 〈0.8, 0.2, 0.2〉, 〈0.6, 0.1, 0.3〉} and the initially short-listed three candidates. The results have been furnished in the following Table 4.
ρw(A∗, Ai) | A1 | A2 | A3 |
---|---|---|---|
ρw(A∗, Ai) | 0.9703 | 0.8307 | -0.2530 |
It can be clearly seen that the candidate A1 has the highest degree of relationship with the ideal choice A∗ as compared to that of the other alternatives. Hence, using the principle of maximum correlation, we can say that the candidate A1 is the most suitable choice for the position of manager. This is in complete agreement with the decision, from where this example has been taken.
5.2. Application in Pattern Recognition
In this section, we show how we used our proposed theory to solve a pattern recognition problem and recognize an unknown pattern while taking into account the properties of several well-known patterns. We develop a relationship between the unknown sample pattern P∗ and the set of known patterns P = {P1, P2, ⋯, Pk} using our proposed correlation coefficient on the basis of a set of evaluation criteria. Taking motivation from the idea of the recognition principle [49], we developed an analogy as a degree of belongingness of the unknown pattern to any one of the known patterns to recognize its actual class as , where ρ(P∗, Pj) is the degree of relationship between P∗ and Pj that can be calculated using our proposed correlation coefficient. More will be the value of j∗, and more will be the closeness P∗ to the jth known pattern. A synthetic/fictitious numerical example has been given in the following to demonstrate the application of our method in pattern recognition problems.
Example 21. Let T1, T2, and T3 be three patterns which are known, and T be an unknown sample pattern, defined as follows in the form of TSFSs. These TSFSs have been defined over a space of points X = {x1, x2, x3}, and weights have been expressed in the form of a weight vector as w = {0.31, 0.36, 0.33}T. The characteristic values of the abovementioned patterns have been furnished in the following Table 5.
Patterns | x1 | x2 | x3 |
---|---|---|---|
T1 | <0.42, 0.42, 0.42> | <0.42, 0.42, 0.62> | <0.63, 0.63, 0.72> |
T2 | <0.42, 0.42, 0.62> | <0.52, 0.63, 0.82> | <0.72, 0.72, 0.82> |
T3 | <0.42, 0.52, 0.63> | <0.52, 0.63, 0.72> | <0.63, 0.72, 0.82> |
T | <0.42, 0.42, 0.42> | <0.42, 0.42, 0.52> | <0.63, 0.63, 0.72> |
By applying our proposed scheme n = 5, we measure the relationship between the unknown sample pattern T and the other three known patterns (T1, T2, T3) to identify its belongingness. The obtained results have been presented in the following Table 4.
Table 6 clearly shows that the unknown sample pattern T has the highest correlation coefficient with the known pattern T1 as compared to the other known patterns T2 and T3. Hence, using the principle of maximum belongingness , we can conclude that the unknown sample pattern T belongs to the known pattern T1.
ρw(T, Tj) | T1 | T2 | T3 |
---|---|---|---|
ρw(T, Tj) | 0.9566 | 0.5185 | 0.9548 |
6. Conclusions
The TSFS is a useful tool for displaying the degree of positive, neutral, and negative membership, as well as the information’s dependability owing to rejection membership. Major correlation coefficient approaches based on the unit interval [0, 1] have been defined in the literature. In accordance with the conventional correlation coefficient in statistics, our newly proposed technique correlation coefficient for T-spherical fuzzy sets is the best choice for dealing with positive, negative, and no correlation [-1, 1]. The mathematical derivations for the upper and lower bound of correlation have been given. To show the novelty, we compare our method with existing methods, and the results demonstrate the good aspect of the proposed idea. Furthermore, to demonstrate the feasibility, usefulness, and practical application, we applied our proposed scheme to solve technical and scientific problems of multicriteria decision-making and pattern recognition.
Conflicts of Interest
The authors declare that they have no conflicts of interest to disclose.
Authors’ Contributions
Conceptualization was contributed by WA, MA, SH, SSU, and IH. Methodology was contributed by WA, MA, AB, and RA. Validation was contributed by WA, MA, SH, SSU, AB, RA, and IH. Investigation and funding were contributed by RA and AB. Writing the original draft preparation was contributed by WA, MA, SH, SSU, AB, RA, and IH. Writing-review and editing was contributed by FU, MA, IH, SSU, and SH. Visualization was contributed by WA, AB, RA, SSU, and FU. Project administration was contributed by WA, SH, IH, SSU, FU, AB, and RA.
Acknowledgments
The authors are grateful to the Taif University Researchers Supporting Project number (TURSP-2020/36), Taif University, Taif, Saudi Arabia.
Open Research
Data Availability
The data used in this research can be obtained from the corresponding authors.