Volume 37, Issue 10 pp. 7200-7234
RESEARCH ARTICLE

Feature selection based on multiview entropy measures in multiperspective rough set

Jiucheng Xu

Jiucheng Xu

Engineering Lab of Intelligence Business & Internet of Things, Henan Province, Xinxiang, China

College of Computer and Information Engineering, Henan Normal University, Xinxiang, China

Search for more papers by this author
Kanglin Qu

Corresponding Author

Kanglin Qu

Engineering Lab of Intelligence Business & Internet of Things, Henan Province, Xinxiang, China

College of Computer and Information Engineering, Henan Normal University, Xinxiang, China

Correspondence Kanglin Qu, College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China.

Email: [email protected]

Search for more papers by this author
Xiangru Meng

Xiangru Meng

Engineering Lab of Intelligence Business & Internet of Things, Henan Province, Xinxiang, China

College of Computer and Information Engineering, Henan Normal University, Xinxiang, China

Search for more papers by this author
Yuanhao Sun

Yuanhao Sun

Engineering Lab of Intelligence Business & Internet of Things, Henan Province, Xinxiang, China

College of Computer and Information Engineering, Henan Normal University, Xinxiang, China

Search for more papers by this author
Qincheng Hou

Qincheng Hou

Engineering Lab of Intelligence Business & Internet of Things, Henan Province, Xinxiang, China

College of Computer and Information Engineering, Henan Normal University, Xinxiang, China

Search for more papers by this author
First published: 01 April 2022
Citations: 16

Abstract

The performance of the neighborhood rough set model in feature selection is limited by nonobjective parameter selection method, the uncertainty measures considered only from a single view, and high time cost caused by processing high-dimensional data. To solve the above problems, this study first defines the interclass boundary to granulate the samples in different classes, and three types of neighborhood concepts—negative perspective, neutral perspective, and positive perspective—are put forward based on different cognitive perspectives. Then, the concept of the multiperspective rough set model is developed. The most prominent feature of this model is the discovery of differences between classes from the given data, without any parameters. Second, by integrating the information theory and algebraic views under the multiperspective rough set model, multiview entropy measures are proposed to effectively measure the uncertainty in data. Moreover, a nonmonotonic feature selection algorithm based on the mutual information in the multiview entropy measures under the neutral perspective as the evaluation function of feature importance is designed to resolve the disadvantages of the algorithms based on the monotone evaluation function. Finally, Information Gain is introduced to preliminarily decrease the dimension of high-dimensional data sets to promote classification accuracy and reduce time consumption. The experimental results confirm that the proposed algorithm is efficient in eliminating noise and increasing classification accuracy.

CONFLICTS OF INTEREST

The authors declare no conflicts of interest.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.