Volume 37, Issue 10 pp. 7736-7766
RESEARCH ARTICLE

Learning robust graph for clustering

Zheng Liu

Zheng Liu

College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber-Systems and Control, State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou, China

Search for more papers by this author
Wei Jin

Corresponding Author

Wei Jin

College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber-Systems and Control, State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou, China

College of Control Science and Engineering, Huzhou Institute of Zhejiang University, Huzhou, China

Correspondence Wei Jin, Research Center for Analytical Instrumentation, Institute of Cyber-Systems and Control, State Key Laboratory of Industrial Control Technology, Zhejiang University, 310027 Hangzhou, China.

Email: [email protected]

Search for more papers by this author
Ying Mu

Ying Mu

College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber-Systems and Control, State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou, China

Search for more papers by this author
First published: 16 May 2022
Citations: 5

Abstract

Graph is a popular technique to explore the structure of data. Many related algorithms directly construct graphs based on the original data. Actually, the samples collected in real life usually contain noise. Besides, some unimportant features probably exist in high-dimensional data. Therefore, this way cannot assure a high-quality graph and furthermore brings some adverse influence to the following tasks. In this paper, we incorporate robust graph learning and dimensionality reduction into a unified framework which also seamlessly integrates the clustering task. On the basis of the framework, Euclidean distance-based robust graph (EDBRG) and self-expressiveness-based robust graph (SEBRG) are presented. Both EDBRG and SEBRG contain clustering information from which the clustering results can be obtained directly. By projecting the original data into a discriminative subspace where the negative effect of redundant features and noise is removed, EDBRG and SEBRG are informative, robust, and sparse. During the whole mapping process, the main energy of data is preserved. Finally, some data sets are adopted to test the performances of EDBRG and SEBRG. Extensive experiments illustrate that the proposed methods have many advantages for the task of clustering, comparing with the state-of-the-art algorithms.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are available from the corresponding author upon reasonable request.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.