Early View
Paper

Enhanced Diagnosis of Parkinson's Disease Using XGBoost†

Thi-Nhu-Quynh Nguyen

Thi-Nhu-Quynh Nguyen

Non-member

Department of Physics and Computer Science, Faculty of Physics and Engineering Physics, University of Science,  Ho Chi Minh City, Vietnam

Vietnam National University, Ho Chi Minh City, Vietnam

Search for more papers by this author
Hoang-Thuy-Tien Vo

Hoang-Thuy-Tien Vo

Non-member

Department of Physics and Computer Science, Faculty of Physics and Engineering Physics, University of Science,  Ho Chi Minh City, Vietnam

Vietnam National University, Ho Chi Minh City, Vietnam

Search for more papers by this author
Tuan Van Huynh

Corresponding Author

Tuan Van Huynh

Non-member

Department of Physics and Computer Science, Faculty of Physics and Engineering Physics, University of Science,  Ho Chi Minh City, Vietnam

Vietnam National University, Ho Chi Minh City, Vietnam

Correspondence to: Tuan Van Huynh. E-mail: [email protected]

Search for more papers by this author
First published: 19 July 2025

†This research was presented at the ICEBA 2024 International Conference (The 5th International Conference on Engineering, Physics, MEMS-Biosensors, and Applications), co-organized by VNUHCM-University of Science, Tohoku University, and Mien Tay Construction University.

Abstract

Parkinson's disease (PD) affects over 10 million individuals globally, making it one of the most common neurological disorders. Despite its prevalence, no definitive cure or therapy exists to halt its progression. PD symptoms majorly influence patients' everyday lives, so prompt identification is essential. Because the disease starts in the brain, we used electroencephalography (EEG) data in our research. We used the publicly accessible dataset ‘EEG: Simon Conflict in Parkinson's’, which consists of EEG recordings from 28 people with PD (ON–OFF medication) and 28 healthy people (CTL). Due to the sensitivity of EEG signals, noise components were removed using Independent Component Analysis (ICA) combined with the IClabel model. Cleaned signals were reconstructed and analyzed into five primary frequency bands: delta, theta, beta, alpha, and gamma with statistical features. Boosting methods from the ensemble algorithm family were applied to evaluate classification performance. The classification results are presented for two labels (healthy individuals and PD) and three (healthy individuals, PD in ON medication, and PD in OFF medication). The XGBoost model achieved the best classification performance, achieving high accuracy, sensitivity, and specificity within a reasonable computation time. The XGBoost model combined with ICA-ICLabel achieved 99.71% accuracy in PD-CTL classification and 91.35% accuracy in three-class classification (CTL-ON-OFF). © 2025 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.