Volume 32, Issue 18 e5601
SPECIAL ISSUE PAPER

Minority oversampling based on the attraction-repulsion Weber problem

Ugo Fiore

Corresponding Author

Ugo Fiore

Department of Management and Quantitative Studies, Parthenope University of Naples, Naples, Italy

Ugo Fiore, Department of Management and Quantitative Studies, Parthenope University of Naples, 80132 Naples, Italy.

Email: [email protected]

Search for more papers by this author
First published: 17 November 2019
Citations: 4

Present Address:

Ugo Fiore, Via Gen. Parisi, 13, 80133 Naples, Italy

Abbreviations: SMOTE, Synthetic Minority Oversampling TEchnique; UCI, University of California at Irvine

Summary

Learning on imbalanced datasets, where one class is underrepresented, is problematic and important at the same time. On the one hand, a limited number of positive examples restricts the generalization ability of classifiers. On the other hand, often, the class of interest is such exactly because it is rare. The Synthetic Minority Oversampling TEchnique (SMOTE) is a preprocessing method that creates new synthetic examples by interpolating between neighboring instances. In this work, an enhancement to SMOTE is proposed, which characterizes synthetic instances as solutions of attraction-repulsion problems among the neighboring data points. Experimental evaluation shows an improvement in the positive predictive power of classification.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.