Volume 32, Issue 2 pp. 228-242
Research Article

Simple outlier labeling based on quantile regression, with application to the steelmaking process

Ruggero Bellio

Corresponding Author

Ruggero Bellio

Department of Economics and Statistics, University of Udine, Italy

Correspondence to: Ruggero Bellio, Department of Economics and Statistics, University of Udine, Via Tomadini 30/A, I-33100 Udine, Italy.

E-mail: [email protected]

Search for more papers by this author
Mauro Coletto

Mauro Coletto

IMT Institute for Advanced Studies, Lucca, Italy

CNR - ISTI, Pisa, Italy

Search for more papers by this author
First published: 03 November 2015
Citations: 1

Abstract

This paper introduces some methods for outlier identification in the regression setting, motivated by the analysis of steelmaking process data. The proposed methodology extends to the regression setting the boxplot rule, commonly used for outlier screening with univariate data. The focus here is on bivariate settings with a single covariate, but extensions are possible. The proposal is based on quantile regression, including an additional transformation parameter for selecting the best scale for linearity of the conditional quantiles. The resulting method is used to perform effective labeling of potential outliers, with a quite low computational complexity, allowing for simple implementation within statistical software as well as commonly used spreadsheets. Some simulation experiments have been carried out to study the swamping and masking properties of the proposal. The methodology is also illustrated by some real life examples, taking as the response variable the energy consumed in the melting process. Copyright © 2015 John Wiley & Sons, Ltd.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.