Volume 2020, Issue 1 8861942
Research Article
Open Access

Generative Adversarial Network-based Missing Data Handling and Remaining Useful Life Estimation for Smart Train Control and Monitoring Systems

Hyunsoo Lee

Corresponding Author

Hyunsoo Lee

School of Industrial Engineering, Kumoh National Institute of Technology, Gumi 39177, Republic of Korea kumoh.ac.kr

Search for more papers by this author
Seok-Youn Han

Seok-Youn Han

Urban Transit Research Group, Advanced Railroad Vehicle Division, Korea Railroad Research Institute, Uiwang, Republic of Korea krri.re.kr

Search for more papers by this author
Kee-Jun Park

Kee-Jun Park

Urban Transit Research Group, Advanced Railroad Vehicle Division, Korea Railroad Research Institute, Uiwang, Republic of Korea krri.re.kr

Search for more papers by this author
First published: 27 November 2020
Citations: 2
Academic Editor: Ladislav Routil

Abstract

As railway is considered one of the most significant transports, sudden malfunction of train components or delayed maintenance may considerably disrupt societal activities. To prevent this issue, various railway maintenance frameworks, from “periodic time-based and distance-based traditional maintenance frameworks” to “monitoring/conditional-based maintenance systems,” have been proposed and developed. However, these maintenance frameworks depend on the current status and situations of trains and cars. To overcome these issues, several predictive frameworks have been proposed. This study proposes a new and effective remaining useful life (RUL) estimation framework using big data from a train control and monitoring system (TCMS). TCMS data is classified into two types: operation data and alarm data. Alarm or RUL information is extracted from the alarm data. Subsequently, a deep learning model achieves the mapping relationship between operation data and the extracted RUL. However, a number of TCMS data have missing values due to malfunction of embedded sensors and/or low life of monitoring modules. This issue is addressed in the proposed generative adversarial network (GAN) framework. Both deep neural network (DNN) models for a generator and a predictor estimate missing values and predict train fault, simultaneously. To prove the effectiveness of the proposed GAN-based predictive maintenance framework, TCMS data-based case studies and comparisons with other methods were carried out.

1. Introduction

Railway infrastructure has been one of the essential infrastructures not only at a national level, but also across continents. In terms of ground cargo and freight transport, the railway system is the most important infrastructure. A number of research studies have focused on detections of aberrant situations in trains. For instance, unexpected failures in train components may catastrophically harm the passengers’ safety. Moreover, a maintenance delay may result in subsequent heavy delays in overall train schedule. Thus, maintenance frameworks for railway infrastructure have received significant attention. A number of existing research studies have proposed various railway maintenance frameworks and relevant applications for more reliable railroad operations.

Early railway maintenance frameworks are based on periodic time-based maintenance [1], which is still an effective technique for checking railway components. Monthly or quarterly inspection belongs to this type of maintenance. Recently, maintenance framework has evolved to “preventive maintenance” in contemporary railroad systems. Preventive maintenance is classified into “time-based maintenance” and “distance-based maintenance” in general. Most railway operators utilize both maintenance frameworks, simultaneously.

Moreover, the maintenance framework is evolving with the development of technologies of the fourth industrial revolutions. Of these technologies, Internet of Things (IoT) technology is the most relevant for enhancing railway maintenance. Fraga-Lamas et al. [2] summarized the utilization of IoT technologies in train maintenance. IoT-based embedded systems enable the detection of abnormal status of railway components in real time, where the signals are subsequently transferred to a secured database system. In general, most of the train systems have their own management systems, such as train control and monitoring system (TCMS) for storing various train data and for managing trains. Based on TCMS-based research studies [3, 4], TCMS is a system with control, communication, and management functions for all train platforms and applications. As the system collects operation and management-based data for trains and their connected cars, a huge amount of data can be collected and analysed. Several research studies [5, 6] used TCMS data for estimating energy consumption of trains or for controlling train doors safely. However, relatively fewer studies on predictive maintenance were carried out.

This study focuses on developing a new and effective predictive maintenance framework using TCMS data. In this study, remaining useful lives (RUL) of various train modules are predicted using a proposed deep learning method. In order to measure RUL of train modules, this study predicts time periods to the relevant trains’ faults and malfunctions. In this paper, this time to failure (TTF) for a certain train fault is defined a RUL of a certain fault. However, prerequisite conditions are necessary to handle data issues in TCMS. As TCMS data could include missing values, their handling mechanism must be embedded in a relevant RUL estimation framework.

This study applies a generative adversarial network (GAN) to handle missing values in TCMS. The following section provides relevant background knowledge and literature review. Section 3 examines TCMS data and relevant data issues. Sections 4 and 5 present a GAN-based predictive maintenance framework and its verifications using various numerical analyses, respectively.

2. Background and Literature Review

This study utilized TCMS data to estimate train component status and predict their RULs. The proposed framework is classified as a predictive maintenance framework in train systems. As discussed in the previous section, the maintenance paradigm in train transportation has converged with the technologies of the fourth industrial revolution. The time and distance-based maintenance frameworks have been combined with monitoring-based methods. Several sensing systems have been developed and installed for more detailed examinations of trains’ components. Sharma et al. [7] detected breakage of railway tracks using vibration sensors. Sireesha et al. [8] used a radio frequency-based method to detect rails’ broken status. These sensing systems have integrated with Internet of Things (IoT-) based frameworks. Lee [9, 10] developed various Industrial IoT (IIoT) systems to monitor abnormal manufacturing signals and estimate production performance indices in multiple supply chains. The detected signals are transmitted to a cloud server, where industrial big data analytics analyses the collected data and takes preventive measures for better production controls. These technologies and frameworks have been applied to various train systems and their relevant monitoring-based/condition-based maintenance. Hitachi [11] proposed Lumada IoT Platform© as a monitoring-based maintenance system for its railway system.

While various monitoring-based methods for detecting abnormal status of railway components have been introduced, deep learning methods and relevant data analytics have been integrated into predictive maintenance. Corman et al. [12] applied a data-driven method to estimate the remaining life of a light rail braking system in a train. McKinsey [13] suggested similar approaches to enhance rail operations using digital maintenance technologies. Atamuradov et al. [17] and Liden [18] summarized comprehensive overviews on railway infrastructure maintenance. Table 1 provides various time-based, monitoring-based, and data-driven maintenance frameworks and their applications.

Table 1. Existing time, monitoring, and data-driven maintenance applications in rail systems.
Existing research studies Target railway components Methods and characteristics Maintenance type
Time/distance Monitoring Data-driven
Faiz and Singh [14] Railway track
  1. (i) Detection of track geometry
  2. (ii) Usage of rail profile-based regression model
O
  
Sharma et al. [7] Railway (i) Vibration sensor-based estimation of railway breakage O
  
Shaikh et al. [15] Solid axle wheel sets
  1. (i) Installation of additional sensors (vibration sensors for capturing lateral and yaw dynamics)
  2. (ii) Vibration model-based simulation
O
  
Letot et al. [16] Railway track point machine (i) Degradation assessment and data-based RUL estimation O
  
Corman et al. [12] Train breaking system
  1. (i) Work, maintenance, and failure data-based reliability estimation
  2. (ii) Usage of Weibull distribution
O O O

As shown in Table 1, data driven analytics has been introduced for better rail maintenance. Among a number of data sets in a train framework, the TCMS data is the most comprehensive data, as it includes operation, parameter settings, and other information on train components. Figure 1 shows the various TCMS subcomponents that are installed in Korean trains and cars.

Details are in the caption following the image
Train control and monitoring system (TCMS) in Korean trains and cars. (a) Monitoring system in train control car. (b) Monitoring and control system in train driving modules. (c) Data acquisition system in each car.

In general, TCMS is an essential system for controlling electrical multiple units (EMU) in each train and car in a train system. Thus, control parameters and operation data are stored in TCMS. While TCMS is mainly used to control trains and cars, the usage of TCMS data for various purposes has been suggested. Table 2 shows various applications that use TCMS data. As shown in Table 2, most of the applications that use TCMS data have focused on monitoring-based maintenance.

Table 2. Applications that use TCMS data.
Research studies Applications TCMS data
Ito et al. [6]
  1. (i) Safe door operation
  2. (ii) Automatic power changer-based driver advisory system
(i) EMU functions in TCMS
  
Neil [19] (iii) Railway safety monitoring-based maintenance (ii) Transaction data in TCMS
  
Kim et al. [13] (iv) Analysis of train energy consumption considering driving patterns
  1. (iii) Driving time
  2. (iv) Train’s driving speed
  3. (v) Railway track data
  
Xu et al. [20] (v) Queuing theory-based maintenance cycle scheduling for an urban rail transit system
  1. (vi) Running distance, velocity, and mileage data
  2. Maintenance schedule
  
Shift2Rail project [21]
  1. (vi) Monitoring of cargo condition
  2. (vii) A conceptual and experimental project
  1. (vii) TCMS data
  2. (viii) Additional sensing systems (e.g., ultrasonic sensor and other wireless sensors)

While TCMS data have been used comparatively less with more advanced maintenance analytics, several industrial projects including Shift2Rail [21] have suggested predictive maintenance frameworks using TCMS data. However, these projects provide only conceptual frameworks or experimental-level demonstrations. In particular, big data analytics and more advanced data mining methods are seldom applied in TCMS-based predictive maintenance frameworks. To address this issue, this study proposes a new and effective predictive maintenance using deep learning methods and real-time TCMS data handling modules.

3. Missing Value Issues in TCMS Data for Predictive Maintenance Framework

The proposed predictive maintenance framework uses TCMS data for predicting RULs in a certain breakage. As shown in (1), the RUL (RULj,  jJ; J is a set of integers) of a certain breakage (j) is estimated using the TCMS data (X) and used as a main reference for setting up train and cars maintenance schedules. The function f(⋅) is modelled using a deep learning-based network architecture and is explained in the following section.
(1)

Table 3 summaries the TCMS data used in this study and the general specifications of the proposed RUL prediction framework.

Table 3. Specification of the proposed predictive maintenance framework.
Content Classification Issues
Data specification
  1. (i) Data source: TCMS data (2018.6∼2019.05)
  2. (ii) Data from the seventh line in subway system, Republic of Korea
Big data
  
TCMS data specification
  1. (i) Number of attributes: 2643 per one record
  2. Existence of a number of missing values in one record
  3. (ii) Data format: encrypted data
  1. (i) Data decryption is needed
  2. (ii) Missing value handling is needed
  
Fault/alarm data
  1. (i) Number of attributes: 56
  2. (ii) Data format: encrypted text data
(i) Data decryption is needed
  
Predictive maintenance framework
  1. (i) Data input: TCMS data
  2. (ii) Output: the estimated RUL
  3. (iii) Mechanism: GAN-based deep neural network

As shown in Table 3, the input data of the proposed predictive maintenance framework is the TCMS data. Figure 2(a) shows a part of the TCMS data, which is an encrypted data. In general, TCMS data is classified into two types: operation data (oper) and alarm data (arm) as shown in Figure 2(a). The TCMS data is stored with each car no., the date, and time. While the “oper” data describes information such as train identification and operation and other relevant train parameters, the recorded “arm” data includes various warning signals and relevant alarm codes. These alarm level information and other warning data are written using the predefined criteria, such as status levels of train components and other relevant sensor measuring ranges. Thus, “oper” data is used as input data, while the output of the proposed RUL estimation framework is driven by the “arm” data. If “arm” data can be predicted using a series of “oper” data, a real-time predictive maintenance can be applied. Hence, the proposed predictive maintenance framework uses “oper” data as input vector. The RUL variable is extracted from the “arm” data and fault/maintenance history. The fault/maintenance data clarifies the relationship among “arm” data and a certain train defect. RUL data is extracted from the “arm” data and its relationship to a certain defect is obtained using the mapping between both data.

Details are in the caption following the image
Encrypted TCMS data sample and the developed conversion tool. (a) A TCMS data sample and its encrypted texts. (b) The decryption and conversion software program developed in this study.
Details are in the caption following the image
Encrypted TCMS data sample and the developed conversion tool. (a) A TCMS data sample and its encrypted texts. (b) The decryption and conversion software program developed in this study.

However, TCMS data cannot be directly used owing to their encrypted formats and missing value issue. As shown in Figure 2(a), both types of data are encrypted for various reasons, such as data protection, data size reduction, and sensor driver encryption. This indicates that the data need to be decrypted prior to any further data processing.

To decrypt the data, hex data-based decoding is performed as an essential prerequisite procedure using the encryption rule for TCMS. Then, the hex-formatted data are converted into number-formatted data for subsequent deep learning processing. Figure 2(b) shows a program developed in this study, which converts hex-formatted data to number-formatted interpretable data. The “oper” data has 2,643 attributes, which include train identification, operation time, station time, velocity, sensor data, and other information. The “arm” data has 56 attributes such as system failure, warning, and alarm information.

The data conversion process is performed as a pre-processing step. However, the main issue is the frequent occurrence of missing values in the TCMS data. Missing values in the TCMS data exist due to various reasons (e.g., sudden breakdown of sensors, malfunction of devices, and/or sudden changes of electric current). Missing input and output data issues have to be resolved prior to training a deep learning-based predictive maintenance model. This is one of the most common issues in manufacturing [2224], transportation, and other data handling processes. Table 4 summaries various methods for handling missing values.

Table 4. Various methods for handling missing values.
Methods for handling missing values Detailed methods Related research studies
Removals of data sets with missing values
  1. (i) Ignorance of records with missing values
  2. (ii) Data without missing values are used only for an input vector
A number of research studies including [25]
  
Estimation of missing values (I)
  1. (iii) Estimation of missing values using mean, MCMC, and nearest neighbours
  2. (iv) Estimation considering only the attribute that has missing values
Moldovan et al. [26]
  
Estimation of missing values (II), multiple imputation
  1. (v) Data estimation considering overall attributes’ dependency
  2. (vi) Missing values estimation using regression and other statistical methods
  1. Hruschka et al. [27]
  2. Yuan [28]
  
Generation of a new data set
  1. (vii) Generative adversarial network- (GAN-) based data generation
  2. (viii) Replacement of the data having missing values with newly generated data
  1. Kim and Lee [23, 24]
  2. Douzas and Bacao [29]

As shown in Table 4, most of the early relevant research studies tried to remove records with missing values. While these methods provided complete data for input, it could result in the lack of a training data. However, this limitation can be overcome by estimating missing values. The simplest estimation method is to consider data in the same attribute and then extract a probability density function using the data associated with the same attribute. For instance, the Gaussian mixture model (GMM) can be applied for capturing the characteristics of the data [30]. Then, a random number generated using the reasoned GMM model is used as an estimator for a missing value. However, in this method, relationships among other attributes are ignored. To address this issue, another estimation method, missing value estimation method, which considers the overall dependencies among the data attributes can be used. In general, multivariate statistical approaches, regression model, or multivariate nearest neighbour methods [31] can be applied for describing data dependencies. Then, the missing values can be estimated and substituted using Markov Chain Monte Carlo (MCMC)-based random number generation methods.

While these methods have worked only for estimating missing values, the recent methods tend to generate not only missing data, but also overall data. The generative adversarial network (GAN) is the representative method among them. In industrial big data, one of the issues is the lack of certain fault data. The lack of certain types of data may lower training performances of applied learning mechanisms. While the initial purpose of adversarial network (G) in GAN is to increase the classification ability of a classification network (D), a well-trained adversarial network can generate data which fits to an objective. Table 5 summarizes the learning algorithm of GAN.

Table 5. General learning algorithm of GAN.
Input/parameters

(i) Training data: X

(ii) Learning epoch: k1/· Training epoch: k2

(iii) Step length: η

(iv) Mini-batch size: m

Output
  1. (v) Optimal parameters for G:
  2. (vi) Optimal parameters for D:
  1. Learning
  2. algorithm
  1. for 1:k1
  2.  Initialize θG, θD
  3.  for 1:k2
  4.   mini-batch partitioning from X, X = {x1,   … ,  xm}
  5.   calculate gradient for D and update  θD
  6.   
  7.   Generate random vector, Z = {z1, …,  zm}
  8.   Calculate gradient for G and update θG
  9.   
  10.  end
  11. end
The gradients for G and D are driven using
(2)

As shown in (2), fD(X) and fG(Z) denote and EZ[log(1 − D( G(Z)) )], respectively. Several research studies applied GAN for generating fault data in automotive [22], semiconductor [23], and steel production processes [24]. This study used GAN to handle the missing value issues in the TCMS data as well as to predict RULs in train components.

4. Generative Adversarial Network-Based Predictive Maintenance Framework

The proposed RUL estimation framework predicts the RUL of a certain defect or a malfunction. To focus on major malfunctions during train operations, 49 defects are extracted from the “arm” data in TCMS based on defect frequency and severity. Figure 3 shows the defect frequency. The records are gathered by Korean Railroad Research Institute.

Details are in the caption following the image
Defect frequency (each fault code is designated by Korean Railroad Research Institute).

Each defect’s RUL is calculated using the TCMS “arm” data and relevant fault/maintenance data. The “arm” data includes the identification number, occurrence date, and other relevant information for each defect. Figure 4 shows an occurrence history of a specific defect (defect code no. 442– fault of electronic control unit (ECU)). As shown in Figure 4, the X and the Y axes indicate the occurrence date and defect code, respectively.

Details are in the caption following the image
RUL extraction for a specific defect (code no. 442, fault of ECU) from the TCMS “arm” data.
From the information, Faulti,j(t) is extracted. Faulti,j(t) indicates the jth occurrence time of the ith defect in the TCMS data. Then, inter-defect time, RULi,j(t) , is calculated using
(3)
As shown in (3), Faulti,j(t) denotes the jth sensing time of the ith defect in TCMS, and RULi,j(t)  indicates the inter-defect time in day between the jth occurrence and (j−1)th occurrence of the ith defect. The obtained RUL is used as output data for prediction. Subsequently, the RUL is predicted using TCMS’s operation data (X(t)) and a deep learning framework as shown in (4).
(4)
(5)

As denoted in (4) and (5), xi(t) is the value of the ith attribute at time t in the TCMS “oper” data, wi is the weight value of xi(t), bi is the ith bias, and fi is the ith activation function.

While a general predictive maintenance estimates RUL using (4), the equation cannot be directly applied in the TCMS-based data mining owing to the missing value issue discussed in the previous section. To overcome this issue, (4) is converted by introducing a GAN in the RUL estimation. Figure 5 shows the overall RUL prediction framework using GAN.

Details are in the caption following the image
GAN-based missing value handling and RUL prediction framework.
As shown in Figure 5, the proposed RUL prediction framework consists of two phases: learning stage and prediction stage. The main objective of the first stage is to obtain a discriminator (D) using a deep learning-based architecture. The proposed framework applies a deep neural network for the discriminator. The discriminator uses a complete TCMS data (X(t)), where X(t) is generated using X(t) and a generator (G) in the proposed GAN. X(t) is complete data, while X(t) is a data set with missing values. As discussed in the previous section, the TCMS data (X(t)) of a certain train’s fault may have missing values owing to various reasons. As shown in (6), these missing values are estimated initially using multivariate GMM, p(θ|xi) where θ is the extracted RUL data, xi is the ith attribute’s data over the entire time considering xi(t), and |oper| is the data size of xi.
(6)
where .

The missing value is generated using Gibbs sampling method [30, 31]. The initial completed data (X(t)) is inputted to a generator G(⋅). The output of G is the regenerated data (X(t) ).  The generator has another deep neural network architecture similar to discriminator D(⋅) as shown in (2). Then, D generates X(t) that satisfies (1) better than the previous estimated X(t). The updated X(t) is then inputted to D(X(t)).

As presented in Figure 5, the GAN process shows these learning processes. Both network models have the deep learning parameters θ(D) and θ(G) as weight vectors. The updating procedures θ(D) are achieved using the gradients indicated in (7)–(9).
(7)
As shown in (7), V denotes fD(X′′(t)) + fG(X(t)).
(8)
vi denotes (wi · yi−1) + bi, and yi is the output of the ith deep learning layer, ϕ(vi).
(9)

θ(G) is updated using the same procedures. Finally, the learned D(⋅) is obtained after the overall learning iterations.

In the second phase (prediction stage), the RUL of a certain defect is estimated using real-time “oper” data. While real-time data (X(t)) may have missing values, the RUL is estimated successfully with D(X(t)). The proposed GAN-based RUL prediction framework considers data with various missing values that exist frequently in TCMS data. While TCMS may generate data with missing values due to various issues, the proposed framework is considered an effective predictive maintenance framework for handling missing values. The following section proves the effectiveness of the proposed framework using case studies and TCMS data analyses.

5. Verification and Analysis of GAN-Based RUL Prediction Framework

To prove the effectiveness of the proposed TCMS-based predictive maintenance framework, this section provides prediction performances of several train faults and compares prediction accuracies with other methods. As explained in the previous section, each fault type’s RUL was estimated using its GAN-based framework.

The prediction accuracy and analyses were performed using the data of the Korean Railroad Research Institute (KRRI), which is a Korean government-funded railroad institute. The TCMS data from June, 2018, to May, 2019, in Seoul Metropolitan Subway were used as training and testing data. Sixteen defects were selected to predict their RULs. Table 6 provides the fault types and their information. The fault code ID and other relevant information were recorded in the TCMS data of the Seoul Metropolitan Subway.

Table 6. Fault types and their information for RUL predictions.
No. Fault code ID Fault information and relevant location
1 31 LIU1 communication error in TC1
2 32 LIU2 communication error in TC1
3 34 LIU2 communication error in TC0
4 38 LIU1 hardware malfunction in TC0
5 39 LIU2 hardware malfunction in TC
6 231 SIV inverter malfunction
7 434 Break malfunction
8 442 ECU malfunction
9 635 TC MFB card/ATC vital malfunction
10 636 Tachometer error
11 640 Main ATC hardware malfunction
12 641 Secondary ATC hardware malfunction
13 647 FSB/ATC error
14 669 ATO-ATC communication error
15 670 ATO-ATC 1 communication error
16 684 ATC DBAU hardware error

To prove the verification using the provided GAN-based missing value estimation, the proposed method was compared with the other three existing methods: (1) ARIMA-based RUL estimation, (2) estimation with pruning of missing values, and (3) RUL prediction using mean-value estimation. Table 7 summaries the architecture, characteristics, and parameters of the proposed method and the three existing methods.

Table 7. Four RUL prediction methods.
RUL prediction method GAN-based RUL estimation (the proposed method) ARIMA-based RUL estimation RUL estimation using “missing value pruning” RUL estimation with “mean-value estimation” of missing values
Missing value handling mechanism O (GAN-based data generation) X (removal of records with missing values) X (removal of records with missing values) O (mean-value estimation)
  
RUL estimation method Classification using GAN ARIMA-based RUL estimation Deep neural network Deep neural network
  
Detailed parameters Refer to Table 8 ARIMA (6, 2, 5)
  1. (i) Learning epoch: 1000
  2. (ii) Learning rate: 10−3
  3. (iii) Number of layers:10
  4. (iv) Used activation functions
  5. =(leaky RU for final layer, Sigmoid for layers #1−#9)

The GAN architecture and other relevant parameters of the proposed method are provided in Table 8. As mentioned in the previous section, the GAN architecture varies for every defect. The parameters are provided in Table 8.

Table 8. Detailed architecture and relevant parameters of the GAN-based RUL estimation (case for fault code ID 31).
Classification Detailed architectures
General learning parameters
  1. (i) Learning epoch: 1000
  2. (ii) Learning rate: 10−3
  
Discriminator (D(⋅))
  1. (i) Number of Layers:10
  2. (ii) Number of nodes in each layer
  3. =(1, 50, 100, 200, 500, 1500, 2500, 3000, 3500, 2653)
  4. (iii) Used activation functions
  5. =(leaky RU for final layer, Sigmoid for layers #1−#9)
  
Generator (G(⋅))
  1. (i) Number of layers: 7
  2. (ii) Number of nodes in each layer
  3. =(2653, 2700, 2800, 3000, 3200, 3500, 2653)
  4. (iii) Used activation functions: Sigmoid function for each layer

The parameters shown in Tables 7 and 8 were determined by applying numerical tests on the provided TCMS data. Figure 6 shows the training and test accuracies of the proposed RUL prediction for fault code ID 31 (LIU1 communication error in TC1).

Details are in the caption following the image
The training and the test accuracies of the proposed RUL prediction (for fault code ID 31). (a) The training accuracy. (b) The test accuracy.
Details are in the caption following the image
The training and the test accuracies of the proposed RUL prediction (for fault code ID 31). (a) The training accuracy. (b) The test accuracy.
As shown in Figure 6, the proposed method performed 99.9% and 83.5% for the training and test accuracies, respectively. The accuracy was calculated using (10) and (11). The root mean squared error (RMSE(RUL)i) for a certain type (i) of RUL (RULi) was used as a test metric, where is the jth predicted value using the proposed framework and RULi,j is the jth original RUL value, and n is the size of the test data.
(10)
(11)

The test data was sampled from the original TCMS data. As the fault occurrence was very low, the amount of test data was limited. The data-based numerical tests were carried out by comparison with the other methods. Figure 7 shows the test accuracies using the four methods: the proposed method and the other three benchmarking methods shown in Table 7.

Details are in the caption following the image
Test accuracies using the four benchmarking methods.

As shown in Figure 7, the proposed method had the highest accuracy compared with the other existing methods. Table 9 provides the test accuracy for each method.

Table 9. Test accuracy for each RUL estimation method (case for fault code ID 31).
RUL prediction method GAN-based RUL estimation (the proposed method) ARIMA-based RUL estimation (%) RUL estimation using “missing value pruning” (%) RUL estimation using “mean-value estimation” of missing values (%)
Test accuracy 83.5% 56.7 69.7 76.3

The numerical analysis indicates that the missing value issue was critical for the fault prediction using the TCMS data. In addition, the estimation of the missing values strongly influenced the RUL predictions. Using the proposed framework, the RUL prediction system for the 16 train faults was developed as shown in Figure 8. The software program was implemented using MFC© and MATLAB© on Windows 10©. Figure 9 shows the test accuracies of the various TCMS faults.

Details are in the caption following the image
The implementation of GAN-based missing value handling and RUL prediction framework.
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).
Details are in the caption following the image
RUL prediction results using the proposed method. (a) Fault code ID 34 (LIU2 communication error in TC0). (b) Fault code ID 231 (SIV inverter malfunction). (c) Fault code ID 434 (break malfunction). (d) Fault code ID 635 (TC MFB malfunction). (e) Fault code ID 640 (main ATC hardware malfunction LIU2). (f) Fault code ID 669 (ATO-ATC communication error).

As shown in Figure 9, the proposed framework and its implemented software resulted in RUL predictions of over 82.07% for all TCMS fault types. Table 10 shows the test accuracy of the proposed framework for each defect in the TCMS data.

Table 10. Test accuracy for each defect using the proposed framework.
Fault code id. Test accuracy Fault code ID Test accuracy Fault code ID Test accuracy Fault code ID Test accuracy (%)
31 83.50 32 88.08 34 88.94 38 82.47
39 86.98 231 99.91 434 96.75 442 95.68
635 85.24 636 85.60 640 92.21 641 89.15
647 93.14 669 97.56 670 82.07 684 80.46

6. Conclusions and Further Study

The transportation maintenance has gained substantial attention owing to its significance in societies. This study focused on the RUL prediction-based railway maintenance framework. While initial railway maintenance concentrated on periodic maintenance framework such as predefined time-based maintenance or distance-based scheduling, railway maintenance has evolved to condition-based maintenance owing to the advancement in monitoring devices and information technologies. This framework detects abnormal status using state-of-the-art sensors in a train system. The monitored signals are transferred to a server, TCMS.

This study proposes a new and effective RUL prediction framework using TCMS data. In general, TCMS data are classified into operation data and alarm data. To predict the remaining life of a certain fault or malfunction, this paper selected 16 faults based on their significance and severity. A deep learning-based mechanism was developed for each fault. Firstly, RUL of the target train fault was extracted using the TCMS alarm data. Then, the data was used as the predicted output of the proposed deep neural network. However, the system has a critical issue, which is common in most sensor-based systems: the existence of missing values. Existence of missing values in TCMS data could be due to various reasons such as sensor malfunction and low life of monitoring modules. Among several estimation methods for replacing missing values, this study used a GAN model to estimate missing values and predict RULs, simultaneously. The developed GAN framework can generate new data that cover missing values using the prediction objectives. Initially, the missing values are estimated using GMM and the estimated data are refined with the proposed GAN framework. In addition, the discriminator in the GAN model has better predictive performances in generating more accurate data. The effectiveness of the proposed maintenance framework was investigated by comparing it with other existing methods. The proposed framework is a new and effective train predictive maintenance framework that addresses missing value issue and predicts fault detection in real time.

For further studies, various optimization and metaheuristics methods can be applied to the proposed framework. As TCMS data is classified as big data, its learning could take longer time. Moreover, the framework requires comparatively higher computational burden. While the proposed framework uses two deep neural network models for a generator and a discriminator in its GAN module, it is expected that application of several optimization methods could reduce learning time and computation burden. In addition, prediction of each train fault requires a different GAN-based framework. While it has an advantage focusing on the defined fault, real-time prediction may require significant computational burden. To resolve this issue, a new architecture for multiple-fault prediction will be considered in future studies.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was supported by The Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Republic of Korea (grant number: NRF-2018R1D1A3B07047113), and by a research grant from R&D Program of the Korea Railroad Research Institute (KRRI), Republic of Korea.

    Data Availability

    The used data is supported by Korea Railroad Research Institute.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.