Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset - PubMed Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 23;22(9):3246.
doi: 10.3390/s22093246.

Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset

Affiliations

Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset

Elsie Fezeka Swana et al. Sensors (Basel). .

Abstract

Data-driven methods have prominently featured in the progressive research and development of modern condition monitoring systems for electrical machines. These methods have the advantage of simplicity when it comes to the implementation of effective fault detection and diagnostic systems. Despite their many advantages, the practical implementation of data-driven approaches still faces challenges such as data imbalance. The lack of sufficient and reliable labeled fault data from machines in the field often poses a challenge in developing accurate supervised learning-based condition monitoring systems. This research investigates the use of a Naïve Bayes classifier, support vector machine, and k-nearest neighbors together with synthetic minority oversampling technique, Tomek link, and the combination of these two resampling techniques for fault classification with simulation and experimental imbalanced data. A comparative analysis of these techniques is conducted for different imbalanced data cases to determine the suitability thereof for condition monitoring on a wound-rotor induction generator. The precision, recall, and f1-score matrices are applied for performance evaluation. The results indicate that the technique combining the synthetic minority oversampling technique with the Tomek link provides the best performance across all tested classifiers. The k-nearest neighbors, together with this combination resampling technique yielded the most accurate classification results. This research is of interest to researchers and practitioners working in the area of condition monitoring in electrical machines, and the findings and presented approach of the comparative analysis will assist with the selection of the most suitable technique for handling imbalanced fault data. This is especially important in the practice of condition monitoring on electrical rotating machines, where fault data are very limited.

Keywords: Bayesian classification; Tomek link; imbalanced data; k-nearest neighbor; support vector machine; synthetic minority over-sampling sampling; wound-rotor induction generator.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
(a) Geometry of wound-rotor induction generator model with flux lines distribution; (b) Flux lines distribution with stator winding inter-turn fault indicated (by arrow); (c) External circuit of the investigated wound-rotor induction generator.
Figure 1
Figure 1
(a) Geometry of wound-rotor induction generator model with flux lines distribution; (b) Flux lines distribution with stator winding inter-turn fault indicated (by arrow); (c) External circuit of the investigated wound-rotor induction generator.
Figure 2
Figure 2
(a) Experimental layout; (b) Stator winding inter-turn fault; (c) Rotor winding inter-turn fault.
Figure 3
Figure 3
(a) Three-phase steady−state stator phase voltage shown for a portion of acquisition time used; (b) stator voltage phase U under healthy conditions.
Figure 4
Figure 4
The method for WRIG imbalanced data.
Figure 5
Figure 5
Simulation performance of NBC classification with SMOTE.
Figure 6
Figure 6
Simulation performance of NBC classification with T-link.
Figure 7
Figure 7
Simulation performance of NBC classification with SMOTE/T-link.
Figure 8
Figure 8
Simulation performance of SVM classification with SMOTE.
Figure 9
Figure 9
Simulation performance of SVM classification with T-link.
Figure 10
Figure 10
Simulation performance of SVM classification with SOMTE/T-link.
Figure 11
Figure 11
Simulation performance of k-NN classification with SMOTE.
Figure 12
Figure 12
Simulation performance of k-NN classification with T-link.
Figure 13
Figure 13
Simulation performance of k-NN classification with SMOTE/T-link.
Figure 14
Figure 14
Experimental performance of NBC classification with SMOTE.
Figure 15
Figure 15
Experimental performance of NBC classification with T-link.
Figure 16
Figure 16
Experimental performance of NBC classification with SMOTE/T-link.
Figure 17
Figure 17
Experimental performance of SVM classification with SMOTE.
Figure 18
Figure 18
Experimental performance of SVM classification with T-link.
Figure 19
Figure 19
Experimental performance of SVM classification with SMOTE/T-link.
Figure 20
Figure 20
Experimental performance of k-NN classification with SMOTE.
Figure 21
Figure 21
Experimental performance of k-NN classification with T-link.
Figure 22
Figure 22
Experimental performance of k-NN classification with SMOTE/T-link.

Similar articles

Cited by

References

    1. Khan M.U., Imtiaz M.A., Aziz S., Kareem Z., Waseem A., Akram M.A. System design for early fault diagnosis of machines using vibration features; Proceedings of the IEEE 5th International Conference on Power Generation Systems and Renewable Energy Technologies; Istanbul, Turkey. 26–27 August 2019.
    1. Spyropoulos D.V., Mitronikas E.D. A review on the faults of electric machines used in electric ships. Adv. Power Electron. 2013;2013:216870. doi: 10.1155/2013/216870. - DOI
    1. Salomon C.P., Ferreira C., Sant’Ana W., Lambert-Torres G., Borges da Silva L.E., Bonaldi E.L., Oliveira L., Torres B. A study of fault diagnosis based on electrical signature analysis for synchronous generators predictive maintenance in bulk electric systems. Energies. 2019;12:1506. doi: 10.3390/en12081506. - DOI
    1. Rehman A.U., Chen Y., Wang L., Zhao Y., Yonghong Y., Yonghong C., Tanaka T. Experimental research and analysis on rotor winding inter-turn circuit fault in DFIG; Proceedings of the IEEE International Conference on Condition Monitoring and Diagnosis; Xi’an, China. 25–28 September 2016.
    1. Sun W., Zhao R., Yan R., Shao S., Chen X. Convolutional discriminative feature learning for induction motor fault diagnosis. IEEE Trans. Ind. Inform. 2017;13:1350–1359. doi: 10.1109/TII.2017.2672988. - DOI