1. Introduction
Fault diagnosis is an important part of rotating machinery equipment health management and is of great significance to the healthy and safe operation of manufacturing equipment [
1]. As a core component of rotating machinery, the engine’s complex structure brings a high cost to the disassembly and assembly of the engine. Rolling bearings, as their internal core components, have a harsh working environment [
2]. Once a fault occurs, it will bring huge economic losses or even casualties, so frequent testing is required [
3,
4]. However, frequent disassembly and assembly inspections will result in higher equipment maintenance costs. With the continuous development of information technologies, it has become a research hotspot in the field of manufacturing to deploy computer technologies to carry out non-disassembly fault diagnosis on rolling bearings of rotating machinery equipment to reduce equipment maintenance costs [
5].
Fault diagnosis can be realized by analyzing the various sensing signals during engine bearing operation. The types of signals include vibration signals, acoustic emission signals and current signals, etc. [
6,
7,
8]. The most-commonly used signal acquisition method is the acquisition of vibration signals from bearing rotation through the installation of acceleration sensors. A summary of the literature showed that intelligent fault diagnosis solutions mainly include model-based methods, empirical-based methods, traditional machine-learning-based methods, and deep-learning-based methods [
9]. Among them, the model-based approach requires an accurate physical model, a large amount of a priori knowledge, and an accurate understanding of the engine system structure and parameters, while the empirical approach requires a priori knowledge for judgment and reasoning, neither of which is suitable for non-domain experts [
10]. The main methods used in the domain of computing are traditional machine learning methods and deep-learning-based methods. Although traditional machine learning methods have been widely used and researched for many years [
11,
12,
13], they have some drawbacks. First, the vibration signal extraction method based on traditional machine learning relies heavily on manual work, requiring professional experience and knowledge. The process is complex, and the workloads are heavy, which makes it extremely difficult to meet the actual needs of bearing fault diagnosis [
14]. In addition, traditional machine learning methods are shallow learning with a limited ability to extract nonlinear features. As deep learning techniques are becoming more mature, their powerful feature extraction and representation capabilities offer the possibility of separating bearing state signals in complex situations.
Deep learning models commonly used in fault diagnosis include convolutional neural networks [
15], recurrent neural networks [
16], etc. In particular, convolutional neural network models are extensively used in fault diagnosis due to their powerful feature extraction and scalability, and the various techniques adopted by CNNs can more effectively extract the characteristics of bearing faults. Among them, one common technique is to broaden the local perceptual field of the convolutional layer, which allows the network to better capture local features and improve the accuracy of feature extraction. In addition, CNNs utilize weight-sharing techniques, where the same weight parameters are distributed to different locations, which allows the network to simplify the parameters and train faster and learn the features of bearing faults more effectively. The integration of these techniques allows the CNN to extract bearing fault features more quickly and accurately, leading to efficiency promotion for fault diagnosis. For example, Reference [
17] proposed the MA1DCNN network to improve fault diagnosis accuracy by adaptively correcting the features of each layer in the network. Reference [
14] proposed a novel GNR-based end-to-end CNN model to enhance network fault diagnosis. Reference [
18] proposed fault diagnosis by using wide convolutional kernels. Reference [
19] used normalized diagnostic feature maps and convolutional neural networks for bearing fault diagnosis. Reference [
20] proposed a lightweight CNN framework that requires fewer parameters to achieve high accuracy.
Although the above methods have achieved satisfactory diagnostic accuracy, the data tested in these methods are mostly “clean” data collected in laboratories. However, at the actual site, bearings operate in harsh environments with high temperatures, high pressures, and high loads, which makes it extremely prone to fault. Therefore, regular maintenance is necessary to ensure their optimal performance. However, the signals collected by the sensors contain a large number of vibration signals from other mechanical components, and a large amount of ambient noises accumulate during the transmission process. According to the central limit theorem, we assumed that the vibration sources of each element are independent, each vibration source will generate a certain random signal, and the superposition of independent signals on each other will lead to an increase in the variance of the signal, thus tending to a Gaussian noise signal. The signal is drowned out when the Gaussian signal is strong enough, resulting in an insufficient signal-to-noise ratio [
21]. The above features make it more challenging to extract bearing vibration signals and reduce the accuracy of bearing fault diagnosis.
To improve the noise immunity of convolutional neural networks, different CNN-model-based denoising methods have also been proposed by researchers. A method based on wavelet transform (WT) and an improved residual neural network was proposed by [
9], but its use of SVD to improve the pooling layer made it less efficient. Reference [
22] proposed combining PCA theory with deep learning technology and applying it to a high-speed rail system for early fault diagnosis.Reference [
23] proposed a deep learning method for bearing fault diagnosis by superimposing residual null convolution, but it only considered a single load situation and a high signal-to-noise ratio, which cannot meet the requirements of practical complex environments. Reference [
24] proposed a model with high fault diagnosis accuracy based on a working mechanism of soft thresholding and global context, but its sharing of a threshold value for all channels led to ignoring the possibility of different amounts of noise features in different channels.Reference [
25] proposed a deep transient feature learning method that forms a training dataset by simulating the underlying signals of different pulse wavelet bases and learns to anticipate repetitive transient pulse features during the training process, which in turn constructs a mapping of noisy TFDs to clean TFDs; however, the method is only used to remove noise, and there is a lack of fault diagnosis. Reference [
26] presented a hierarchical-branch-based bearing fault diagnosis method using convolutional neural networks (CNNs) and conducted experiments in the SNR range of 2 dB to 12 dB, achieving good results. Reference [
27] proposed a bearing fault diagnosis method based on multi-granularity information fusion, which enhanced robustness against noise. However, the experiments regarding the noise environment were relatively simple. Reference [
28] introduced a bearing fault diagnosis method under noisy backgrounds using adaptive thresholding, achieving high diagnostic accuracy; however, the model complexity was high, and the designed experiments did not include comparisons under low signal-to-noise ratios. Reference [
29] proposed a novel error cost function and a structure adaptive algorithm based on stacked denoising auto-encoders. However, it only verified noise accuracy at higher signal-to-noise ratios.
Therefore, it remains a major challenge to realize a bearing fault diagnosis method under noisy conditions. Due to the presence of noise, fault diagnosis can suffer from signal distortion, decreased signal-to-noise ratios, and an increased risk of misdiagnosis. Therefore, effective noise suppression and feature enhancement strategies are necessary for accurate fault diagnosis. However, existing noise suppression and feature enhancement methods face limitations in addressing the impact of noise on fault diagnosis. These limitations include limited noise suppression effectiveness and potential feature loss. To address the above problems, this paper proposes a bearing fault diagnosis method based on an improved dilated convolutional neural network to achieve feature extraction and fault diagnosis of bearing vibration signals in noisy environments. The proposed method is consistent with the research trend in the field of bearing fault diagnosis, and has both theoretical significance and application value. In this paper, we designed a bearing fault diagnosis method consisting of a short-time Fourier transform and MAB-DrNet. The original signal was converted into a time–frequency map by the short-time Fourier transform, and then, the proposed MAB-DrNet was used for feature extraction and bearing fault diagnosis. The main contributions of this paper are as follows:
An innovative bearing fault diagnosis model, MAB-DrNet, was designed for accurately determining the health status of bearings in noisy environments.
A residual network based on dilated convolution was designed, which increased the perceptual field of the convolution kernel and improved the accuracy of bearing fault diagnosis by using dilated convolution instead of conventional convolution.
We propose the MAB module, which selectively focused on informative features, allowing the model to allocate its resources more effectively and enhance its ability to distinguish relevant and irrelevant features. This improved the overall performance of the model. Additionally, we introduced the global residual block to address the issues of vanishing and exploding gradients, thereby improving the network’s generalization capability and robustness.
The experimental validation was carried out on a bearing vibration dataset using the Case Western Reserve University dataset. The proposed method was validated and compared by adding different intensities of noise to the dataset, and the effectiveness of the proposed method is demonstrated.
The remainder of this paper is organized as follows.
Section 2 reviews the short-time Fourier transform, dilated convolution, and residual network and analyzes their advantages.
Section 3 introduces the proposed fault diagnosis method.
Section 4 validates the performance of the proposed model by using the CWRU dataset for comparative analysis with several models and illustrates the necessity of the proposed modules through ablation experiments.
Section 5 concludes the paper and discusses future work.
5. Conclusions
To solve the problem of noise leading to the non-smooth and nonlinear issues of the collected signal wave affecting the accuracy of bearing fault diagnosis, a rolling bearing fault diagnosis method based on an improved dilated convolutional neural network, namely MAB-DrNet, was proposed. The method integrated a short-time Fourier transform and improved dilated convolution. First, we proposed an STFT-based data preprocessing method by which the time-domain signal was converted into a time–frequency signal in order to achieve analysis from both the time domain and frequency domain perspectives and to initially solve the non-smooth and nonlinear problems of the signal. We then designed a novel bearing fault diagnosis model, which was based on the dilated-convolution-based residual model, and the MAB module and the global residual block were designed to increase the feature extraction ability of the model and improve its accuracy. To validate the noise immunity of the model for testing, the CWRU dataset was mixed with Gaussian white noise for the experiments, and the effectiveness of the proposed method was proven by comparative analysis with several advanced models Attention EfficientNet, RSG, and FDGRU and the classical network models Resnet18 and EfficientNet v2, respectively. Finally, the effectiveness of the designed MAB module and the GRB module was verified by ablation experiments.
From a theoretical perspective, the study advances the understanding of fault diagnosis in the presence of noise. By investigating the impact of noise on fault signals and proposing noise suppression and feature enhancement strategies, the study provides valuable insights into the challenges posed by noise in fault diagnosis. The study expands the knowledge base regarding the effects of noise on signal distortion, signal-to-noise ratio degradation, and increased risk of misdiagnosis. Additionally, the study contributes to the theoretical framework by proposing novel methods for noise suppression and feature enhancement, which can serve as a foundation for further studies in this field.
In terms of practical applications, the study findings offer practical solutions for improving fault diagnosis in real-world scenarios. By developing effective noise suppression techniques and feature enhancement strategies, the study provides practitioners with tools to enhance the accuracy and reliability of fault diagnosis systems. These findings have the potential to benefit various industries and domains that rely on fault diagnosis, such as manufacturing, transportation, and machinery maintenance. By addressing the challenges posed by noise, the research enables more precise and reliable identification of faults, leading to improved operational efficiency, reduced downtime, and cost savings.
Nevertheless, it is important to note that the methodology employed in this study did not address the potential influence of class imbalance on fault diagnosis. This limitation arises from the fact that the collected data predominantly consisted of healthy samples, with a smaller proportion of fault samples. As a result, the class imbalance issue was not explicitly considered in the current work. However, it is recognized that class imbalance can significantly impact the performance of fault diagnosis models, and addressing this concern will be a focus of future research endeavors. Subsequent investigations will strive to incorporate appropriate techniques or algorithms to mitigate the effects of class imbalance, thereby enhancing the robustness and generalizability of the fault diagnosis approach.