Data-driven vibration-based bearing fault diagnosis using non-steady-state training data

Pichler, Kurt; Ooijevaar, Ted; Hesch, Clemens; Kastl, Christian; Hammer, Florian

doi:https://doi.org/10.5194/jsss-9-143-2020

Articles | Volume 9, issue 1

https://doi.org/10.5194/jsss-9-143-2020

Special issue:

Sensors and Measurement Systems 2019

https://doi.org/10.5194/jsss-9-143-2020

Articles | Volume 9, issue 1

Regular research article

12 May 2020

Regular research article |

| 12 May 2020

Data-driven vibration-based bearing fault diagnosis using non-steady-state training data

Kurt Pichler, Ted Ooijevaar, Clemens Hesch, Christian Kastl, and Florian Hammer

Abstract

This paper presents the extension of an empirical study in which a universally applicable fault diagnosis method is used to analyse vibration data of bearings measured with accelerometers. The motivation for extending the previously published results was to provide a profound analysis of the proposed approach with regard to a more feasible training scenario for real applications. For a detailed assessment of the method, data were acquired on two different test beds: a gearbox test bed equipped with various bearings at different health states and an accelerated lifetime (ALT) test bed to degrade a bearing and introduce an operational fault. Features were extracted from the raw data of two different accelerometers and used to monitor the actual health state of the bearings. For that purpose, feature selection and classifier training are performed in a supervised-learning approach. The accuracy is estimated using an independent test dataset. The results of the gearbox test bed data show that the training of the method can be performed with non-steady-state data and that the same feature set can be used for different revolution speeds if a small decrease in accuracy is acceptable. The results of the ALT test bed show that the same features that were identified in the gearbox test start to change significantly when the bearing starts to degrade. Thus, it is possible to observe the identified features for applying predictive maintenance.

Download & links

Article (PDF, 7733 KB)

How to cite

How to cite.

Dates

Received: 20 Sep 2019 – Revised: 31 Mar 2020 – Accepted: 09 Apr 2020 – Published: 12 May 2020

1 Introduction

Manufacturing companies continuously try to increase their productivity, by avoiding machine downtime among other things. The former involves considerable costs because of the resulting loss of turnover. Monitoring the condition of, for instance, bearings and gears plays a vital role in the maintenance programme of rotating machines. Early fault detection could allow for moving from a time-based preventive-maintenance programme to a condition-based predictive-maintenance strategy and reducing unexpected machine downtime and cost.

Vibration-based condition monitoring is an established approach that has been employed by industries for many years in their maintenance programmes (Randall, 2011). However, up to this day, machine operators often still base their maintenance decisions on data from the periodical and manual inspection of single machines, which does not always result in correct conclusions. The common practice is that vibration measurements are periodically recorded using portable vibration sensors, and measurement signals are analysed by an expert to interpret the machine's health condition. This approach can, however, lead to serious misinterpretation, where rapidly growing impairments could be missed.

A continuous condition-monitoring approach enables early detection of machine faults. In this way, the machine condition is continuously tracked, and total failures can be anticipated in advance, hence allowing appropriate maintenance actions. Despite their advantages, continuous monitoring programmes are still not well adopted by industry. Firstly, this is because it often involves a high investment cost. Although recent advancements in sensor, acquisition and processing hardware have demonstrated cost-effective solutions (Albarbar et al., 2008; Ompusunggu et al., 2018), the economic benefit of the investment is still not clear and hard to quantify. Secondly, this is because many of those systems still require an expert to interpret the analysis results. Finally, this is also because it is not straightforward to select the most appropriate condition-monitoring method for a specific application.

A wide range of vibration-based bearing fault detection methods have been proposed in the literature (Henriquez et al., 2014; Sait and Sharaf-Eldeen, 2011; Wang et al., 2017; Zarei et al., 2014). Approaches that utilize time domain features (e.g. crest factor and kurtosis; Barbini et al., 2017), frequency and cepstral-domain features (e.g. envelope analysis and cepstral coefficients; Borghesani et al., 2013) usually assume stationary machine conditions. Other methods such as cyclostationary analysis (i.e. second-order technique in the frequency domain; Dalpiaz et al., 2013; Hu et al., 2019) and time–frequency domain analysis (e.g. Wigner–Ville distribution, Hilbert–Huang transform and wavelet-transform-based features; Bajric et al., 2016) are more appropriate for non-stationary processes. Some of those methods are purely data driven, whereas others use the physical relation between the bearing geometry, the rotational shaft speed and the bearing-specific fault frequencies associated to the impulse behaviour introduced by bearing faults.

In this paper, we present a purely data-driven method that extracts a large number of features from vibration data of accelerometer measurements and selects and classifies these features in a supervised-learning approach. Training and test data were acquired at two different bearing test beds: a gearbox setup that can be equipped with bearings of different degradation statuses and a simple rotating shaft with a bearing under certain radial loads for accelerated degradation. However, the method does not address one specific application with certain requirements of the application. We applied the same basic idea to very different applications like fault diagnosis in a hydraulic accumulator loading circuit or oscillation detection. In both cases, we obtained satisfying accuracy values. In any case, the limitation of the method is that it requires information-rich training data of the underlying system to select meaningful features and train suitable classifiers. In this context, information-rich means that the training data have to contain the different possible (fault) states of the application as well as different operation modes. There are of course general requirements that hold for fault detection in most applications, such as the desire to obtain a high detection accuracy and the ability to detect faults under all relevant operational conditions. Both of them are tackled implicitly for the specific application of this paper by the main goals that are stated below.

The basics of the proposed method were already compared to two state-of-the-art methods for bearing monitoring in Ooijevaar et al. (2019). It proved that it can compete with the other methods, although those incorporate specific knowledge about the monitored bearing, while the proposed method is purely data driven. Of course, there are also some drawbacks of the proposed method compared to the other state-of-the-art methods, for instance the requirement of sufficient training data for different states and operation modes.

In contrast to Ooijevaar et al. (2019), the goal of this paper is to analyse if a more feasible training scenario is possible. Therefore, it is investigated if it is sufficient to acquire training data with linearly increasing speed instead of many different steady-state speed levels to save time for training data acquisition. It is also investigated if the same feature set can be used for different revolution speeds of the bearing to make it applicable to different speeds without adapting the feature set.

The paper is structured as follows: in Sect. 2, the problem is briefly stated, and the experimental setup is introduced. The classification method is described in detail in Sect. 3, and Sect. 4 provides test results. Finally, Sect. 5 gives the conclusions of the work.

2 Problem statement and experimental setup

In this paper, a previously proposed method (Ooijevaar et al., 2019) for bearing fault detection based on vibration measurements is further analysed with regard to a more feasible training scenario for real-world applications. For that purpose, it is firstly investigated if training data can be acquired using measurements with linearly increasing speed instead of acquiring data at many different steady-state speed levels. That saves a significant amount of time for training data acquisition. Secondly, it is investigated if the same feature set can be used for different revolution speeds of the bearing. That makes the approach more universally applicable, since there is no need to adapt the feature set to the revolution speed.

Since the paper deals with applying the proposed method to bearing fault detection, it is of essential importance to know the underlying physical system. Hence, this section provides a detailed explanation of the experiments and the measured data. Two types of experiments have been performed: (i) an accelerated lifetime test (ALT) of a ball bearing on a single-shaft drive train setup and (ii) a test on a more complex gearbox setup including bearings with various faults. In both test setups, the vibrations of the bearing are measured by accelerometers. These vibration data are used to detect the faults in a machine-learning approach. The tests are described in the next two subsections.

2.1 Accelerated lifetime test

The accelerated lifetime test allows for creating an operational fault in a bearing. This test differs from other studies on the fact that those are often limited to artificially induced faults. Moreover, the fault evolution and accumulation can be monitored during the accelerated lifetime. The experimental setup used to perform the accelerated lifetime test is shown in Fig. 1. The setup comprises of a single shaft with a test bearing. The shaft is supported with the help of a support bearing on each side. A hydraulic cylinder is used to apply a radial load to the test bearing up to a maximum of 10 kN. The test bearing is oil lubricated by an internal oil bath. Two air fans are installed to cool the setup and avoid overheating of the bearing. The setup is driven by a motor at a fixed rotation speed of 1500 rpm.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f01

Figure 1The drive train setup used to reduce the lifetime of a bearing to less than 1 d, allowing for the generation of vibration data during the accumulation of an operational bearing fault.

Download

The test procedure is schematically illustrated in Fig. 2. Vibration measurements were performed under a nominal radial load of 1.5 kN (i.e. 10 % of the dynamic load rating). The radial load was temporarily increased to 9.0 kN (i.e. 65 % of the dynamic load rating) to accelerate the degradation of the bearing. In the beginning, the interval of increased load was 20 min, but this had been reduced as soon as the first indication of an incipient fault was noticed in the measured vibration responses. In total, 30 vibration measurements were performed at the nominal 1.5 kN load condition, and 29 vibration measurements were performed at the high 9.0 kN radial load. The accelerated lifetime test was stopped when a vibration peak level of ±50 g was reached.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f02

Figure 2The load was temporarily increased from 1.5 to 9 kN to accelerate the lifetime of the bearing.

Download

The applied radial load, the radial vibrations in the loading direction and the temperature of the bearing housing were measured during the test. The machine vibrations were measured using a piezo-film ACH-01-03 accelerometer and sampled at 12.8 kHz by an embedded acquisition platform. In each measurement, 20 s of data were acquired. The acquisition platform consists of a BeagleBone Black single-board computer with a Linux operating system, supplemented with a customized six-channel interface. This embedded platform is used as a compact, open, scalable and cost-effective data acquisition system.

The accelerated lifetime test was performed on a FAG 6205 ball bearing. Before the start of the test, a small indentation (see Fig. 3) with a diameter of 230 µm was created in the inner race using a Rockwell C hardness tester. This indentation is used as a local stress riser and represents a local plastic deformation caused by, for instance, a contamination particle. Subsequently, the accelerated lifetime test was performed for several hours. Although bearings can fail in many different ways, the indentation triggers the bearing to fail in a more repeatable way. The test was stopped when severe rolling contact surface fatigue occurred at the inner race (Halme and Andersson, 2009). The start and the end condition of the inner race of the test bearing are shown in Fig. 3.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f03

Figure 3The indentation at the bearing inner race was used as the start condition, and the surface fatigue fault at the inner race was introduced by the accelerated lifetime test.

Download

Only a single dataset has been used in this paper. However, the accelerated lifetime test has been performed several times as part of other research by the authors. They have all resulted in similar surface fatigue faults at the inner race of the test bearing.

2.2 Gearbox test

The second test performed in this study was an industrially representative gearbox setup. Figure 4 shows a photograph and a schematic top view of the gearbox setup. The test setup consists of (i) an induction electric motor, (ii) a gearbox and (iii) a magnetic brake. The motor is controlled by a variable-frequency drive (VFD) with either a stationary mode or a transient mode (run-up or run-down mode). The motor speed can be controlled from 0 to 3000 rpm. The gearbox input shaft is connected to the motor through a flexible coupling, while the gearbox output shaft is directly coupled to the brake. The torque applied to the brake can be adjusted by the controller from 0 to 50 Nm.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f04

Figure 4Gearbox setup comprising a motor, three-shaft gearbox and brake to introduce a load.

Download

As illustrated in Fig. 4, the gearbox comprises of three parallel shafts connected through contacting spur gear pairs. Note that the number of gear teeth is indicated in the figure. Hence the total reduction factor from the input to the output shaft is equal to $(100 / 29) \times (90 / 36) = 8.62$ . The input shaft is supported by MB ER-10K deep-groove ball bearings, while the other shafts are supported by MB ER-16K deep-groove ball bearings. For simulating a healthy or faulty state on the gearbox, the right-side bearing housing that supports the second shaft is equipped either with a healthy or a damaged FAG 6205-C-TVH ball bearing.

Two healthy bearings and three faulty bearings with different inner race faults were tested. An indentation fault with a diameter of 490 µm was created using an Rockwell C hardness tester. Two other bearings with operational faults were created using the accelerated lifetime test setup as described in Sect. 2.1. The healthy bearings are referred as “Healthy1” and “Healthy2”, while the faulty bearings are referred as “Indent”, “Faulty1” and “Faulty2”, in order of increasing severity; these are illustrated in Fig. 5.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f05

Figure 5Five bearing states tested on the gearbox setup comprising two healthy bearings and three faulty bearings with different severities.

Download

For each healthy or faulty state, two operating conditions were imposed on the gearbox setup, namely two different motor speeds of 1500 and 3000 rpm. The brake torque was kept constant at 50 Nm. Because of the transmission ratio, the rotational speed of the second shaft is 29∕100 lower than that of the motor speed, while the torque applied on the second shaft is 36∕90 lower than that of the brake torque. Hence, for the imposed operating conditions, the rotational speeds of the second shaft were 435 and 870 rpm, while the torque applied to the second shaft was 20 Nm. A high-end PCB (picocoulomb; manufacturer PCB Piezotronics) accelerometer and a low-cost MEMS (microelectromechanical-system) accelerometer were mounted on the gearbox housing as shown in Fig. 4. The vibration signals were sampled at 50 kHz using a Dewesoft data acquisition system. For each operating condition, 10 operations of 20 s each were repeated. Furthermore, for each of the five tested bearing states, three ramp-up measurements were conducted. In these ramp-up measurements, the motor speed was increased linearly from 0 to 3000 rpm within 40 s. The ramp-up measurements are used to investigate the first and main aim of this paper, i.e. reducing measurement effort for training data acquisition. It is obvious that conducting 40 s of ramp-up measurements saves a significant amount of time compared to conducting steady-state measurements for several seconds at many revolution levels. All data are then processed using scripts written in MATLAB.

3 Classification method

The fault diagnosis approach presented here is a purely data-driven one; i.e. it incorporates no physical knowledge about the monitored system. This makes it on one hand much more flexible and applicable to many other kinds of systems, machines or components. On the other hand, incorporating extra knowledge usually improves the diagnostic ability of a condition-monitoring system and reduces the necessary amount of training data.

The proposed method applies a supervised-learning approach to annotated measurement data. In the presented application, accelerometers measure vibrations that are caused by the bearing to be monitored. For this purpose, the accelerometer is mounted at the housing of the bearing (see for instance Fig. 1). Since the fault state in the gearbox setup is known, annotated data for classifier training are available.

The training procedure of the proposed method consists of three steps:

feature extraction from annotated data
feature selection
classifier training.

The evaluation procedure for new data consists of two steps:

extraction of the features selected in the training procedure
classifier evaluation.

All steps are described in more detail in the following subsections. Moreover, the combination of training and test procedures is summed up in Fig. 6. From the annotated training data, features are extracted, and the most significant features for the classification task are selected. With these features, a classifier is trained. From the test data, the features that were selected before are extracted. The trained classifier is applied to the features to obtain the estimated class information for the test data. If the ground truth of the test data is available, it can be used together with the estimated class information to evaluate the classifier (confusion matrix, accuracy, etc.; de Ridder et al., 2017).

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f06

Figure 6Block diagram of the proposed method.

Download

3.1 Feature extraction

In the first step, a large number of features is extracted in a sliding-window approach from the raw accelerometer signals. Feature extraction for vibration analysis has been discussed in numerous publications; extensive reviews can be found for instance in Wang et al. (2017) and Singh and Vishwakarma (2015). The extraction of typical statistical features in the time domain is described in Sharma and Parey (2016), Lei et al. (2007), Shen et al. (2013), Decker and Lewicki (2003), Alattas and Basaleem (2007), Boldt et al. (2013), Jalil et al. (2013), Suma and Gurumurthy (2010), and Kollialil et al. (2013). Features in the time–frequency and frequency domains are proposed and investigated in Sharma and Parey (2016), Lei et al. (2007), Alattas and Basaleem (2007), and Boldt et al. (2013). Typical symptom parameters in the frequency domain for rotating machinery are extracted in Wang and Chen (2007). Adopting the spectral kurtosis for vibration monitoring is examined in Rao (2015) and Antoni and Randall (2006). In McClintic et al. (2000) and Assaad et al. (2014), features of residual and difference signals are extracted by using for instance autoregressive models. Features in the wavelet domain are introduced in Heidari Bafroui and Ohadi (2014), Jafarizadeh et al. (2008), Bajric et al. (2016), and Kollialil et al. (2013). Satyam et al. (1994) and Konstantin-Hansen and Herlufsen (2010) examine vibration analysis in the cepstral domain. The application of synchronous time averaging is demonstrated for instance in McFadden and Toozhy (2000). We implemented a broad selection of the proposed features to analyse the measured data. Overall, 83 features were extracted. Amongst the finally selected features of a time series $x = x_{1}, x_{2}, \dots, x_{n}$ were for instance the root mean square (RMS; Sharma and Parey, 2016) as

\begin{matrix} (1) & x_{rms} = \sqrt{\frac{1}{n} \cdot \sum_{i = 0}^{n} x_{i}^{2}} \end{matrix}

or the interquartile range (Kollialil et al., 2013) as

\begin{matrix} (2) & x_{iqa} = {\hat{x}}_{0.75} - {\hat{x}}_{0.25}, \end{matrix}

where

\begin{matrix} (3) & {\hat{x}}_{p} = \{\begin{cases} \frac{1}{2} (y_{n \cdot p} + y_{(n \cdot p) + 1}) & if n \cdot p \in N \\ y_{⌊n \cdot p + 1⌋} & if n \cdot p \notin N \end{cases} \end{matrix}

and $y_{1} \leq y_{2} \leq \dots \leq y_{n}$ are the sorted values of x.

Also the symptom parameters (Wang and Chen, 2007) of

\begin{matrix} (4) & p_{2} = \sqrt{\frac{\sum_{i = 1}^{N} f_{i}^{4} \cdot S (f_{i})}{\sum_{i = 1}^{N} f_{i}^{2} \cdot S (f_{i})}} \end{matrix}

and

\begin{matrix} (5) & p_{4} = \frac{σ}{\overline{f}}, \end{matrix}

where f_i, $i = 1, \dots, N$ are the frequency bins, S(f_i) is the power spectrum,

\begin{matrix} (6) & σ = \sqrt{\frac{\sum_{i = 1}^{N} {(f_{i} - \overline{f})}^{2} \cdot S (f_{i})}{N - 1}} \end{matrix}

and

\begin{matrix} (7) & \overline{f} = \frac{\sum_{i = 1}^{N} f_{i} \cdot S (f_{i})}{\sum_{i = 1}^{N} S (f_{i})}, \end{matrix}

are amongst the top features. However, for confidentiality reasons we are not allowed to name the exact features that were chosen in each particular test. Therefore, we use abstract feature numbers in the following sections.

3.2 Feature selection

In the next step of the supervised-learning approach, the dimensionality of the feature space is reduced to avoid the curse of dimensionality (Bellman, 2003). Therefore, the significant features are identified by feature selection procedures as described in Guyon and Elisseeff (2003). In particular, a standard forward-selection-filter algorithm selecting one feature per step was applied. As the selection criterion in each step of forward selection, we use the robust Dy–Brodley distance measure (Dy and Brodley, 2004). Assuming a dataset with C∈ℕ classes in a k-dimensional feature space, the feature values for each class can be represented as a matrix of $X_{c} \in R^{n_{c} \times k}$ , $c \in \{1, \dots, C\}$ , with n_c∈ℕ denoting the number of samples for class c. Then μ_c∈ℝ^k and $Σ_{c} \in R^{k \times k}$ , $c \in \{1, \dots, C\}$ , denote the mean values and covariance matrices of each class c, and μ∈ℝ^k denotes the mean value over all classes. Defining the within-scatter $S_{W} \in R^{k \times k}$ as

\begin{matrix} (8) & S_{W} = \sum_{c = 1}^{C} \frac{n_{c}}{n} \cdot Σ_{c} \end{matrix}

and the between scatter $S_{B} \in R^{k \times k}$ as

\begin{matrix} (9) & S_{B} = \sum_{c = 1}^{C} \frac{n_{c}}{n} \cdot {(μ_{c} - μ)}^{T} \cdot (μ_{c} - μ), \end{matrix}

where

\begin{matrix} (10) & n = \sum_{c = 1}^{C} n_{c}, \end{matrix}

the Dy–Brodley distance measure is finally defined as

\begin{matrix} (11) & J = tr (S_{W}^{- 1} \cdot S_{B}) . \end{matrix}

Feature selection is stopped when the relative gain of the selection criterion falls below 1 %. We also performed tests with the Mahalanobis distance (McLachlan, 1999) as the selection criterion; however, both distance measures resulted in the same feature sets.

3.3 Classifier training

After feature extraction and selection, a classifier is trained in the feature space. For that purpose, we use linear and quadratic discriminant analysis (Hastie et al., 2009; de Ridder et al., 2017).

In this classification approach, the class conditional distributions of a new data sample x∈ℝ^k are modelled as $P (x | c = c_{i})$ for each class $c_{i} \in \{1, \dots, C\}$ . By using Bayes' rule of

\begin{matrix} (12) & \begin{aligned} P (c = c_{i} | x) & = \frac{P (x | c = c_{i}) \cdot P (c = c_{i})}{P (x)} \\ = \frac{P (x | c = c_{i}) \cdot P (c = c_{i})}{\sum_{c_{j} = 1}^{C} P (x | c = c_{j}) \cdot P (c = c_{j})} \end{aligned} \end{matrix}

and selecting the class c_i with highest conditional probability, a class prediction can be made. In the application example of this paper, the classes are “Healthy”, “Indent” and “Fault”. In discriminant analysis, P(x|c) is modelled as a multivariate normal distribution with density

\begin{matrix} (13) & \begin{aligned} P & (x | c = c_{i}) = \\ \frac{1}{{(2 π)}^{\frac{k}{2}} \cdot {|Σ_{c_{i}}|}^{\frac{1}{2}}} \cdot e^{- \frac{1}{2} {(x - μ_{c_{i}})}^{T} \cdot Σ_{c_{i}}^{- 1} \cdot (x - μ_{c_{i}})}, \end{aligned} \end{matrix}

where the prior probabilities P(c=c_i), the class mean values $μ_{c_{i}}$ and the class covariance matrices $Σ_{c_{i}}$ are estimated from training data. Equation (13) represents the quadratic case of discriminant analysis. In the linear case, the normal distributions for each class are assumed to have the same covariance matrix, i.e. $Σ_{c_{i}} = Σ$ for $c_{i} \in \{1, \dots, C\}$ .

The validity of using normally distributed classifiers was checked by normal probability plots of the feature vectors (see for instance the feature interquartile range for two states in Fig. 7).

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f07

Figure 7Normal probability plots of the feature interquartile range, 3000 rpm data, PCB sensor, and states Healthy2 and Faulty2.

Download

The supervised-learning approach implies that the method depends on having a sufficient amount of annotated training data for all states (failure modes) to be monitored. There are also classifiers for one-class classification (also referred to as novelty detection) available (Tax, 2001). However, those techniques detect only a deviation from a nominal state and are thus prone to overdetection due to changing operation modes. Furthermore, the feature selection process depends on having an annotated dataset as well. For the ALT test no training data from different states were available; hence we used a novelty detection technique. For that purpose, the features selected in the gearbox test are observed by cumulative-sum (CUSUM) control charts (Hawkins and Olwell, 1998).

Given annotated training data, the whole process of feature extraction, feature selection and classification can be fully automated. The more useful the information in the training data is, the better the resulting feature subset and classifier are. In this context, information means different states, rotation speed, repeated measurements with different samples of the same bearing type and so on.

3.4 Evaluation of new data

The process of evaluating a new data sample is straightforward: the selected features are extracted in a sliding-window approach from the raw accelerometer signals, and the classifier is applied to those features. Since many classifiers are able to deliver class membership probabilities, it is generally also possible to determine instances lying between two distinct states. However, we restrict here the evaluation to crisp class decisions by detecting the maximum class probability for each observation.

If a set of new samples (i.e. the test dataset) contains the true class information (the ground truth), the estimated class can be compared to the ground truth to evaluate the quality of the classifier. For that purpose, many measures like accuracy, balanced accuracy, confusion matrices and receiver-operating-characteristic (ROC) curves are proposed in the literature (de Ridder et al., 2017).

4 Results

The results obtained by the proposed method are presented in this section. The accelerated lifetime test results are addressed first. This is followed by the results of the gearbox test.

4.1 Accelerated lifetime test

In the ALT test, the features were extracted from the raw accelerometer signals in an overlapping sliding-window approach (window length of 0.2 s and overlap of 0.1 s). That yields 199 observations per 20 s data batch. However, for final evaluation, only the mean value of those 199 observations of a data batch is observed. Unlike the gearbox setup, we had no data of different health states available for feature selection and classifier training in the accelerated lifetime test. Therefore we could not strictly follow the evaluation scheme proposed in Sect. 3. We were rather restricted to detect significant changes in the feature values. For that purpose, CUSUM control charts were applied to the behaviour of a feature over time. Due to the missing feature selection step, we evaluated the top-ranked features of the gearbox test. All of those features increased significantly towards the end of the test run for the 9.0 kN as well as for the 1.5 kN load conditions. For instance, feature 20 is depicted in Fig. 8. Due to the increasing feature value, the upper threshold of the CUSUM control chart is exceeded after approximately 7.3 h in the 9.0 kN case and 7.9 h in the 1.5 kN case, indicating a failure of the bearing. This result shows that the features identified in the gearbox test can be used to perform predictive maintenance of bearings by observing them using CUSUM control charts. For control purposes, Fig. 9 shows an arbitrarily chosen feature. The feature shows no significant trend, and the CUSUM control charts do not exceed the thresholds. This is a first indication that a wrongly chosen feature will no produce overdetections.

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f08

Figure 8Feature values and CUSUM control charts for feature 20 in the ALT test for load 9.0 kN (a) and 1.5 kN (b).

Download

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f09

Figure 9Feature values and CUSUM control charts for feature 2 in the ALT test for load 9.0 kN (a) and 1.5 kN (b).

Download

4.2 Gearbox test

Just like in the ALT, the features for the gearbox test were extracted from the raw accelerometer signals in an overlapping sliding-window approach with a window length of 0.2 s and an overlap of 0.1 s, delivering again 199 observations for each 20 s data batch. After the extraction of all features, the feature selection algorithm was applied to the 3000 and 1500 rpm motor speed data independently. For feature selection, we did not use all available states. We used only the datasets of the states Healthy1, Healthy2 and Faulty2, but we did not use Indent and Faulty1. This procedure tests whether the method selects features that are suitable for the other two states as well or not. After feature selection, a quadratic-discriminant-analysis classifier, as described in Sect. 3.3, is trained in the space of the selected and annotated features. Only observations from the states Healthy2, Indent and Faulty2 are used for classifier training to test the generalizability of the proposed method. Moreover, for feature selection and classifier training, just two arbitrarily chosen data batches from each of the three states are used. That means that $199 \cdot 2 \cdot 3 = 1194$ observations were available for training. After training, validation was performed with another arbitrarily chosen 20 s data batch from all five bearing states, hence, $199 \cdot 1 \cdot 5 = 995$ observations. However, as the target class of the classifier we did not use those five states, but we only used the simplified states Healthy (containing Healthy1 and Healthy2), Indent and Faulty (containing Faulty1 and Faulty2). Since the states Healthy1 and Healthy2 and Faulty1 and Faulty2 are very similar, there is no reason to discriminate between them; in fact they are even supposed to produce similar feature values. As evaluation criteria, we use classification accuracy and the confusion matrix.

First, we evaluate the data of the high-end PCB accelerometer. For the 3000 rpm motor speed data, the algorithm selected three top-ranked features, and for the 1500 rpm data it selected four top-ranked features. However, for a first visual impression we show only the top two features for all recorded states and rotation speeds in a scatterplot in Fig. 10 (3000 rpm) and Fig. 11 (1500 rpm).

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f10

Figure 10Scatterplot of top two features for the 3000 rpm PCB dataset.

Download

https://www.j-sens-sens-syst.net/9/143/2020/jsss-9-143-2020-f11

Figure 11Scatterplot of top two features for the 1500 rpm PCB dataset.

Download

The scatterplots already indicate a few possible conclusions:

Different top features were selected for the different rotation speeds.
The 3000 rpm dataset revealed better separability.
Faulty1 and Faulty2 produced similar feature values.
In the 3000 rpm dataset, the Indent class lies somewhere in between the healthy and the faulty states.

According to the feature selection step, the 3000 rpm data were validated with the top three features, and the 1500 rpm data were validated with the top four features. The validation yields 99.30 % accuracy for the 3000 rpm data (confusion matrix in Table 1) and 82.41 % accuracy (confusion matrix in Table 2) for the 1500 rpm data. That result confirms the first conclusion above: the separability of the states Healthy, Indent and Faulty in the 3000 rpm case is satisfying, while it is worse in the 1500 rpm case. Especially the state Healthy1 is misclassified in the 1500 rpm case. Since Healthy1 was not used for classifier training, it is obviously more likely to be misclassified.

Table 1Confusion matrix for 3000 rpm PCB data and the top three features.