!!! icbm intelligent condition based maintenance thesis
TRANSCRIPT
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
1/109
UNIVERSITY OF CINCINNATI
Date:___________________
I, _________________________________________________________,
hereby submit this work as part of the requirements for the degree of:
in:
It is entitled:
This work and its defense approved by:
Chair: _______________________________
_______________________________
_______________________________
_______________________________
_______________________________
10/28/05
Ranganath Kothamasu
Doctor of Philosophy
Industrial Engineering
Intelligent Condition Based Maintenance - A Soft Computing
Approach to System Diagnosis and Prognosis
Samuel. H. Huang
Ali Minai
Ernest HallSam Anand
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
2/109
Intelligent Condition Based Maintenance: A Soft Computing Approach
to System Diagnosis and Prognosis
A Dissertation submitted to the
Dept of Mechanical, Industrial and Nuclear Engineering
University of Cincinnati
In Partial fulfillment of the
Requirements for the degree of
Doctor of Philosophy
2005
By
Ranganath Kothamasu
Committee Members:
Dr. Samuel Huang
Dr. Sam Anand
Dr. Ernest Hall
Dr. Ali Minai
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
3/109
Abstract
Maintenance is the set of activities performed on a system to sustain it in operable
condition while Condition Based Maintenance (CBM) refers to the practice of triggering
these activities as necessitated by the condition of the target system. CBM thus entails the
process of diagnosis (of the target system) and timely identification of incipient or
existing failures popularly known as Failure Detection and Identification (FDI). FDI has
been given due research focus; however there is a dearth of autonomous yet interactive
decision making tools that would perform diagnosis andprognosis under the precepts of
CBM in a guided environment.
The development of such an architecture along with the tools necessary for
decision making in the realm of condition based maintenance constitute the focus of this
research. The architecture and the tools developed in this research encompass the model
based approach to FDI. These tools are built on Neuro-Fuzzy (NF) paradigms as they
offer many advantages in the form of accuracy, adaptability and lucidity compared to
other parametric and non-parametric approaches. Along with the development of a NF
algorithm, suitable evaluation criteria are also explored and developed to gauge the
applicability and efficiency of the developed models. Intelligent Condition Based
Maintenance (ICBM) thus refers to the creation of adaptive and robust FDI models based
on a model based architecture and their subsequent validation using suitable evaluation
criteria. The efficiency and robustness of these ICBM tools are demonstrated by applying
them in several scenarios Simulated as well as real world.
i
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
4/109
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
5/109
Acknowledgements
I am highly indebted to Dr. Samuel Huang for his guidance throughout my
dissertation. He has encouraged and helped me in numerous ways to accomplish this
research effort. I am also grateful to Dr. Sam Anand, Dr. Ernest Hall and Dr. Ali Minai
for being a part of my dissertation committee. Dr. Sam Anand has always been a source
of inspiration and I am grateful to him for giving me this opportunity to do doctoral
research. It was through interactions with Dr. Bruce Shultes that I was able refine my
knowledge in statistical techniques which has been of great help to my research. Dr.
Ernest Hall is an excellent mentor and it was through his critique and recommendations
that I was able to define the scope and content of my dissertation. It was through
discussions with Dr. Ali Minai that I have gained insights into the workings of intelligent
systems which are the primary focus of my dissertation. I would like to specifically
acknowledge my family (parents, my brother and sister in law) who were my primary
drivers and source of inspiration and fortitude from the inception to conclusion of this
research effort. Ms. Vinodha Sadasivam stood by me through thick and thin, and has in
many ways guided this effort to its conclusion. I would also like to thank my colleagues
Mr. Kanthi Muthiah, Mr. Nuo Xu and Mr. Saurabh Dwivedi for their constant support. A
special thanks to Dr. Jun Shi, whose efforts into making my dissertation foolproof have
been a tremendous help.
iii
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
6/109
Table of Contents
ABSTRACT.................................................................................................................................................. I
ACKNOWLEDGEMENTS ........................................................................................................................III
TABLE OF CONTENTS............................................................................................................................IV
LIST OF FIGURES.....................................................................................................................................VI
LIST OF TABLES.................................................................................................................................... VII
INTRODUCTION........................................................................................................................................ 1
LITERATURE REVIEW............................................................................................................................ 5
2.1.SYSTEM MAINTENANCE PARADIGMS................................................................................................... 52.2.SYSTEM MAINTENANCE -TOOLS AND TECHNIQUES .......................................................................... 10
2.2.1. Reliability Based Maintenance ................................................................................................. 112.2.2. Model Based Approach to FDI ................................................................................................. 122.2.3. Signal Based FDI...................................................................................................................... 142.2.4. Statistical FDI / Maintenance................................................................................................... 16
INTELLIGENT CONDITION BASED MAINTENANCE (ICBM) -CONCEPTUAL
DEVELOPMENT....................................................................................................................................... 17
3.1.ICBMARCHITECTURE DEVELOPMENT ........................................................................................... 173.2.ICBMMODELING PARADIGM......................................................................................................... 20
INTELLIGENT CONDITION BASED MAINTENANCE - MODEL DEVELOPMENT AND
VALIDATION............................................................................................................................................ 23
4.1.ADAPTIVE MAMDANI FUZZY MODEL (AMFM)................................................................................. 244.1.1. Architecture Initialization......................................................................................................... 254.1.2. Rule Tuning............................................................................................................................... 28
4.2.EVALUATION CRITERIA ..................................................................................................................... 314.2.1. Validation of Model Precision .................................................................................................. 314.2.1.1. Function Approximation .................................................................................................................... 334.2.1.2. Classification...................................................................................................................................... 36
4.2.2. Validation of Model Legibility .................................................................................................. 40
INTELLIGENT CONDITION BASED MAINTENANCE - CASE STUDIES .................................... 46
5.1.SPINDLE BEARING FAILURE DIAGNOSIS ............................................................................................ 465.1.1. Model Development & Benchmarking ...................................................................................... 47
iv
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
7/109
5.1.2. Summary ................................................................................................................................... 535.2.HARD TURNING ................................................................................................................................. 54
5.2.1. Flank Wear and Force Prediction........................................................................................ 555.2.2. Simulated Failures................................................................................................................ 57
5.2.2.1. Bearing Wear ..................................................................................................................................... 585.2.2.2. Fixture Misalignment ......................................................................................................................... 59
5.2.3. Model Development .................................................................................................................. 615.2.4. Model Evaluation...................................................................................................................... 62
5.2.4.1. Tool replacement model..................................................................................................................... 635.2.4.2. Failure Detection and Diagnosis ........................................................................................................ 65
5.2.5. Summary ................................................................................................................................... 675.3.ENGINE DIAGNOSIS............................................................................................................................ 67
5.3.1. Feature Extraction.................................................................................................................... 685.3.2. Data Assimilation...................................................................................................................... 725.3.3. Model Development .................................................................................................................. 72
CONCLUSIONS AND FUTURE RESEARCH....................................................................................... 77
6.1.CONCLUSIONS.................................................................................................................................... 776.2.FUTURE RESEARCH............................................................................................................................ 78
REFERENCES ........................................................................................................................................... 80
APPENDIX I............................................................................................................................................... 88
A1.1.FEATURE EXTRACTION ................................................................................................................... 88
APPENDIX II ............................................................................................................................................. 99
A2.1.MODEL EXTENSION ........................................................................................................................ 99
v
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
8/109
List of Figures
FIGURE1.1.BATHTUB CURVE DEPICTING FAILURE RATE OF EQUIPMENT (STAMATIS,1995)............................ 3
FIGURE2.1.TAXONOMY OF MAINTENANCE PHILOSOPHIES (KOTHAMASU ET AL,2004) .................................. 6
FIGURE2.2.FLOW OF MODEL BASED APPROACHES (SIMANI ET AL,2003)...................................................... 13
FIGURE3.1.ICBMARCHITECTURE ............................................................................................................... 18
FIGURE4.1.POLYGONAL APPROXIMATION TO COG...................................................................................... 26
FIGURE4.2.ADAPTIVE MAMDANI FUZZY MODEL (AMFM).......................................................................... 28
FIGURE4.3.BOX AND WHISKER PLOT OF MSE VALUES ................................................................................. 36
FIGURE4.4.BOX AND WHISKER PLOT OF MSE VALUES ................................................................................. 38
FIGURE4.7.FINAL MEMBERSHIP FUNCTIONS (A)INPUT1(B)INPUT2 ............................................................. 44
FIGURE5.1.RESPONSE SURFACES (A)REGRESSION (B)NEURALNETWORK(C)ICBM.................................. 49
FIGURE5.2.DISTRIBUTION OF RESPONSES (A)REGRESSION (B)NEURALNETWORK(C)ICBM..................... 50
FIGURE5.3.RESPONSE SURFACES USING CLASSIFICATION SCHEME (A)TRADITIONAL (B)ICBM.................. 51
FIGURE5.4.DECISION BOUNDARIES (A)TRADITIONAL MODEL (B)ICBM MODEL ......................................... 52
FIGURE5.5.BOX PLOT OF PATTERN DISTANCES FROM LINEAR AND ICBM MODELS ...................................... 53
FIGURE5.6.EFFECT OF BEARING WEAR ON CUTTING FORCE (FZ) ................................................................... 58
FIGURE5.7.EFFECT OF MISALIGNMENT IN Y-DIRECTION ............................................................................... 60
FIGURE5.8.FORCE VS TIME (A)FORCE AT DIFFERENT SPEEDS (B)FORCE AT DIFFERENT FEEDS ................... 62
FIGUREA1.1.SPIKES IN THE DATA BELONGING TO STATE X.......................................................................... 95
FIGUREA1.2.SIGNALS WITH VARYING ACTIVE FREQUENCY COMPONENTS .................................................... 97
vi
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
9/109
List of Tables
TABLE2.1.MAINTENANCE TOOLS AND TECHNIQUES ..................................................................................... 10
TABLE4.1.DEFINITION OF THE TRADITIONAL CRITERIA USED IN MODEL EVALUATION .................................. 32
TABLE4.2.EVALUATION CRITERIA FOR THE FUNCTION APPROXIMATION PROBLEM ...................................... 34
TABLE4.3.CONFIDENCE INTERVALS FROM PAIR-WISE HOCHBERG AND TAMHANE TEST .............................. 34
TABLE4.4.MSE VALUES OF THE MODELS AT DIFFERENT NOISE LEVELS........................................................ 35
TABLE4.5.CLASSES IN THE ECOLI DATASET ................................................................................................. 37
TABLE4.6.AIC VALUES OF NETWORKS DEVELOPED FOR CLASSIFYING THE ECOLI DATASET........................ 37
TABLE4.7.CONFIDENCE INTERVALS FOR DIFFERENCE IN ERROR................................................................... 38
TABLE4.8.ERROR PROPORTIONS WHEN SIMULATED IN NOISY ENVIRONMENT ............................................... 39TABLE4.9.KL DISTANCE MATRIX FORX1 ..................................................................................................... 43
TABLE4.10.KL DISTANCE MATRIX FORX2................................................................................................... 44
TABLE5.1.NIST DATA ON SPINDLE BEARINGS .............................................................................................. 47
TABLE5.2.RANGE OF FEATURES IN THE TRAINING DATASET ......................................................................... 48
TABLE5.3.RANGE OF RESPONSE OF VARIOUS MODELS.................................................................................. 49
TABLE5.4.PATTERN DISTANCES FROM EACH MODEL ................................................................................... 52
TABLE5.5.PRECISION OF MODELS FROM VARIOUS TECHNIQUES ................................................................... 57
TABLE5.6.SAMPLE FALSE ALARM FROM REGRESSION MODEL ...................................................................... 63
TABLE5.7.ACTUAL AMFM AND REGRESSION OUTPUTS ............................................................................... 64
TABLE5.8.AMFM VS. REGRESSION IN PRESENCE OF NOISE (TOOL REPLACEMENT) ...................................... 65
TABLE5.9.AMFM VS. REGRESSION, EXTRAPOLATING IN PRESENCE OF NOISE (TOOL REPLACEMENT) .......... 65
TABLE 5.10.AMFM VS. REGRESSION IN PRESENCE OF NOISE (FAILURE MODE DETERMINATION) ................. 66
TABLE 5.11.AMFM VS. REGRESSION, EXTRAPOLATING IN PRESENCE OF NOISE ........................................... 67
TABLE5.12.FEATURES IN TRAINING DATA SET.............................................................................................. 70
TABLE5.13.FEATURES IN TESTING DATA SET................................................................................................ 71
TABLE5.14.RESULTS FROM MODEL USING 4 RULES ...................................................................................... 74
TABLE5.15.RESULTS FROM THE ENHANCED MODEL ..................................................................................... 76
vii
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
10/109
Chapter I
Introduction
The oldest and most common maintenance and repair strategy is fix it when it
breaks. The appeal of this approach is that no analysis or planning is required. The
problems however are the reduction in availability and high unscheduled downtime
because of unanticipated breakdowns that affect the overall performance of system.
Availability in this perspective has a serious impact on organizational agility especially
those that implement efficiency improvement strategies such as Just In Time (JIT) and
Material Resource Planning(MRP) (Stamatis, 1995).
Quality is increasingly seen as a motivation for improved maintenance
management as the link between quality, process/equipment control and productivity
improvement becomes increasingly apparent (Ben-Daya & Duffua, 1995). Another
compelling but less addressed justification of maintenance is safety and environmental
preservation which assumes a highly significant role with increase in stringency of safety
and environmental laws. Since operational hazards and accidents lead to enormous legal
expenses, inattention to these issues is no longer affordable (Rao, 1996).
Although the above motivational factors have direct economic impacts, efficient
maintenance on its own has economic objectives (Saranga & Knezevic, 2000). Though
the return on investment is highly dependent on the specific industry and the equipment
involved, a survey (Rao, 1996) states that an investment in monitoring of between
$10,000 and $20,000 dollars results in savings of 500,000 dollars a year. Across many
industries, 15-40% of manufacturing costs are typically attributable to maintenance
1
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
11/109
activities. In the current competitive marketplace, maintenance management plays an
increasingly important role in combating competition by reducing equipment downtime
and associated costs and unscheduled disruptions (Abdulnour et al, 1995).
These insights instigated the development of various paradigms like Total
Productive maintenance (Nakajima, 1998) which aims at maximizing equipment
efficiency and Terotechnology (Husband, 1978) which offers a much broader perspective
including the supply (to the system), engineering and market modules of a system. These
paradigms prescribe predictive maintenance over reactive or a simple time-based
maintenance.
Condition Based Maintenance (CBM) has evolved from these above practices and
it aims at continuous monitoring/assessment of the target system and development of a
maintenance strategy based on the assessed condition. CBM offers many advantages over
a traditional time based strategy that typically is modeled around the popular bath-tub
curve depicted in figure 1.1. Time based maintenance tends to be too conservative
resulting in very high maintenance costs. The bath-tub curve fails to acknowledge the
complex interactions between the different components of a system and is especially not
suited to discrete manufacturing systems with frequent changes in work content and
schedule. CBM on the other hand is highly generic and can be used to generate efficient
maintenance strategies.
CBM being a proactive process requires the development of a predictive model
that can trigger the alarm for maintenance. In many instances this model could be loosely
based on analytical criteria developed on the signals collected from the system. In a much
more sophisticated form it would necessitate the development of prognostic and
2
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
12/109
diagnostic models that can predict the future state of a system besides diagnosing the
current state. Such models can be developed using the process data, its history and
several other factors like future schedule via the modeling techniques belonging to the
parametric or non-parametric literature. These models have to be precise and robust
besides possessing some form of autonomous modeling capabilities.
Figure1.1. Bathtub curve depicting failure rate of equipment (Stamatis, 1995)
In this research the focus is on developing soft computing techniques with the
above qualities. The objective also included the development of a humane modeling
system that can assume the role of a decision making aid in the CBM arena. The
motivation was that such systems can be subjected to continuous improvement (or plain
modification) by interacting with the users of such systems. Neuro-Fuzzy models were
found to posses these qualities and hence the research aim is to develop robust yet lucid
proactive models under the neuro-fuzzy constructs. The following constitute the specific
objectives and deliverables of this research.
3
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
13/109
1. Development of a generic architecture or algorithmic approach to Condition
Based Maintenance systems in a manufacturing setup.
2. Development of an accurate, robust and adaptive algorithm that can create
transparent models.
3. Development of suitable evaluation and validation criteria for these models.
4. Development of an easy to use software system that incorporates the above
algorithms and can function as an aid to the decision making process in
maintenance scenarios.
5. Demonstration of the utility of this software system (and algorithms) by applying
it to maintenance scenarios.
4
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
14/109
Chapter II
Literature Review
System maintenance in a multitude of forms has received due focus ever since
mass manufacturing has been adopted. Several principles and several tools ranging from
expensive hardware to smart software systems have been created in order to establish,
automate and execute the different tasks involved in this domain. This section reviews the
various maintenance paradigms and then delves into the different tools and techniques
used to generate maintenance solutions.
2.1. System Maintenance Paradigms
System maintenance over the years has significantly evolved in terms of
governing philosophy, implementation, technology, analytical techniques and objectives.
This evolution has an interesting chronological perspective as elaborated by Kinclaid
(Kinclaid, 1987). A brief taxonomy of the various philosophies is given in figure 2.1.
Maintenance philosophies can be broadly classified as reactive and proactive.
Reactive orUnplannedmaintenance is a legacy practice where maintenance is done only
after the manifestation of the defect, breakdown or stoppage. It is appropriate in facilities
where the installed machinery is minimal and the plant is not totally dependent on the
reliability of any individual machine (Jones, 1995). It might also be appropriate when the
failure rate is minimal and failure does not result in serious cost setbacks or safety
consequence. Breakdown or Corrective maintenance and Emergency maintenance
belongs to this category.
5
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
15/109
a) Corrective maintenance is defined as the activity carried out after a failure has
occurred and is intended to restore an item to a state in which it can perform its
required function (Knezevic, 1987; Saranga & Knezevic, 2000; Goplan & Kumar,
1995).
b) Emergency maintenance is defined as the maintenance activity that is necessary to
accomplish immediately to avoid serious consequences. Constraints are applied
on the frequency of maintenance with the object of cost-wise optimization. These
constraints are defined in terms of the immediacy of the required action and the
possible repercussions of non-maintenance.
Figure2.1. Taxonomy of maintenance philosophies (Kothamasu et al, 2004)
Proactive or Planned maintenance on the other hand executes the necessary tasks
prior to any breakdown and it can be further classified as preventive and predictive
maintenance based on the form of maintenance schedule. In many situations, better
utilization of resources is seen compared to reactive strategies (Mobley, 1990).
6
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
16/109
Preventive maintenance is the strategy organized to perform maintenance at
predetermined intervals to reduce the probability of failure or performance degradation. It
can be classified into constant interval, age based or imperfect maintenance.
a) Constant interval maintenance: As the name suggests it is done at fixed intervals
(in addition to any maintenance prompted by failure). Intervals are selected to
balance high risk of failure with long intervals and high preventive maintenance
costs with short intervals (Jardine, 1987).
b) Age based maintenance: In this strategy, preventive maintenance at fixed intervals
is carried out only after the system has reached a specific age, say t. If the
system fails prior to t, maintenance action is taken and the next maintenance is
scheduled to tunits later. By deferring initiation, this strategy reduces the number
of maintenance intervals compared to constant interval maintenance.
c) Imperfect maintenance: In the above two schemes, the system is assumed to be
restored to its original condition after a preventive maintenance. However it may
be the case that the condition of the system is in between good (original) and bad
(failure). This is the premise of imperfect maintenance strategies which take into
consideration the uncertainty of the current state of the equipment while
scheduling future activities.
The predetermined interval is estimated from the failure rate distribution that is
constructed from historical data extracted from the system or provided by the supplier of
individual components in the system. The estimation of distribution and the interval
determination are extensively researched in (Rao, 1992).
7
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
17/109
Predictive and preventive maintenance differ in the scheduling of maintenance. In
the latter it is performed on a fixed schedule whereas in the former it is adaptively
determined.Predictive maintenance can be classified into Condition Based Maintenance
andReliability Centered Maintenance.
a) Condition Based Maintenance (CBM): This is a decision making strategy where
the decision to perform maintenance is reached by observing the condition of
the system and/or its components. The condition of a system is quantified by
parameters that are continuously monitored and are system or application specific.
For instance, in the case of rotary systems a vibration characteristic or index is an
appropriate choice. The advantage of this approach is immediately apparent as the
decision is made on depictive and corroborative data that actually reflects the state
of the system. It is highly presumptive to assume that the state of a system would
always follow the same operational curve, which is the underlying assumption in
preventive maintenance. In an industrial or production environment, the system is
exposed to random disturbances, which cause deviations in the operational
characteristics. Hence it is highly justified to monitor the condition of system and
base the maintenance decision on the state of the system.
Some of the advantages of CBM are prior warning of impending failure and
increased precision in failure prediction. It is also aids in diagnostic procedures as
it is relatively easy to associate the failure to specific components through the
monitored parameters. It also can be linked to adaptive control thus facilitating
process optimization. The disadvantage, of course, is the necessity to install and
8
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
18/109
use monitoring equipment and to develop some level of modeling or decision-
making strategy.
b) Reliability Centered Maintenance (RCM): This approach utilizes reliability
estimates of the system to formulate a cost-effective schedule for maintenance
RCM was originally developed in the aircraft industry. For aircraft and other
safety-related applications, cost-effectiveness is balanced with safety and
availability with the goal of minimizing costs and downtime but eliminating the
chance of a failure (Moss, 1985). RCM is a union of two tasks, one of which is to
analyze and categorize failure modes based on the effects of the failure on the
system and the other is to assess the impact of maintenance schedules on
reliability. The failure analysis starts with the identification of all the failure
modes and proceeds with categorization of these failure modes based on the
consequences of each failure. The results of this study comprise a Failure Modes
and Effects Analysis (FMEA).
Usually the consequences of failure are Operational, Environmental/Safety or
Economic (Rao, 1996). Once the effects have been identified, the decision logic
algorithms prioritize the effects. These algorithms tend to be industry specific as
the constraints and requirements of each industry vary considerably.
Though RCM-based maintenance intervals were determined similarly to planned
or scheduled maintenance, condition monitoring techniques are increasingly being used
to determine the optimum interval (Kumar & Granholm, 1990; Sandtorv 1991). Hence
though originally a preventive maintenance technique, RCM is graduating into predictive
9
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
19/109
maintenance. A good introduction to RCM is given in (Moubray, 1997; Wireman, 1998;
Monderres, 1993; Jones, 1995).
2.2. System Maintenance - Tools and Techniques
Maintaining the health of a system is a complex task that requires in-depth
analysis of the target system, principles involved, their applicability and the
implementation strategies. Table 2.1 below lists methods, analysis and modeling tools,
and techniques (for data/condition extraction).
Table2.1. Maintenance tools and techniques
Methods ToolsMeasurementTechniques
Reliability basedmaintenance
Parameter estimation techniques Numerical analysis techniques Markov chains
Model Based FDI
State space parameter estimation Artificial neural networks Knowledge based systems Fuzzy inference systems Neuro-Fuzzy systems
Signal Based FDI
Fourier analysis Wavelet analysis Wigner-Ville analysis Diagnostic parameter analysis
Statistical FDI /Maintenance
Bayesian estimation / reasoningtechniques
Markov chains Hidden Markov models Proportional Hazards models
Vibration analysis Thermography Acoustic emission
Wear/debrismonitoring
Lubricant analysis Process
measurements
However, it has to be noted that most applications are a combination of the listed
methods and techniques (tools) and the list is far from exhaustive. For instance because
of their generalized applicability, parameter estimation techniques such as regression,
maximum likelihoodand expectation maximization can be used in all the listed categories.
10
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
20/109
There is also a close association between reliability based maintenance and statistical
maintenance techniques. A high level explanation of these methods is given in this
section.
2.2.1. Reliability Based Maintenance
A popular approach to the maintenance of complex systems is through estimating
the reliability of the system. Traditionally, reliability is estimated from the time-to-failure
distributions of the system. The most striking drawback of such an approach is that
multiple failure mechanisms often interact with each other in perhaps unknown ways and
this affects the degradation rate of the system, causing it to deviate considerably from the
predicted failure distribution. An alternative approach much similar to condition based
maintenance has been proposed by Knezevic (1987) known as the Relevant Condition
Parameter (RCP) based approach. This approach is based on identifying RCPs (defined
for a process) that quantify or reflect a particular failure mechanism. Using these RCPs
the reliability of a system is defined as the probability that RCP lies within prescribed
limits as given in (2.1).
)1.2()RCP)t(RCPRCP(P)t(R limkin
inRCP is the initial state of the system and is the limiting value where the
system inevitably fails. When the failure mechanisms are dependent, it is possible to
model the system using Markov chains as shown in (Saranga & Knezevic, 2000). Once
the Markov chain is formulated representing the different states of the system, the
probability of the system being in the upstate A(t) can be calculated as a sequence of
integrals of the form given in (2.2) (Gopalan & Kumar 1995).
limRCP
11
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
21/109
)2.2()u(g)u(A)xt(w)t(A
t
0
These integrals are further solved by using quadrature techniques such as the trapezoidal
approximation.
2.2.2. Model Based Approach to FDI
The model based approach to FDIis based on analytical redundancy or functional
redundancy, meaning dissimilar signals are compared and evaluated to identify the
existing faults in the system or its components. This comparison is between the measured
signal and the estimated values generated by the mathematical model of the system.
Figure 2.2 gives a general structure of model-based approaches.
Residual generation is the most important element of a model-based approach and
the techniques involved in model based diagnosis differ in the generation and definition
of a residual. For instance in some cases it is the discrepancy of output (from the system)
estimation and in some cases it is the deviation of the systems parameters from their
expected values. It is imperative that the generated residual be dependent only on the
faults in the system and not on its operating state. Several techniques that have been
proposed in the literature for this residual generation are a modification or improvement
of the following three principles.
Observer-Based approaches (Beard, 1971; Ding & Frank, 1990; Patton & Chen,
1997; Wilsky 1976).
Parameter estimation technique (Kiramura, 1980; Isermann, 1993).
Parity space approach (Chow & Wilsky, 1984; Deckert et al 1977).
12
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
22/109
Figure2.2. Flow of model based approaches (Simani et al, 2003)
Observer-based approaches rely on estimating the outputs from either
Luenberger observers or Kalman filters (Simani et al, 2003). The approach is centered on
the idea that the state estimation error is zero in a fault free environment and it is not so
otherwise. Dedicated Observer, Fault Detection Filters and Output Observers are the
three important subcategories that fall under this approach.
Parameter estimation techniques analyze the failure mechanisms based on their
influence on the system parameters (of the model). Hence this approach is centered on
generating online estimates of the parameters and analyzing the changes in the estimates.
In the Equation Error methods which analyze the parameters directly, least square
estimation is quite often used; in the Output Errormethods which compute the error in
the output numerical optimization techniques are often used.
Parity Space Relations check for parity of the measurements from the process,
generating a residual by comparing the model and the process behavior. This approach
13
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
23/109
has been shown to be in close correlation with the observer-based techniques (Patton &
Chen, 1994).
As stated before the model-based FDI approaches are based on identifying
(constructing) models that mimic the system. These models have to be extremely robust
to real world nuances such as noise, etc. and to be effective the model based FDI should
learn to differentiate between these uncertainties and the changes due to failures. Another
challenge is to identify not just the existing faults but the incipient faults which may not
(yet) significantly affect the system.
2.2.3. Signal Based FDI
Signal-based FDI approaches focus on detecting the changes or variations in a
signal and subsequently diagnosing (identifying) the change. Change detection in a
system has been extensively explored in the literature and there are quite a few effective
techniques that have integrated various ideas from parametric modeling principles (in
statistics) with signal-based principles such as spectral analysis. A good summary is
given in (Basseville, 1988). Some of the techniques are formulated around model-based
approaches, i.e., generation of residuals (deviation from nominal signals) and diagnosis
of the residuals. Some of the detection algorithms are modeled in the form of hypothesis
testing involving a change (or jump) in the mean (known or unknown) such as the
Generalized Likelihood ratio test and the Page-Hinkley stopping rule. Some real time
algorithms are based on computing distance measures between local and global models
(differentiated based on their time windows) and some popular measures are the
Euclidean distance between AR (Auto Regressive) coefficients, Cepstral distance,
Chernoff distance etc.
14
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
24/109
In recent years non-stationary signals are modeled using wavelets instead of
Fourier transforms because wavelets are scale and time variant. Two of the important
uses of wavelets to FDI are data compression and feature extraction (Staszewski, 1998).
Data Compression as the name suggests refers to encoding the data (like a vibration
signal) in a compressed form and feature selection on the other hand is identifying
features within these encoded signals that would help identify the faults in the monitored
systems. Once wavelet transform is applied to the signal output, the coefficients are
analyzed for any variation from the normal signal. The identification of coefficients that
would substantiate a failure is a painstaking procedure though recently some techniques
such as genetic algorithms are being employed. These wavelets are predominantly used
for FDI in gears, as vibration analysis is quite effective for these domains (McFadden,
1994; Staszewski & Tomlinson, 1994).
Time-frequency analysis using Wigner-Ville Distribution (WVD) has proven to
be another effective tool for vibration analysis. It has proven to be quite effective in
situations where neither the time domain nor frequency domains can produce significant
patterns (Staszewski et al, 1997). The contour plots generated by WVD are visually
inspected for the failure features that indicate its progression and existence. Often these
plots are analyzed with the help of classification algorithms ranging from parametric
(statistical) to soft computing (neural networks, fuzzy inference systems).
Detection signaltechniques are also used for FDI, where a detection signal is used
as an input to the system for a specific period of time and the diagnosis is based on the
behavior of the system during this period. Some interesting theories in the design and
implementation of detection signals are given in (Nikoukhah et al, 2000; Zhang 1989;
15
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
25/109
Kerestecioglu 1993; Kerestecioglu & Zarrop, 1994; Uosaki et al, 1984; Nikoukhah,
1998).
2.2.4. Statistical FDI / Maintenance
A vast number of applications also use Bayesian statistics and Bayesian parameter
estimation for FDI (Berec, 1998; Won & Modarres, 1998; Wu et al, 2001; Leung &
Ramanougli, 2000; Ray et al, 2001). Another important aspect is to identify the detection
(inspection) intervals, optimization of cost and replacement decision-making. Markov
chains seem to be increasingly used for optimizing maintenance strategies and some
algorithms are given in (Wang & Shueng, 2003; Hassan et al, 2002; Hassan et al, 2000;
Zhang & Zhao, 1999). Another interesting application is given by Bunks et al (2000)
using hidden Markov models.
Proportional Hazards Modeling (PHM) has also been used for reliability
estimation and estimation of effects on failure rate ever since they were used by Feigl and
Zelen (1965). Some interesting theories and applications related using PHM are reported
in (Jardine 1987; Kobaccy et al, 1997; Pena & Hollander, 1995).
16
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
26/109
Chapter III
Intelligent Condition Based Maintenance (ICBM) -
Conceptual Development
It is evident from the current state of the art in system maintenance applications that
Condition Base Maintenance (CBM) is highly generalized and efficient paradigm to
generate maintenance solutions. It is possible to create an architecture that can be used to
generate maintenance solutions using this paradigm. It also possible to achieve a
seamless integration of model based FDI algorithms with this architecture in order to
create decision tools that aid users in their maintenance applications. This section deals
with the creation of this architecture and also integration with model based approach.
Intelligent Condition Based Maintenance (ICBM) as such is the application of model
based FDI using this generic architecture.
3.1. ICBM Architecture Development
The ICBM architecture specifies a methodical approach to model building for
maintenance applications. It specifies the various elements of model building from data
acquisition to model deployment. This architecture is depicted in figure 3.1.
The following are the various modules incorporated into the architecture.
1. Data Acquisition
2. Data Conditioning and Feature Extraction
3. Model Generation
4. Model Deployment
17
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
27/109
Figure3.1. ICBM Architecture
Data Acquisition as the name suggests is the process of acquiring data from the
target system and its environment. This data includes information extracted from the
process, sensors monitoring the process and the environment. These are the typical forms
of data that go into the model building process.
Data Conditioningis an essential element as raw data is seldom useful in its own
form especially if it comes from a sensor used in vibration or acoustic emission
application. Data conditioning comprises of all the pre-processing typical to model
generation process in any domain.Feature extraction on the other hand can be thought of
as the extraction of useful information from the conditioned data which reflects the
condition of the target system. It is not essential for the features to directly correlate to
the condition and they could even be a quantification of documented or known effects of
18
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
28/109
the failures of interest. Several features can be extracted from the domains of signal
processing, time series analysis and diagnostic analysis. Some of these features are
documented in Appendix I.
Model Generation refers to the actual process of building a predictive model from
extracted data. This model would take the extracted features as inputs and give out
relevant outputs based on whether it is a prognostic or diagnostic application. A
diagnostic model would give information on the existence of any failure along with its
type. A prognostic model would give information on the expected state of system as
reflected by the conditionparameters (features) or some other specified output typical of
the application. For instance it is typical to use indicators such as RMS (Root Mean
Square) and Kurtosis in bearing wear applications. So in such an application, the
diagnostic model would detect bearing failures while prognostic model could estimate the
future RMS or Kurtosis values. The inputs to the model generation procedure can be
categorized as below.
Health Condition Indicator (as depicted by features)
Process Condition Indicator (as depicted by the process data)
Sampling/Prediction Indicator (as specified in the feature extraction process)
Model Robustness Indicator, although not an input in the conventional sense, it can
be used to trigger the necessity to retrain / improve / extend an existing model which
is very essential in real time application
As is depicted in the architecture, the domain expertplays a significant role in the
process of feature extraction and some parts of the model building process. The experts
knowledge can efficiently used to identify the right set of features as well as generate the
19
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
29/109
necessary knowledge to create the models. This approach is consistent with the objective
to create a decision aid with certain amount of autonomy.
Model Deploymentrefers to the process of integrating the created model into the
monitoring system as well as establishing the proper channels of communication with the
various business functions like maintenance, production planning & control, quality
control etc. The two outputs of model generation process namely the modelitself and the
process knowledge are to be systematically integrated with these business functions.
Although all the modules pose interesting challenges this research effort is
concentrated on the model generation process and the various algorithms directly related
to the model generation process like its validation and evaluation.
3.2. ICBM Modeling Paradigm
As mentioned in the previous section, this research concentrates on the model
generation process and in this section we identify the modeling paradigm. As can be seen
from the architecture, the ICBM models belong to two classes of learning problems
function approximation and classification. Diagnosis is essentially a classification
problem while prognosis is akin to function approximation.
These two classes of learning problems have been extensively studied for long
time in both the parametric and non-parametric arena and there are a multitude of
paradigms that address the issues involved. We start by listing the desirable
characteristics of an ICBM modeling algorithm.
Adaptive: The algorithm should be adaptable as it functions in a highly dynamic and
non-linear environment.
20
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
30/109
Flexible: The algorithm should be as generic as the architecture and should be a
universal approximator. It should also be flexible enough to incorporate various
forms of knowledge data, heuristic and analytical as provided by the domain
expert.
Lucid: The algorithm should be able to create models that are highly transparent.
This is essential as the model has to act as a decision aid and should also generate
useful and intelligible knowledge about the process. This is also essential as the
domain expert is tightly integrated with the model building process.
Robust: The algorithm should be able to create models that are robust to handle the
demands of a real time algorithm such as noise handling capabilities and
dimensionality.
As mentioned previously several algorithms exist in the literature of the two
classes of learning problems. However it is necessary to identify the algorithm (or its
class) that possesses the desirable characteristics outlined above.
Parametric estimation methods are in general highly robust and theoretically can
estimate any system to the required accuracy. However they require assumptions on the
distribution of some modeling elements for instance the data or the error. Several
methods such as transformation techniques do exist but they are not amenable to
automation. As stated in the statistical learning theory, it is also not advise-able to
approach to a solution via solving a harder problem such as density estimation
(Cherkassky & Mulier, 1998).
Non-Parametric estimation methods such as neural networks do not require any
such assumptions and are capable of approximating any domain of problems. They are
21
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
31/109
also equipped with autonomous learning algorithms that can automatically retrain or
regenerate maintenance solution if necessitated. However, they tend to be quite opaque
(black box) and it is not often possible to generate any qualitative knowledge about the
approximated system. This is major hindrance to establishment of any form of knowledge
transfer between the domain expert and the approximating system.
Soft computing algorithms also include Fuzzy Inference Systems (FIS) that can
be efficiently used as a bridge between the domain expert and the ICBM architecture.
These systems work on knowledge bases that are in easily comprehensible IFTHEN
format. However, this particular class of algorithms do not possess any form of
automated learning, hence require considerable amount of manual tuning in the
generation of the solution.
Neuro-Fuzzy algorithms are an assimilation of these two forms of approximating
algorithms and are able to annul the disadvantages of the respective parts. These
algorithms are particularly adaptive, lucid and highly flexible. As they are essentially
fuzzy inference systems embedded into a neural network they are also robust. It is also
easy for a domain expert to interact with these algorithms. Since the knowledge is both in
a functional form (network) and generalized form (rule base), it is possible to integrate
with the other business functions mentioned above.
Based on the above discussion it is evident that neuro-fuzzy systems possess the
necessary qualities listed in the beginning of this section. In the following sections we
develop a neuro-fuzzy algorithm that meets the requirements.
22
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
32/109
Chapter IV
Intelligent Condition Based Maintenance - Model
Development and Validation
As observed in the previous section, neuro-fuzzy systems are ideal candidates to
fulfill ICBM objectives. Neuro-fuzzy systems have also been used for control system and
FDI application successfully (Simani et al, 2003). However it has to be noted that these
applications fulfill or aim only at function approximation and this actually beats the
purpose of using neuro-fuzzy tools.
Adaptive Neuro Fuzzy Inference System (ANFIS) (Jang, 1993) and Hybrid Fuzzy
Inference System (HyFIS) (Kim & Kasabov, 1999) are the two most popular neuro-fuzzy
connectionist systems that simulate a Sugeno and aMamdani type FIS respectively. Both
the algorithms have been validated on various datasets and were shown to possess good
accuracy. However, they are not without their drawbacks in the ICBM context as
elucidated below.
Consider a domain described by a function ( )21,xxfy =
A mamdani type FIS in this domain would consists of rules of the form given below:
IF x1 is low AND x2 is medium THEN y is high
Where low, medium and high are linguistic terms with functional forms like gaussian,
sigmoid etc also known as membership functions.
A sugeno type FIS in this domain would consist of rules of the form
IF x1 is low AND x2 is med THEN ( )211 ,xxfy =
23
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
33/109
where low and medium are linguistic terms with functional context. The difference
between the two FIS is the form of consequents. In mamdani type FIS the output
membership function can be defined independent of the premise parameters while in
sugeno type FIS each output membership function is a function of the inputs.
ANFIS mimics a sugeno type FIS and from the above it can be seen that it is
efficient for function approximation problems and is not particular useful in classification
applications. Hence it is not appropriate for diagnosis applications and the knowledge
(rules) it extracts would be abstract for a domain expert as they are not entirely in a
linguistic format.
HyFIS on the other hand simulates a mamdani type FIS which is universally
applicable and hence can be used for prognosis as well as diagnosis applications.
However, it uses a defuzzification (process of generating crisp outputs from fuzzy
outputs) strategy that restricts the output membership functions to assume a gaussian
functional form (with centre and variance parameters).Although this does not hamper its
ability to generate maintenance solutions, it is not possible for a domain expert to interact
with the model in all situations (for instance, when output membership functions are non
gaussian)
The aforementioned reasons have provided the motivation to formulate an easily
comprehensible neuro-fuzzy system. The next section elaborates such a neuro-fuzzy
system including its architecture and learning process.
4.1. Adaptive Mamdani Fuzzy Model (AMFM)
Adaptive Mamdani Fuzzy Model (AMFM) is a neuro-fuzzy algorithm that
simulates the mamdani type fuzzy inference system. The development of a diagnosis or
24
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
34/109
prognosis model using AMFM consists of two main tasks Architecture Initialization
andRule Tuning.
4.1.1. Architecture Initialization
Typically a neuro-fuzzy system consists of five layers of neurons with each layer
(subsequent to the input layer) performing the following four operations of a fuzzy
inference system.
Fuzzification
Implication
Aggregation
Defuzzification
Most of the neuro-fuzzy systems differ in the final layer which corresponds to the
defuzzification operation. Centre of Gravity (COG) is the most popular defuzzification
operation and since its computation is analytically intractable, the connectionist systems
approximate the COG with an easily computable function. In AMFM, the COG is
approximated with the COG of the maximum polygonal area contained within the fuzzy
output area as shown in figure 4.1.
25
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
35/109
Figure4.1. Polygonal approximation to COG
A typical AMFM structure would be as depicted in figure 4.2. The first layer
represents the input parameters. LetNbe the number of input parameters, then the first
layer will haveNnodes. LetMn denote the number of linguistic terms of input parameter
n, n = 1, 2 N. Then the total number of nodes in the second layer,I, will be . A
node n in the first layer is connected to onlyM
=
N
n
nM1
n nodes in the second layer that represents
its corresponding linguistic terms. It merely passes the input value xn to the connected
second layer nodes. A node i in the second layer has a Gaussian activation
function
2
)2(
)2()2(
2
1
)2(
= iii cx
i ey
. The center c and spread can be initialized using the
mean and standard deviation of the input parameter values within the cluster or interval
that represents the particular linguistic term.
)2(
i
)2(
i
The number of third layer nodesJ, equals to the number of rules. Each node, with
its connections from the preceding nodes, represents a rule. Note that different nodes in
26
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
36/109
this layer might represent the same concept (linguistic term of an output parameter). For
example, in figure 4.2, say the input parameterx1 has three linguistic terms, small,
medium, and large, which corresponds to the first three nodes in the second layer. The
input parameterxn has two linguistic terms, small and large, which corresponds to the last
two nodes in the second layer. The first two nodes in the third parameter both represent
the concept outputy1 is small. Then the first and second nodes in the third layer and
their connections from the second layer nodes represent the rules IFx1 is small andxn is
small THENy1 is small and IF x1 is medium andxn is large THEN y1 is small. The
activation function of third layer nodes is the minimum operation. For instance in figure
4.2, .},min{ )2( 1)2(
1
)3(
1 = Iyyy
The fourth layer nodes represent the output linguistic terms. Unlike the third
layer, each node represents a distinct concept. Therefore, the number of nodes,K, equals
to the total number of output parameter linguistic terms. Each node is connected to the
preceding layer nodes that represent the same concept. Its activation function is the
maximum operation. For example, in figure 4.2, the first node in the fourth layer
represents the concept outputy1 is small. It is connected to the first two nodes in the
third layer, which represent the same concept. We have . Each
fourth layer node also maintains a Gaussian membership function with center c and
spread (k = 1, 2 K), which are initialized in the same way as those of input
linguistic terms. Note that this Gaussian function is not an activation function. It is
transmitted to a fifth layer node for defuzzification.
},max{ )3(2)3(
1)4(
1 yyy =
)4(
k
)4(
k
The fifth layer represents the output parameters. Let L be the number of output
parameters, then the fifth layer will haveL nodes. Let Ol denote the number of linguistic
27
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
37/109
terms of output parameter l, l = 1, 2 L. A node l in the fifth layer will have Ol
incoming connections from forth layer nodes that correspond to its linguistic terms. For
example, in figure 4.2, the output parametery1 has two linguistic terms, namely, small
and large, which are represented by the first and the second nodes in the forth layer,
respectively. Therefore, these two forth layer nodes are connected to the first node is the
fifth layer, which represents the output parametery1. A node in the fifth layer performs
defuzzification using a modified centroid of area method. Specifically, instead of
considering the entire area under the Gaussian curves when calculating the gravity center,
we consider only the rectangular part of the area.
Figure4.2. Adaptive Mamdani Fuzzy Model (AMFM)
4.1.2. Rule Tuning
The tunable parameters in AMFM are the centers and spreads of the Gaussian
membership functions. These include those for the input parameters, namely, and
(i = 1, 2, , I), in the second layer; and those for the output parameters, namely,
and (k= 1, 2, , K), in the forth layer. The tuning process is based on error
)2(
ic
)2(
i
)4(
kc)4(
k
28
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
38/109
backpropagation and gradient descent search. For a particular input vector [x1x2
]NxT, let the desired output vector be [d1d2 ]Ld
T, and the AMFM output vector be [y1
y2 ]LyT. Then, the error can be calculated as in (4.1)
'k
y
y
)4(
'k
)4(
'k
E)4(
k
=
ln2
(
k
c
(
k
( ) )1.4(yd2
1E
L
1l
2
ll=
The error signal at the fifth layer can be calculated as in (4.2).
( ) )2.4(ydy
Ell
l
)5(
l
Using the chain rule, we can calculate the error signal for and , when the kth
node in the fourth layer is connected to the lth node in the fifth layer, as in (4.3) and (4.4).
)4(
kc)4(
k
)3.4(yln2y
yln2y-
c
y
y
E
c
E
)l(
)4(
'k
)4(
'k
)4(
'k
)4(
k
)4(
k
)4(
k)5(
l)4(
k
l
l
)4(
k
=
=
)4.4(yln2
yln2cyln2y
-y
y
E
)l('k
)4(
'k
)4(
'k
)l('k
)4(
'k
)4(
'k
)4(
'k
)4(
'k
)l('k
)4(
'k
)4(
'k
)5(
l)4(
k
l
l
In which )4()4()4( ln2 kkk ycy = ,)4()4(
kk yy = and )(l denotes the set of
fourth layer nodes that are connected to lth node in the fifth layer.
Now, c and can be updated as in (4.5) and (4.6), where, is a positive constant
(the learning rate).
)4 )4(
k
)5.4(yc
Ec )4(k)4(
k
)4(
k
)4(
k
)6.4(yE )4(
k)4(
k
)4)4(
k
29
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
39/109
To adjust c and , the error signals need to be propagated backward to the second
layer. The error signal at the forth layer is calculated as in (4.7).
)2(
i
)2(
i
)7.4(
y
)1
(cy)1
(cy
y
y
y
y
y
E
y
E
2
)l(k
'k
)4(
'k
)4(
'k
k
k
)4(
k
)l(k
'k
)4(
'k
)4(
'k
)4(
'k
k
k
)4(
k
)4(
k
)l(k
'k
)4(
'k
)4(
'k
)4(k
)5(
l)5(
l)4(k
)5(
l
)5(l
)4(k
)4(
k
'
''
=
=
=
in which )4(ln2 kk y= .
The error signal at the third layer is calculated as in (4.8)
)8.4()k(jotherwise,0
yyif, )4(k)3(
j
)4(
k)3(
j =
in which denotes the set of third layer nodes that are connected to kth node in the
forth layer.
)(k
The error signal at the second layer is calculated as in (4.9).
)9.4()j(iotherwise,0
yyif, )3(j)2(
i
)3(
j)2(
i =
in which denotes the set of second layer nodes that are connected tojth node in the
third layer.
)( j
Now we can calculate the error signal forc and as in (4.10) and (4.11).)2(i)2(
i
( )( ) )10.4(
cxy
c
y
y
E
c
E2)2(
i
)2(
i
)2(
i)2(
i
)2(
i)2(
i
)2(
i
)2(
i
)2(
i
=
( )( ) )11.4(
cxy
y
y
EE3)2(
i
2)2(
i
)2(
i)2(
i
)2(
i)2(
i
)2(
i
)2(
i
)2(
i
=
30
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
40/109
Hence, and can be updated as in (4.12) and (4.13))2(ic)2(
i
)12.4(xc
Ecc )2(i)2(
i
)2(
i
)2(
i
)13.4(xE )2(
i)2(
i
)2(
i
)2(
i
4.2. Evaluation Criteria
The previous sections detailed the AMFM algorithm that can be used to associate
the inputs with the outputs. However, just like any other approximation algorithm, it is
quite essential to validate the developed model. The validation is done at two levels
precision of the model and the legibility of the model. In the following sections these
aspects of model validation are explored.
4.2.1. Validation of Model Precision
As mentioned previously, model evaluation is done with the goal of selecting the
best available model for the given dataset. Traditionally criteria like SSE, MSE, MAP,
R2, R2(adj), PRESS are used to validate a model. The usual approach is to split the
available data into learningand validation sets [58]. The algorithms are supplied with the
learning sets to create the model and are later validated with the above-mentioned criteria
on the validation dataset. The definition of the above parameters is given in table 4.1
(Kothamasu et al, 2004).
The quality of the model is in inverse proportion to the magnitude of the first
three criteria and also to the deviation of the last two criteria from 1. However it has to
be mentioned that these criteria are not appropriate for model selection in all situations.
31
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
41/109
For instance, R2
should not be used for comparison of modeling algorithms that do no
satisfy the criteria that and0ei = 0ey ii = where ei are the corresponding residuals.
This is especially true in the case of neuro-fuzzy modeling where situations with R2
greater than 1 are often encountered.
Table4.1. Definition of the traditional criteria used in model evaluation
Criteria Definition
SSE (Sum of Squared Error) =
n
1i
2ii )yy(
MSE (Mean Squared Error) =
n
1i
2ii )yy(1n
1
MAP (Mean Absolute Percenterror)
=
n
1i
iii |y/)yy(|n
100
R2
=
=
n
i
i
n
i
i
yy
yy
1
2
1
2
)(
)(
R2
(adj)
=
1
12
)(
nSST
knSSE
Radj
parametersof#k
Patternsof#n
OutputsPredictedy
OutputsActualy
i
i
Apart from the above mentioned restrictions these criteria do not explicitly take
into account the underlying dimensionality of the model (except R2(adj)) and the
complexity of data into account. Multiple Comparison Procedures (MCP) is another
category of model evaluation techniques that are often used to compare a set of possible
models to the given data. Such tests include McNemars test, a test for difference of error
proportions, resampled paired t test, k-fold cross validated paired t test and 5x2cv paired t
32
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
42/109
test (Diettrich, 1997). The basic concept of these tests is to check for significant
difference in error (or its proportions) from the various models developed. Since the usual
practice is to check for this difference among the error vectors from the same dataset,
care must be taken to compensate for correlation. Secondly, the multiplicity effect that
arises out of simultaneous pair wise comparisons between the models should also be
taken into consideration (because of increased chances of Type I error).
4.2.1.1. Function Approximation
The Hochberg and Tamhane based on the studentized maximum modulus
distribution is appropriate for the function approximation problems. Dunn (1961)
proposed a test based on studentized t distribution that can reveal any significant
differences between error proportions (well suited for classification problems). An
excellent summary of both these tests is given in (Feelers & Verkooijen, 1996).
A third evaluation strategy is to construct a form ofpenalization criteria that
enhances the empirical risk with a term that disfavors complex models (Domingos,
1999). There are several penalization forms and AIC (Akaike Information Criterion) as
defined in (4.14) is one of them (Ishikawa, 1996).
)14.4(k2)log(nlAIC 2 Where, k is the number of independent estimated parameter, l is the number of output
units and is the maximum likelihood estimate of the mean square error.2
In order to check the performance of these criteria an approach similar to the one
proposed by Lawrence et al (1997) is used. A randomly initialized teacher network is
used to extract the training and testing data and networks of varying complexities called
student networks are then trained on the learning dataset and validated with the testing
33
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
43/109
data. In this case study a neural network consisting of 3 hidden neurons is used as the
teacher network with 2 inputs and one output. The data was split into learning and
validation sets comprising of 140 and 60 patterns respectively.
Networks of varying size are trained with the learning data for 500 epochs.
Various evaluation criteria along with the Hochberg and Tamhane confidence intervals
(at 95% level) are given in Tables 4.2 and 4.3 respectively.
Table4.2. Evaluation criteria for the function approximation problem
Hidden Neurons MSE R2
R2(adj) AIC
2 0.006471215 0.417 0.3256 -285.4319
3 0.000054294 0.7061 0.631 -564.27485 0.000040264 0.8028 0.7017 -566.2111
10 0.000220444 0.8472 0.5254 -424.2004
15 0.000046811 0.8769 8.2639 -477.1716
20 0.004963404 0.8229 1.4976 -157.3482
From table 4.2 we cannot conclusively select a model because of varying
indications from the different criteria, although AIC points to the model with 5 hidden
neurons which is the closest to the original model (3 hidden neurons). From table 4.3 it is
evident that the Hochberg & Tamhane test concludes that all models are equally good.
Table4.3. Confidence intervals from pair-wise Hochberg and Tamhane test
Model 1 2 3 4 5 6
1 - [-2.77 2.79] [-2.77 2.79] [-2.77 2.78] [-2.77 2.79] [-2.78 2.78]
2 - - [-2.78 2.78] [-2.78 2.77] [-2.78 2.78] [-2.78 2.77]
3 - - - [-2.7802.77] [-2.78 2.78] [-2.78 2.77]
4 - - - - [-2.78 2.78] [-2.78 2.778]
5 - - - - - [-2.78 2.778]
Though AIC was close to the original model (3 neurons), it cannot be concluded
that it did in fact select a model that best fits the data and it is also not valid to assume
that a 3 or close to 3 hidden neuron network is a good fit for the finite data. (This
34
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
44/109
validates the theory that for finite data that the best fit is not necessarily a model identical
to the true parametric form (Cherkassky V & Mulier F, 1998) and in this case it is in fact
of a higher complexity than the true parametric form.)
To confirm that the 5 hidden neuron model is in fact superior, a generalization test
was performed where noisy inputs were presented to the networks. The (additive) noisy
inputs were generated as )dBW,i(wgn)i(I)i(I += where I is the original input value and
wgn is white gaussian noise with power specified by dBW. It can be seen from table
4.4 and figure 4.3 that the network with 5 neurons outperforms the rest.
Table4.4. MSE values of the models at different noise levels
Hidden Neurons
Noise (dBW) 2 3 5 10 15 20
1 0.0112 0.07 0.0456 0.049 0.0611 0.0367
5 0.0365 0.1191 0.0673 0.095 0.1177 0.0429
10 0.0871 0.2113 0.099 0.2325 0.2257 0.1023
15 0.1864 0.2735 0.1332 0.3805 0.3768 0.1684
20 0.2669 0.4267 0.2189 0.4318 0.8173 0.197
25 0.2106 0.3823 0.1502 0.4577 0.7028 0.1944
30 0.3304 0.4181 0.1871 0.4428 0.8066 0.2133
35 0.3803 0.4053 0.1985 0.4856 0.8117 0.2375
40 0.4897 0.4747 0.2371 0.6171 0.8084 0.3408
45 0.613 0.3692 0.184 0.5839 0.8462 0.2364
50 0.549 0.4485 0.2114 0.5048 0.9228 0.2672
AvgMSE 0.2873 0.3272 0.1575 0.3892 0.5906 0.1852
35
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
45/109
Figure4.3. Box and whisker plot of MSE values
4.2.1.2. Classification
A real world problem in the form ofEColi dataset1
was chosen for analysis of
AIC and the other criteria in the classification arena. Table 4.5 details the composition of
this dataset (8 inputs and 8 classes). The data was normalized to facilitate computation of
AIC and this was done using the technique specified by Mirkin (1996) and shown in 18.
)15.4(
P1
Pvv
v
2
v
v
normalized =
The AIC values and the confidence intervals on the error proportions are given in
Tables 4.5 and 4.6 respectively. From table 4.6, it is evident that the AIC values indicate
an inferior classification capability (positive AIC values) and that the network with 2
hidden neurons is the best of the lot. From Table 4.7, it can be seen that none of the
classifiers have identical classification capabilities (no closed interval containing 0) and
that their performance is in the order 3, 6, 5, 4, 2, 1.
1 Available by anonymous ftp from ftp://ftp.ics.uci.edu/pub/machine-learning-databases/
36
ftp://ftp.ics.uci.edu/pub/machine-learning-databases/ftp://ftp.ics.uci.edu/pub/machine-learning-databases/ -
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
46/109
Table4.5. Classes in the Ecoli dataset
Class ID Class Name Number of patterns
CP Cytoplasm 143
IM Inner membrane without signal sequence 77
PP Periplasm 52
IMU Inner membrane, un-cleavable signal sequence 35
OM Outer membrane 20
OML Outer membrane lipoprotein 5
IML Inner membrane lipoprotein 2
IMS Inner membrane, cleavable signal sequence 2
Table4.6. AIC values of networks developed for classifying the EColi dataset
Hidden Neurons AIC
2 146.5111
3 170.166
5 199.6422
10 299.781
15 406.0171
20 472.2449
37
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
47/109
Table4.7. Confidence Intervals for difference in error
Model 2 3 4 5 6
1 [0.003 0.016] [0.19 0.20] [0.12 0.13] [0.16 0.17] [0.17 0.18]
2 [0.18 0.19] [0.11 0.12] [0.15 0.16] [0.16 0.17]
3 [-0.07 -0.06] [-0.035 -0.02] [-0.02 -0.01]
4 [0.03 0.04] [0.04 0.05]
5 [0.003 0.01]
The test of generalization accomplished by inducing additive white gaussian noise
yielded the error proportions shown in Table 4.8. It is evident from the average error
values and box plot in Figure 4.4 that network 3 is in fact the best and the performance of
the networks is in the order 3, 4, 6, 5, 2, 1 which is close to what is concluded from the
above test. This is also in total contradiction to that indicated by AIC.
Figure4.4. Box and whisker plot of MSE values
38
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
48/109
Table4.8. Error proportions when simulated in noisy environment
Hidden Neurons
Noise (dBW) 2 3 5 10 15 20
1 0.6634 0.6436 0.5545 0.6238 0.5941 0.5446
5 0.6634 0.703 0.5248 0.6634 0.703 0.7129
10 0.8218 0.7921 0.6634 0.6634 0.7921 0.7723
15 0.8515 0.8218 0.6238 0.6832 0.6931 0.802
20 0.8218 0.802 0.703 0.703 0.7921 0.7822
25 0.8614 0.8416 0.6931 0.7624 0.7822 0.8119
30 0.8515 0.8515 0.6931 0.6535 0.802 0.7228
35 0.8416 0.7525 0.7129 0.6634 0.802 0.7426
40 0.8515 0.7723 0.6634 0.6832 0.7525 0.7624
Multiple comparison procedures such as the Hochberg & Tamhane test for
function approximation and studentized t test for classification can point to significant
differences in the approximation capabilities of the models. However these tests do not
take into account the complexity of the models. The Akaike Information Criterion,
designed to take into account the complexity as well as the precision of the model, was
seen to perform extremely well in the function approximation arena while it falters in the
classification domain. Studentized t test yields a better evaluation strategy when
compared to AIC for the classification problems.
39
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
49/109
4.2.2. Validation of Model Legibility
A neuro-fuzzy architecture is a highly transparent model (or representation)
because the rules used for modeling provide qualitative insights of the domain. However
a neuro-fuzzy system often does not result in a good model (from the view point of
legibility) because of the unconstrained gradient search algorithm. The rules that result
from this training often are neither identical nor similar in their ability to mirror the
domain as the rules prior to the training. Although the training phase results in a gain in
the precision, it often is at the expense of the legibility of the rules. Two types of model
deteriorations are explained below (Kothamasu et al, 2004).
Linguistic deterioration: The initial rules are created based on membership
functions that can be described using linguistic variables like small, medium
or large or other appropriate characterization. However it is not possible to
achieve such a characterization after these rules have been tuned because of the
deterioration of the linguistic structure (within each dimension) created prior to
the training or during the discretization phase of rule extraction.
Structural deterioration: The situation is compounded with the fact that the post
training rules often do not effectively describe the system. It is not uncommon
that the rules are often undistinguishable and make sense only from the
approximation point of view and will not be able to explain the created model
thus causing a deterioration of the transparency of the system.
This has grave repercussions in some situations where the model needs to change
over time because of the dynamic nature of the domain. Since the models are not
transparent enough it is not possible to direct this necessary change. However, it is
40
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
50/109
possible to continue to update the model using the backpropagation, but this makes the
NF models equivalent to a neural network and it defeats the original intention and
objective to utilize them as decision aids.
As demonstrated in the previous section, AIC based on Kullback-Leibler (KL)
mean information which measures the distance between two distributions can be
effectively used for model validation. A similar approach can be used to validate or
evaluate the structure of rules based on the KL distance between the membership
functions in each dimension. KL distance is computed as given in (4.16).
)16.4()x()x(log)x(
xd
j
d
id
i
d
j,i Where dji, represents the KL distance between membership functions and in
dimension d. A distance matrix hence can be formulated for each dimension dwhich
represents the qualitative distance of each membership function from the rest as given in
(4.17).
di
dj
d
)17.4(
..........
.........
d
N,N
d
1,N
d
N,i
d
i,i
d
1,i
d
N,1
d
1,1
d
ddd
d
d
=
Where, is the number of MFs in dimension danddN ( ) ( )2,2
,,d
ijdji
dji +=
. This
matrix is scaled to facilitate merging of the significantly similar membership functions
based on a threshold . The matrix is scaled as given in (4.18), where is
the largest element in .
dthreshold
d
dmax
.
41
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
51/109
)18.4(d
max
d
j,id
j,i
d
j,i
d
=
The primary advantage of using the KL distance is that it is not restricted by the
parametric form of the membership functions. The similarity between any two
membership functions is inversely related to the corresponding value in the distance
matrix. The advantage of KL distance matrix can be seen in the following case study
which involves the approximation of a function popularly known as Rosenbrocks banana
function as defined in (4.19).
)19.4()x1()xx(*100y2
2
22
21 The initial and final membership functions as identified by ANFIS are depicted in
figures 4.5 and 4.6. As can be seen from figures 4.5 and 4.6 there is as clear deterioration
of the rule and linguistic structure within each input dimension. However from an
approximation point of view the network is very precise as indicated by the MSE value
which is 0.00065141 after 1000 iterations.
The normalized KL distance matrices for the input dimension (X1) are computed
using the above formulae and are given in tables 4.9 and 4.10 where MF stands for
membership function. As can be seen from tables 4.9 & 4.10 the distance measures of
MF1 (from rest) are quite similar indicating a very wide span and hence higher overlap
with all the membership functions. This is indeed the case as can be seen from figure4.6.
It can also be seen that there is a gradual degradation of the structure because of
membership functions with very wide spans and closely spaced centers. This is indicated
in tables 4.9 & 4.10 where some of the distance measures are low in magnitude.
42
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
52/109
(a) (b)
Figure4.5. Initial membership functions (a) input 1 (b) input 2
(a) (b)
Figure4.6. Final membership functions (a) input 1 (b) input 2
Table4.9. KL distance matrix for X1
MF1 MF2 MF3 MF4 MF5 MF6
MF1 0 0.18253 0.23386 0.23386 0.32494 0.32495
MF2 0.18253 0 0.23104 0.23103 0.40117 0.40118
MF3 0.23386 0.23104 0 0.70052 0.86876 0.10774
MF4 0.23386 0.23103 0.70052 0 0.10774 0.86876
MF5 0.32494 0.40117 0.86876 0.10774 0 1
MF6 0.32495 0.40118 0.10774 0.86876 1 0
43
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesis
53/109
Table4.10. KL distance matrix for X2
MF1 MF2 MF3 MF4 MF5 MF6
MF1 0 0.48375 0.44889 0.44885 0.46912 0.46913
MF2 0.48375 0 1 0.99996 0.0026265 0.0026214
MF3 0.44889 1 0 1e-005 0.9768 0.97681
MF4 0.44885 0.99996 1e-005 0 0.97675 0.97676
MF5 0.46912 0.0026265 0.9768 0.97675 0 5e-006
MF6 0.46913 0.0026214 0.97681 0.97676 5e-006 0
The inference system is refined by eliminating (merging or deleting) the MFs that
result in structural deterioration as indicated by the distance measures. A threshold value
of 0.2 was chosen and MFs with lower distance measures are merged accordingly. The
resultant network was trained for 1000 epochs and the MSE value was found to be
0.00031699 which is 48% lower. The resultant MFs are given in figure 4.7.
Figure4.7. Final membership functions (a) Input1 (b) Input2
The resultant network as can be seen has higher legibility compared to the
original network and hence the KL measure can be used for validating the networks
legibility. However, the threshold for merging has to be subjectively decided so that
44
-
8/2/2019 !!! ICBM Intelligent Condition Based Maintenance Thesi