fuzzy unordered rule induction for evaluating cardiac arrhythmia
TRANSCRIPT
![Page 1: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/1.jpg)
Biomed Eng Lett (2013) 3:74-79
DOI 10.1007/s13534-013-0096-9
Fuzzy Unordered Rule Induction for Evaluating Cardiac Arrhythmia
V. S. R. Kumari and P. Rajesh Kumar
Received: 2 April 2013 / Revised: 29 June 2013 / Accepted: 30 June 2013
© The Korean Society of Medical & Biological Engineering and Springer 2013
Abstract
Heart rate variations reveal current or impending heart/
cardiac diseases. A non-stationary signal – heart rate - is
measured using Electrocardiogram (ECG) to assess cardiac
Arrhythmia. But studying ECG reports is both tedious and
time consuming when you have to locate abnormalities from
the collected data. This is overcome by computer based
analytical tools for in-depth data study/classification for
diagnosis. Automatic arrhythmia assessment is easy due to
the existence of image processing techniques. Many
algorithms exist for ECG signals detection/classification.
This paper investigates RR interval based ECG classification
procedures for arrhythmic beat classification, a process
based on RR interval beat extraction using Symlet on ECG
data. Extracted RR data is used as a classification feature
with beats being classified through a boosting algorithm and
Fuzzy Unordered Rule Induction Algorithm (FURIA).
Classification efficiency evaluation was through the MIT-
BIH arrhythmia database.
Keywords Arrhythmia classification, Electrocardiogram
(ECG), RR interval, Symlet, Boosting, Fuzzy Unordered
Rule Induction Algorithm (FURIA), MIT-BIH ECG data
INTRODUCTION
ECG records the heart’s electrical activity for heart disease
diagnosis. ECG is a non-invasive technique where voltage
plots are measured by leads against time. The signal is
measured on the human body’s surface. Heart rate/ rhythm
disorder or morphological pattern changes reveal cardiac
arrhythmia, detected by a recorded ECG waveform [1, 2]
analysis. High mortality due to heart diseases emphasises the
need for proper ECG arrhythmias detection/classification for
correct treatment. Though ECG’s visual inspection is tiring,
recent image processing advances have led to automatic
arrhythmia [3] assessment research resulting in algorithms
recently being developed for automatic ECG signals
detection/classification.
ECG measures electrical signals where every heart beat is
displayed as electrical waves with peaks and valleys. ECGs
provide two types of information. First, an electrical wave’s
duration in crossing the heart is determined by measuring
time intervals on the ECG with signal frequency ranging
from 0.05-100 Hz and dynamic ranges from 1-10 mV. Five
peaks and valleys labelled by successive letters of the
alphabet as P, Q, R, S and T characterize ECG signals. A
good ECG performance analysing system depends on
accurate detection of QRS complex as also T and P waves.
The heart’s upper chamber activation is represented by the P
wave, the atria; while QRS wave (or complex) and T waves
represent ventricular (lower chambers) excitation.QRS detection
is of great importance in automatic ECG signal analysis [4].
Once identified, a detailed ECG signal examination including
heart rate and ST segment are undertaken. A P-QRS-T wave’s
amplitude and duration includes heart afflicting disease
information, but ECGs do not provide data on cardiac
contraction/ pumping function. Cardiac diseases diagnosis is
through position and magnitudes of PR interval and segment,
ST interval and segment, QRS interval and QT interval. [5].
Fig. 1 reveals normal ECG pattern and components.
ECG determines Bundle branch block (BBB), a form of
heart block involving a conduction delay/failure in a branch
in the bundle of the heart. It may be complete/incomplete,
transient, permanent, or intermittent, and hence is named
based on the involvement of left or right bundle branch. It
cannot be discovered whether a bundle branch block is
complete. When linked to acute anterior wall myocardial
infarction, bundle branch block identifies high-risk patients.
V. S. R. Kumari ( ) Department of ECE, SMCE, Tummalapalem, Guntur, A. P NDIATel : +918632344402 / Fax : +91863244403E-mail : [email protected]
P. Rajesh Kumar College of Engineering, Andhra University, Visakhapatnam, INDIA
REVIEW ARTICLE
![Page 2: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/2.jpg)
Biomed Eng Lett (2013) 3:74-79 75
Cardiac impulses inability to be conducted down bundle
branches, cause abnormally shaped QRS complexes. BBB is
visible in high-risk, acute, anterior wall myocardial infarction,
due to ischemia or necrosis of the bundle branches, trauma
(as in surgical manipulation), or mechanical branch compression
due to a tumor. Further condition deterioration leads to
installation of a pacemaker.
Bundle branch block fall under a group of heart problems
called intraventricular conduction defects (IVCD). Two
bundle branches, right and left exist. The right bundle carries
nerve impulses that contract the right ventricle (the heart’s
lower chamber) and the left bundle’s nerve impulses contract
left ventricle. The two bundles join together initially at a
place named the bundle of His. Nerve impulses pass through
the heart’s sinus node to the bundle of His and then into right
and left bundle branches. Bundle branch block refers to a
slowing/interruption of nerve impulses. Patients with right
bundle branch block (RBBB) have limited or nil conducted
nerve impulses. The right ventricle receives impulse through
muscle-to-muscle spread, outside a regular nerve pathway.
This impulse transmission mechanism is slow leading to
delayed right ventricle contraction. There are many types of
left bundle branch block (LBBB) each having its own
characteristic failure mechanism.
ECG features are represented as statistical measures or
extracted from time and frequency domains. Though this
procedure provides impressive results in some classification
tasks, it still fails to discriminate all ECG beats. Wavelet
transformation representing signals in various translations/
scales [6] is another feature extraction method. ECG signals
based cardiac arrhythmia classification methods developed
were weak and inaccurate. So ECG features are extracted for
cardiac arrhythmia classification. Many machine learning and
data mining methods were sought to improve ECG arrhythmia
detection accuracy.
Automated heartbeat classification based on various
features representing ECG and other classification methods
were studied by other researchers [7-12] earlier. They include
frequency-based features [7], ECG morphology [8, 10], and
heartbeat interval features [10, 11], neural networks [12],
Support Vector Machine [11], and self-organizing networks
[8] are some common classification methods.
KhoureichKa [13] proposed a waveform similarity and
RR interval ECG beat classification procedure. This method
classified 6 heart beat types (normal beat, atrial premature
beat, paced beat, premature ventricular beat, left bundle
branch block beat and right bundle branch block beat).
Wavelet transform techniques denoised ECG signal and
extracted RR intervals were used as feature. The classifier
has an annotated beats training database compiled for it as it
used waveform comparison of unknown beats. The MIT/
BIH arrhythmia database’s 46 records were used for evaluation
achieving 97.52% classification rate.
M.G. Tsipouras et al., [14] suggested a knowledge-based
arrhythmic beat classification method and arrhythmic episode
detection and classification using only RR-interval signals
from ECG recordings. Arrhythmic beat classification algorithm
uses a three RR-interval sliding window. Classification is
undertaken through 4 beat categories: normal, premature
ventricular contractions, ventricular flutter/fibrillation and 28
heart block. Input used is beat classification of a knowledge-
based deterministic automation to ensure classification and
arrhythmic episode detection. The six rhythm types classified
include ventricular couplet, ventricular bigeminy, ventricular
tachycardia, ventricular trigeminy, ventricular flutter/fibrillation
and 28 heart block. The MIT-BIH arrhythmia database
evaluates the proposed methods revealing 94% accuracy for
arrhythmic episode detection and classification and 98%
accuracy for arrhythmic beat classification. This method is
better than other complicated techniques like RR-interval
signal used for arrhythmia beat and episode classification.
Bashir, et al., [15] proposed a nested ensemble technique
for real time cardiac arrhythmia classification. The method
includes a training data set and feature selection manipulation,
with the former being manipulated for classifier learning
through updating training data, and feature selection for
improved performance and accuracy during classification.
Experiments proved that all ECG features should be considered
for the proposed model’s evaluation and accuracy.
In this paper, classification methods for arrhythmic beat
classification based on RR interval in ECG are investigated.
The RR beat interval is extracted using Symlet conversion
on ECG data is used [16] as in our previous investigations.
Extracted RR data is used as classification feature for
boosting algorithm and Fuzzy Unordered Rule Induction
Algorithm (FURIA). Classification efficiency was evaluated
Fig. 1. Normal ECG signal and its P, Q, R, S and T peaks.
![Page 3: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/3.jpg)
76 Biomed Eng Lett (2013) 3:74-79
using MIT-BIH arrhythmia database. Classification efficiency
is evaluated with the MIT-BIH arrhythmia database. The
remainder of the paper is organized as follows: section 2
reviews related work in the literature, section 3 details the
methodology. Section 4 provides results and discussion, and
section 5 concludes the paper.
METHODOLOGY
Symlet wavelet
Waveforms bound in time and frequency are wavelets and
the latter’s analysis splits signals into the mother wavelet’s
shifted and scaled versions. The Continuous Wavelet
Transform (CWT) is known by the wavelet function ψ
adding signal times multiplied by scaled and shifted
versions. The continuous wavelet is defined mathematically
by
CWT is the reason for many wavelet coefficients C, a
function of scale and position. The original signal constituent
wavelets are got through multiplying all coefficients by
applicable scaled and shifted wavelets. Daubechies proposed
Symlet, almost symmetrical wavelets to get modifications of
the db family [17]. Both wavelet families are similar, with
difference of db wavelets having maximal phase while
Symlet have minimal phase. Symlet are compactly supported
wavelets with slight asymmetry and its wavelet coefficient is
any positive even number and highest number of vanishing
moments for a specific support width.
Boosting
In boosting, k classifiers set is iteratively learned with every
training tuple being assigned weights [18]. The process
begins with a classifier Mi learning, and updating weights,
thereby helping the next classifier, Mi+1, to assign more
weights to misclassified Mi training tuples. After k classifiers
are learned iteratively, the final boosted classifier, M*, sums
up every classifier’s votes.
A higher weight is allotted to the classifier through
boosting with a lower error rate vote. So, accurate classifiers
have higher weightage. Classifier Mi’s vote weightage is
computed as follows:
For each class, c, weights of every classifier that assigned
class c to X is added up. The class of a tuple is returned
based on highest vote sum.
Boosting with decision stump
A decision stump includes a split decision tree. As a decision
stump is a weak learner it is combined with weak learners by
AdaBoost algorithm into an accurate classifier [19]. Decision
stumps form a committee. Base classifiers are built on weighted
examples with the committee deciding on a majority vote.
When this process starts, all examples get equal weights
which are increased in the second round for misclassified
examples but decreased for correctly classified examples by
the first decision stump. This procedure is repeated till a
particular number of stumps are created. Every committee
member has its own specialty because of special training.
Hence, the performance of a diversified committee is better
than that of a single member.
Boosting with REP tree
Reduced Error Pruning (REP) tree
The created tree is the best fit on training data in a decision
tree learning algorithm [20]. Usually built trees overfit training
samples and so are pruned to reduce training data dependency
and also to ensure that they fit other examples. Reduced
Error Pruning (REP) trees are procedures to prune decision
trees as it both easy and effective, but it does have a tendency
to over prune a tree. REP trees main disadvantage is that
they need a separate pruning dataset. But it is very powerful
with a large training dataset or through boosting. REP pruning
replaces a subtree with a leaf representing an examples
majority. Modifications are required only when it reduces
error through equal or lower number of misclassifications.
Boosting with J48
J48 algorithm is an implementation of C4.5 decision tree
learner. J48 uses greedy algorithm to prompt decision tree
classification. Based on a decision-tree built with a training
dataset unseen data is classified. J48 node generated decision
trees evaluate the significance of every feature, e.g., heart
rate.
ECG features are classified to the appropriate arrhythmia
class by following a root to tree leaves path resulting in class
decision. Decision trees are built top-down, with a suitable
attribute chosen at each level. An information-theoretic
measure evaluates features to provide “classification power”
for every feature. The training dataset is split into subsets on
feature selection, conforming to selected feature values. This
process is repeated for all subsets, till subset instances come
under one class.
Fuzzy Unordered Rule Induction Algorithm (FURIA)
Fuzzy Unordered Rule Induction Algorithm (FURIA) is a
rule-based classification method founded on RIPPER [21].
C scale, position( ) f t( )ψ scale, position, t( )dt
∞–
∞
∫=
log1 error M
i( )–
error Mi
( )--------------------------------
![Page 4: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/4.jpg)
Biomed Eng Lett (2013) 3:74-79 77
FURIA learns fuzzy rules, - not conventional rules – and
unordered rule sets and not rule lists. Its advantage is
preserving simple and comprehensible rule sets. It includes
many modifications/extensions. It uses a rule stretching
method to deal with uncovered examples. Experiments prove
that FURIA outperforms original RIPPER and classifiers
like C4.5, significantly regarding classification accuracy.
FURIA specifically learns fuzzy rules instead of conventional
rules and unordered rule sets instead of rule lists. Uncovered
samples are handled with an efficient rule stretching method.
Fuzzy rules are general compared to conventional rules with
many advantages. For example, conventional (non-fuzzy)
rules produce “sharp” decision boundary models with
corresponding abrupt transitions among classes which is
questionable and not very intuitive. Also one expects support
for a class provided rule to reduce from “full” (inside rule
core) to “zero” (near boundary) slowly and not at once.
Fuzzy rules “soft” boundaries, is their characteristic, but they
convert to crisp boundaries when a classification decision is
planned. Boundaries are flexible in fuzzy rules. For example,
using suitable aggregation operators to combine fuzzy rules
which are not axis-parallel.
Conventional rule learners result in a decision list to
produce rules learned by turn for every class starting from
the smallest (in terms of relative occurrence frequency) and
ending with the second largest. Also, a default rule is added
to a majority class and a new query instance is classified by
the first rule in the list which covers it.
This has advantages and disadvantages. For example, it
might have an unwanted bias as classes are not treated
symmetrically. Sorting rules on a priority basis compromises
comprehensibility (condition part of every rule includes
negated conditions of earlier rules). To offset this FURIA
learns an unordered rule set. In other words, a rule set for
each class in a one-versus-rest scheme where the resultant
model is incomplete, i.e., a new query may be uncovered by
any rule (this way decision lists faces less problems). This
algorithm and results are extension of boosting techniques
for cardiac arrhythymia prediction [22].
EXPERIMENTAL SETUP
A MIT-BIH arrhythmia database is used to evaluate classifiers
performance. The evaluation dataset has 165 instance; 55
events each for Right bunch bundle block, Left bunch bundle
block and Normal RR interval. Continuous wavelet transforms
using symlet2 filters are applied. Figs. 2-4 show event
output when continuous wavelet transforms using symlet2 is
applied.
RESULTS AND DISCUSSION
Instances are classified as Right bunch bundle block (R ),
Left bunch bundle block (L) and Normal RR interval (N).
FURIA learns fuzzy rules for each class from the training
Fig. 2. Left bunch bundle block output on applying continuouswavelet transforms using symlet2.
Fig. 3. Right bunch bundle block output on applying continuouswavelet transforms using symlet2.
Fig. 4. Normal RR interval output on applying continuous wavelettransforms using symlet2.
![Page 5: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/5.jpg)
78 Biomed Eng Lett (2013) 3:74-79
instances. Following are the five rules obtained by FURIA:
(a2 in [-78.404, -69.722, inf, inf]) =>class=L (CF = 0.94)
(a118 in [-inf, -inf, -17.512, -12.86]) and (a213 in [-inf,
-inf, 1.1767, 1.2272]) => class=N (CF = 0.97)
(a118 in [-inf, -inf, -17.512, -14.293]) and (a238 in [-inf,
-inf, -0.54844, -0.35637]) => class=N (CF = 0.95)
(a233 in [-0.84513, -0.15514, inf, inf]) and (a6 in [-inf,
-inf, -80.307, -76.38]) => class=N (CF = 0.92)
(a156 in [-inf, -inf, -0.75362, 2.7266]) and (a82 in [2.2714,
3.0664, inf, inf]) => class=R (CF = 0.98)
Fuzzy Unordered Rule Induction Algorithm (FURIA) and
Boosting classify instances with a decision stump, boosting
with both J48 and REP tree. Classification accuracy
experiments results are provided in the following Tables/
Figures. Table 1 tabulates the result summary for varied
techniques. Fig. 5 shows plotted classification accuracy and
Fig. 6 shows plotted Root mean squared error.
According to observations the obtained results of the Root
Mean Square Error was minimised with FURIA algoritham
compared to previous algoritham i.e. boossting with decision
stump.
Table 2 tabulates the precision, recall and f-Measure. The
precision and recall is computed as
Precision =
Recall =
For a good classification system the values of precision
and recall should be high which is seen in the boosting with
decision stump and FURIA technique.
CONCLUSION
This paper investigates RR interval based ECG classification
method for arrhythmic beat classification. The methodology
is based on RR interval beat extraction using Symlet conversion
on ECG data. Extracted RR data is a classification feature.
Number of relevant images retrieved
Total number of images retrieved------------------------------------------------------------------------------------------
Number of relevant images retrieved
Total number of relevant images in the Database------------------------------------------------------------------------------------------------------------------------
f measure 2precision recall×
precision recall+----------------------------------------×=
Table 1. Summary of the results.
Boosting with decision stump Boosting with J48 Boosting with REP tree FURIA
Classification accuracy 92.125% 91.52% 91.52% 92.121%
Root mean squared error 0.211(21.1%)
0.1947(19.47%)
0.1891(18.91%)
0.1897(18.97%)
Fig. 5. Classification accuracy. Fig. 6. Root mean squared error.
Table 2. Precision, recall and F Measure.
Precision Recall F-Measure
Boosting with decision stump 0.911 0.921 0.916
Boosting with J48 0.91 0.915 0.912
Boosting with REP tree 0.915 0.915 0.914
FURIA 0.911 0.921 0.916
![Page 6: Fuzzy Unordered Rule Induction for evaluating cardiac Arrhythmia](https://reader031.vdocuments.net/reader031/viewer/2022020614/575094711a28abbf6bb90d63/html5/thumbnails/6.jpg)
Biomed Eng Lett (2013) 3:74-79 79
Boosting algorithms and FURIA classify beats. MIT-BIH
arrhythmia database evaluates classification efficiency. Instances
are classified as Right bunch bundle block, Left bunch
bundle block and Normal RR interval. FURIA achieved a
classification accuracy of 92.12% and Root mean square
error with boosting with decision stump was 0.211 (21.1%)
and with FURIA was 0.1893 (18.97%).
CONFLICT OF INTEREST STATEMENTS
Kumari VSR declares that s/he has no conflict of interest in
relation to the work in this article. Kumar PR declares that
s/he has no conflict of interest in relation to the work in this
article.
REFERENCES
[1] Couderc JP, Xia X, Peterson DR, McNitt S, Zhao H, PolonskyS, Zareba W. T-wave morphology abnormalities in benign,potent, and arrhythmogenic I< sub>kr</sub> inhibition. HeartRhythm. 2011; 8(7):1036-43.
[2] El-Sherif N, Caref EB, Yin H, Restivo M. Theelectrophysiological mechanism of ventricular arrhythmias inthe long QT syndrome: tridimensional mapping of activationand recovery patterns. Circ Res. 1996; 79(3):474-92.
[3] Saxena SC, Kumar V, Hamde ST. Feature extraction from ECGsignals using wavelet transforms for disease diagnostics. Int JSyst Sci. 2002; 33(13):1073-85.
[4] Raghavendra BS, Bera D, Bopardikar AS, Narayanan R.Cardiac arrhythmia detection using dynamic time warping ofECG beats in e-healthcare systems. IEEE Int Sym Wireless,Mobile, Multimedia Networks. 2011. pp. 1-6.
[5] McSharry PE, Clifford GD, Terassenko L, Smith LA. Adynamical model for generating synthetic electrocardiogramsignals. IEEE T Bio-med Eng. 2003; 50(3):289-94.
[6] Özbay Y, Ceylan R, Karlik B. Integration of type-2 fuzzyclustering and wavelet transform in a neural network basedECG classifier. Expert Syst Appl. 2011; 38(1):1004-10.
[7] Ince T, Kiranyaz S, Gabbouj M. A generic and robust system forautomated patient-specific classification of ECG signals. IEEET Bio-med Eng. 2009; 56(5):1415-26.
[8] Kaneko M, Iseri F, Gotoh T, Yoneyama T, Yamauchi T,Takeshita K, Sueda N. QRS morphological analysis using twolayered self-organizing map for heartbeat classification. ComputCardiol. 2010. pp. 975-8. IEEE.
[9] Basil T, Chandra BS, Lakshminarayan C. A comparison ofstatistical machine learning methods in heartbeat detection andclassification. Big Data Analytics. 2012. pp. 16-25.
[10] De Chazal P, O'Dwyer M, Reilly RB. Automatic classificationof heartbeats using ECG morphology and heartbeat intervalfeatures. IEEE T Bio-med Eng. 2004; 51(7):1196-206.
[11] Kampouraki A, Manis G, Nikou C. Heartbeat time seriesclassification with support vector machines. IEEE T Inf TechBiomed. 2009; 13(4):512-8.
[12] Jiang W, Seong KG. Block-based neural networks forpersonalized ECG signal classification. IEEE T Neural Networ.2007; 18(6):1750-61.
[13] Ahmad K. ECG beats classification using waveform similarityand RR interval. 2011. http://arxiv.org/abs/1101.1836.
[14] Tsipouras MG, Fotiadis DI, Sideris D. An arrhythmiaclassification system based on the RR interval signal. Artif IntellMed. 2005; 33:237-50.
[15] Bashir MEA, Akasha M, Lee DG, Yi GM, Ryu KH, Cha EJ, BaeJW, Cho MC, Yoo CW. Nested ensemble technique forexcellence real time cardiac health monitoring. BioComp. 2010.pp.519-25.
[16] Kumari R. Performance evaluation of boosting techniques forcardiac arrhythmia prediction. Int J Comput Appl. 2012; 57(17).
[17] Boultadakis G, Skrapas K, Frangos P. Time-frequency analysisof radar signals. RTO/SET, 80. 2004. pp. 7.1-20.
[18] Friedman J, Hastie T, Tibshirani R. Additive logistic regression:a statistical view of boosting. Ann Stat. 2000; 28(2):337-407.
[19] Rätsch G, Onoda T, Müller KR. Soft margins for adaBoost.Mach Learn. 2001; 42(3):287-320.
[20] Quinlan JR. Simplifying decision trees. In Gaines B, Boose J,editors. Knowledge acquisition for knowledge-based systems.1987. pp. 239-52.
[21] Hühn J, Hüllermeier E. FURIA: an algorithm for unorderedfuzzy rule induction. Data Min Knowl Disc. 2009; 19(3):293-319.
[22] Kumari VSR, Kumar PR. Performance of evaluation ofboosting technique for cardiac arrhythmia prediction. Int JComput Appl. 2012; 57(17):18-223.