Uploaded by Hafizudin Imran

Crackle separation and classification from normal Respiratory sounds using Gaussian Mixture Model

advertisement
2015 IEEE 10th International Conference on Industrial and Information Systems, ICIIS 2015, Dec. 18-20,2015, Sri Lanka
Crackle Separation and Classification from Normal
Respiratory Sounds Using Gaussian Mixture Model
Syed Osama Maruf, M. Usama Azhar, Sajid Gul Khawaja, M. Usman Akram
College of Electrical and Mechanical Engineering,
National University of Sciences and Technology, Pakistan.
maru(jr1991@hotmail.com, usamaazhar32@gmail.com, sajid.gu1.2009@gmail.com, usmakram@gmail.com
Abstract-Analysis of Respiratory sound signal is helpful in
detection of adventitious lung sound which are an indication
of disease. This helps in classification of normal respiratory
sounds from abnormal respiratory sounds and this can be used to
accurately diagnose respiratory diseases as is done by a medical
practitioner via auscultation. This process has subjective nature
and that is why simple auscultation cannot be relied upon. A
computer aided diagnostic system which analyzes respiratory
sounds can be very helpful in detection of various respiratory
diseases such as pneumonia, asthma, bronchitis and tuberculosis.
In this paper we present a novel method for automated detection
-0.3
of crackles which indicate severity of a respiratory disease. The
proposed system consists of four modules i.e., pre-processing in
which noise is filtered out, followed by feature extraction. The
proposed system then performs feature selection based on rank
tests and finally classification to separate crackles from normal
breath sounds.
I.
0.6
I N T RODUCTION
Millions of people die due to respiratory diseases all over
the world and according to World Health Organization (WHO)
Chronic Obstructive Pulmonary Disease (COPD) will become
the third leading cause of death worldwide by 2030. In Europe,
chronic obstructive pulmonary disease effects about 10% to
25% of the adult population [1]. Latest studies also show
that Pneumonia alone accounts for about a million deaths in
children annually. This represents 18% of all annual under­
five worldwide mortality [2] and about 98% of these deaths
occur in developing countries. The economic cost of asthma,
COPD, and pneumonia was $106 billion in 2009: $81 billion
in direct health expenditures $25 billion in indirect cost of
mortality [3]. The high mortality rate can be attributed to
the fact that there is a scarcity of trained medical health
professionals as compared with the number of patients and also
due to lack of awareness about health facilities in people of
backward areas. The diagnosis of these diseases is conducted
by recording of a detailed history of the patient followed by a
physical exam in which detection of adventitious lung sounds
as well as abnormal breath sounds is undertaken with the help
of a stethoscope; a process called auscultation. Adventitious
lung sounds are additional respiratory sounds superimposed on
breath sounds [4] and are divided into two types 1) stationary
and 2) non-stationary. Stationary adventitious lung sounds are
wheezes and rhonchi whereas non-stationary sounds include
crackles. Crackles are discontinuous explosive sounds which
occur usually during inspiration [4]. Figure 1 shows the
waveforms for normal breath and crackles.
978-1-4799-1876-8/15/$31.00 ©2015 IEEE
0.8
267
X
10!o
Fig. I.
Breath sound waveforms. Top: Normal breath sound waveform;
Bottom: Crackle sound waveform
The occurrence of crackles is an indication of the severity of
the pulmonary disease [5] and simple auscultation cannot be
relied upon as auscultation with a stethoscope is a subjective
process since it depends on the individual's own hearing, ex­
perience and ability to distinguish different sounds patterns[l].
Thus there is a need for a system which accurately detects the
presence of crackles in the respiratory sound of patient. This
system would not be able to replace the doctor but would be
valuable for countries where the doctor to patient ratio is low.
This article consists of five sections. Section 2 highlights
existing methods and related work for respiratory sound anal­
ysis. Section 3 describes a brief overview and all steps of
the proposed system. The results are presented in Section 4
followed by conclusions in Section 5.
2015 IEEE 10th International Conference on Industrial and Information Systems, ICIIS 2015, Dec. 18-20,2015, Sri Lanka
II.
RELAT ED WORK
Respiratory Sound Analysis is a relatively new area of
interest for researchers and not many methods of detection
and classification of crackles have been formulated. Presence
of crackles is strong indicator of pathological illness and it
is imperative that methods should be devised for its detec­
tion to help in clinical diagnosis. The existing methods are
broadly categorized as the non-linear separation stationary
non-stationary filter (ST-NST) [6] and its several modified
version [7], [8], the wavelet transform-based stationary-non­
stationary filter (WTST-NST) [9] and the generalized fuzzy
rule-based stationary non-stationary filter (GFST-NST)[10].
Mohammed Bahoura and Xiaoguang Lu [11], [12] pre­
sented a model WPST- NST method based on double thresh­
holding in wavelet domain by using time domain features
which separates crackle's coefficients. Unlike simple wavelet
transform, wavelet packet transform is obtained by applying
wavelet transform at every level which is equivalent to multi­
channel filtering. Using this model an accuracy of 93.9%
is achieved [12].Fatma Ayari et al. [13] has discussed two
methodologies to classify crackles and their extraction from
the lung signal. The first one is statistics based methodology
and the second is fuzzy nonlinear classifiers. Nine features
have been selected to enhance behavior of crackle. These
features relate to amplitude, time and waveform. Sensitivity
of 98.34% and positive predictive value of 97.88% have been
achieved. Martinez-Hernandez et al. [14] used lung sounds
which are acquired by multi-channel microphone array on
which feature extraction is done by multivariate AR model
and after dimensionality reduction techniques like PCA and
SVD are applied, the classification was done by SNN (Super­
vised Neural Network) using back-propagation method and
Levenberg-Marquardt rule a.k.a. damped-least square. They
classified DIP (diffuse interstitial pneumonia) characterized by
crackles (100-2000 Hz). SNN is then applied and gave best
results at 20 nodes for SVD and 25 nodes for PCA. MAR
in conjunction with PCA is superior it gave an accuracy of
92.86% in the testing phase.
III.
N o rm al
Fig. 2.
A.
PROPOSED M ET HOD
268
Flow diagram of proposed system
Preprocessing
Pre-Processing of the lung sound is done with two objectives
in mind, i.e., reduction of background noises and the other is
to enhance the quality of the recorded sound. To cater for
the former objective pre-filtering using a band-pass filter with
cut-off frequencies at 100 Hz to reduce heart sounds and 2500
Hz to eliminate high- frequency noise is applied [15] as it is
within these frequency ranges that the lung sounds are present
[16].
B.
Over the past few years computerized methods of respi­
ratory sound acquisition and analysis have overcome many
shortcomings of simple auscultation [l].Computer aided di­
agnostic systems have brought new horizons in detection and
treatment of many diseases. Thus, many different techniques
have been used to acquire and then use respiratory sounds for
diagnosis of pulmonary diseases. Figure 2 shows a complete
flow diagram for all the phases of the proposed system start­
ing from sound acquisition to classification of crackles. The
proposed system is divided into four phases, i.e., acquisition
of the lung sound and preprocessing, feature extraction and
wavelet decomposition with removal of the silent phase, and
finally feature selection and classification. A Gaussian Mixture
Model based on a Bayes decision rule is used to differentiate
crackles from normal respiratory sounds.
Cra c k l e s
Feature Extraction
A feature set is formed to distinguish between normal
respiratory sounds and crackles. The formation of the feature
vector was done by the following method: The pre-processed
sound files are used to calculate three spatial-temporal features
namely, pitch, energy and spectrogram. These features are
extracted from the entire spectrum after which the wavelet
decomposition is done using Daubechies-8 wavelet for 5th
level decomposition tree. The levels containing silent phase
are then discarded and the level with the most information,
i.e., the last level is then used to calculate other 9 features.
The entire feature vector contains 12 features which are of
the form Iv
Xl, X2, X3, ........., X12. The features which we
have used are Pitch(xl), Spectrogram(x2), Energy(x3), Shan­
non Entropy(x4), Mean(x5), Range (x6), IQR(x7), MAD(x8),
Moment(x9), Skewness (x10), Kurtosis(x11) and PSD(x12).
=
2015 IEEE 10th International Conference on Industrial and Information Systems, ICIIS 2015, Dec. 18-20,2015, Sri Lanka
The description of the features in the feature vector is given
below:
1) Pitch(Xl) is associated with the frequency of sound
wave and allows its ordering based on it [17], i.e., high
pitch is attributed to sound having a high frequency
and low pitch is associated with sound having a low
frequency.
2) Spectrogram(x2) is a visual representation of the spec­
trum of frequencies in a sound or other signal as they
vary with time.
3) Energy(x3) is the energy of wavelet or wavelet packet
decomposition.
4) Shannon Entropy(x4) is measure of diversity or ran­
domness of the sound wave [18].
5) lvIean(xs) of the sound is calculated for each candidate
region in order to differentiate between crackles and
normal respiratory sounds.
6) Range(x6) is the difference between the maximum and
minimum of the sound sample.
7) IQR(x7) is the inter-quartile range of the sound in time­
domain.
8) !vI AD(xs) is the median absolute deviation of the
sound. Moment(x9) central sample moment of sound
specified by the positive integer order.
9) Skewness(xlO) is a measure of the asymmetry of the
probability distribution of a real-valued random variable
about its mean.
10) Kurtosis(Xll) is a descriptor of the shape of the prob­
ability distribution of the real- valued random variable
[19].
11) PSD(X12) the power spectrum density is the average
power of the sound spectrum.
TABLE I
ANALYSIS OF FEATURES FOR FEATURE SELECTION USING RANKSUM
TEST
";'
.
r'
..
05
. . ...
.
.
�
o
3
.
.
.... .
.
"
.
.
nd Iq,
.. ' . '"1"'
.
. ..
Ansari Bradley Test
P Value
Score
292
0.9305
0.397
275
0.567
280
252
0.034
0.422
305
260
0.09
252
0.034
0.0517
255
314
0.19
341
0.005
324
0.062
0.051
255
M d
•
15
.
··· ·
..
.
.
: ......
. .
.
r·- ....
"
: .....
, 0 -
•
·
"
.
"'
.
.
'l1""
�..
;.
••
:
.
.
.
.
'.:
•
••
,.
IS
o
0
KurtoStS
•
Mad and Iqr
....
15
r
•.
....
..
05
':
.
. ,'-
. .
. . .
.....
"0f"'
_.'
..
o
3
C. Feature Selection
269
Test
Score
219
228
215
295
302
361
293
278
323
342
441
278
R nSt
Feature extraction phase extracts different features out of
which best features are selected by feature selection phase
using rank tests.
Feature selection phase selects a feature set from complete
feature vector using two rank sum tests i.e. Wilcoxon Rank­
Sum and Ansari-Bradley Tests. Wilcoxon Rank-Sum test is a
non-parametric test of the null hypothesis that two populations
are the same against an alternative hypothesis that the two dis­
tributions differ only with respect to the median. It has higher
efficiency on non- normal distributions such as a mixture of
normal distributions [20]. Ansari-Bradley test compares two
independent samples which come from the same distribution
against the alternative that they come from same distributions
having the same median and shape but different variances [21].
Table 1 shows the results obtained when Wilcoxon Rank-Sum
and Ansari-Bradley tests are applied to different features.
The features showing the least P -value are selected based
on the results of rank sum tests. Top four common features
with minimum p - value are selected which are Range(x6),
IQR(x7), !vI AD(xs) and Kurtosis(Xl1). They are then used
to classify crackles from normal respiratory sounds. Figure 3
shows the plots for top four features
Wilcoxon
P Value
0.0405
0.0717
0.0309
0.989
0.8366
0.0675
0.989
0.67
0.433
0.191
0.00005
0.67
Features
Pitch
Spectrogram
Energy
Entropy
Mean
Range
IQR
Mad
Moment
Skewness
Kurtosis
PSD
t::
.
.
.
.
. :!::
,
.
.
.
.
.
.
•
'- .!It."
:���.
....
...
.
.
:,
'. � .
'1"
:
ro., • •:
.
' :
.
.
.
.. "
.
"'1' '
'
.
.
.
��
150
"
o
0
Fig. 3. 3 dimensional scatter plots. Top plot shows the scatter plot for range,
MAD and IQR. Bottom plot shows the scatter plot for kurtosis, MAD and
IQR.
D.
Classification
For the purpose of classification, we use a Bayesian classi­
fier using Gaussian functions known as the Gaussian Mixture
Model [22]. The Gaussian Mixture Model offers fast and
accurate performance in the testing phase while being trained
extensively in the training phase. In our context, each sound
class is modeled by a Gaussian Mixture Model obtained
by training [23]. The training is done by using a training
data set with the respiratory sounds labeled as crackles and
normal after which the Bayes decision rule is used in the
2015 IEEE 10th International Conference on Industrial and Information Systems, ICIIS 2015, Dec. 18-20,2015, Sri Lanka
estimation of a decision criteria from the training set. Two
classes Xl
cr-ackles and X2
nor-mal have been defined.
The parameters for GMM are optimized using Expectation
Maximization (EM) which is an iterative method and it
chooses optimal parameters by finding the local maximum
value of GMM distributions for training data [24].
Bayes decision rule is stated as [24]
=
=
Choose
iJ,p(vIRl)P(Rl)
Rl
other-wise
choose
R2
>
accuracy as figures of merit. These parameters were calculated
using equations 1-4 respectively.
Sensitivity(Sen)
SpeciJicity(Spec)
p(vIR2)P(R2)
PPV
ACCUTacy(Acc)
=
p(vIRi)
=
L N(vlJ.Lj, �j)Wj
j=l
(3)
Expectation Maximization is done to select an optimum value
of k where in the estimation step PE of each point of the
Gaussian is calculated using
PE(n,J ) .
_
wjN(vnI/Lj, �j)
"
L... i=l N(vnIJ.Li, �i)Wi
"",
1
=
NTo!."l
�j
�
PE(n,j)(vn -,Lj)(Vn - J.Ljf
Wj -
�j
---
NTotal
(6)
(7)
L;:�'(Ol PE(n,j) and NTotal are the total number
where �j
candidate regions.
=
IV.
(10)
(TP+FP)
(TP+TN)
=
(9)
(TN+FP)
(TP+TN+FN+FP)
(11)
•
•
•
TP are true positives, meaning crackles are correctly
classified.
TN are true negatives, meaning normal respiratory sounds
are correctly classified.
FP are false positives, meaning normal respiratory sounds
are wrongly classified as crackles regions.
FN are false negatives, meaning crackles are wrongly
classified as normal respiratory sounds.
Table-2 shows the evaluation results of proposed system and its
comparison with other classifiers like support Vector Machine
(SVM) and Artificial Neural Network (ANN) consisting of 10
nodes in the input layer.
TABLE II
PROPOSED SYSTEM EVALUATION RESULTS
Classifier
GMM
SVM
ANN
TP
13
13
10
TN
27
25
25
FN
I
1
4
FP
0
2
2
Sen
92.85
92.8
85.3
Spec
100
92.5
7l.2
PPV
100
86.6
92.5
Acc
97.56
92.6
85.3
(4)
In the final step maximization of the likelihood function is
achieved by re-calculation of covariance matrix, mean and
weight of the jth Gaussian by using the equations given below:
�j
TN
=
where
•
where v and J.L are feature vector containing m number of
features and mean vector containing mean of each feature
respectively. � is a m x m covariance matrix. In our case
m
3. Then the likelihood function of the Gaussian is
calculated as
(8)
(TP+FN)
TP
=
(I)
where p(vIRi) is the class conditional Probability Density
Function (pdt) also known as likelihood and P(Ri) is the prior
probability of class Ri which is calculated as the ratio of class
Ri samples in the training set. The conditional probability
density function of the feature vector for the two classes is
calculated using
TP
=
EXPERIMENTAL RESULT S
The evaluation of proposed system is done using respiratory
sound files. We use a sound repository of 41 files out of which
14 have been categorized as crackles by a human grader. The
performance of the proposed system has been evaluated using
sensitivity, specificity ,positive predictive value (PPV) and
270
Gaussian Mixture Model out performs the other classifiers
and is thus selected as the classifier of choice in our proposed
method. The effectiveness of GMM can be ascertained from
the high accuracy shown in the testing phase.
For performance comparison with existing methods, we
present the values of sensitivity, specificity, PPV and accuracy
of the methods of Xiaoguang Lu et al. [11] , Martinez­
Hernandez et al. [15] and Fatma Ayari et al. [13]. The results
of these comparisons are shown in table-3.
TABLE III
PERFORMANCE COMPARISON OF PROPOSED SYSTEM WITH EXISTING
METHODS
Method
Lu, Xiaoguang et al. [11]
Martinez-Hernandez et al. [15]
Fatima Ayari et al. [13]
PM
Sen
92.9
91.38
98.34
92.85
Spec
PPV
94. 4
94.34
100
97.88
100
Acc
93.9
92.86
97.56
The table shows that the proposed system outperforms in
terms of accuracy of classification. The reason for the improve­
ment can be accredited to the fact that in the proposed method
a detailed feature set and classifier have been employed which
is not done by the above mentioned authors.
2015 IEEE 10th International Conference on Industrial and Information Systems, ICIIS 2015, Dec. 18-20,2015, Sri Lanka
V.
CONCLUSION
Digital signal processing techniques can be applied to study
lung sounds as an aid to clinical diagnosis [23]. New methods
for identifying and measuring adventitious lung sounds are
being developed in various research institutes [25]. We have
introduced a new classification tool for Crackles analysis as
their detection is important for the evaluation of the severity
of the respiratory disease. The proposed system consisted of
four phases i.e. preprocessing, feature extraction, feature se­
lection and finally the classification. We have used sensitivity,
specificity, PPV and accuracy to evaluate our proposed system.
The system has achieved specificity of 100%, sensitivity of
92.85%, PPV of 100% and accuracy of 97.56% which are
better than the recently published methods. It is evident
from the comparison with previous systems that the proposed
method has outperformed them and has classified crackles
form other respiratory sounds with good accuracy.
REFERENCES
[I] A.R. A. Sovijarvi, J.Vanderschoot, J.E.Earis, " Standardization of comput­
erized respiratory sound analysis". Eur Respir Rev 10;77,585,2000.
[2] UNICEF, Pneumonia and Diarrhea Tackling the Deadliest Diseases of the
World's Poorest Children, UNICEF Division of Policy and Strategy, New
York, June 2012.
[3] "NHLBI Fact Book, Fiscal Year 2012",Page 51, 2012
[4] A.R. A. Sovijrvi, F. Dalmasso, J. Vanderschoot, L.P. Malmberg, G. Righ­
ini, S.A.T. Stoneman," Definition of terms for applications of respiratory
sounds". Eur Respir Rev 10;77,597-610,2000
[5] Epler GR, Carrington CB, Gaensler EA. Crackles (rales) in the interstitial
pulmonary diseases. 73: 333-339, Chest 1978
[6] M. Ono, K. Arakawa,M. Mori, T. Sugimoto, and H. Harashima, "Sepa­
ration of fine crackles from vesicular sounds by a nonlinear digital filter,"
IEEE Trans Biomed Eng, vol. 36, no. 2, pp. 286-291, 1989.
[7] K. Arakawa, H. Harashima, M. Ono, and M. Mori, "Non-linear digital
filters for extracting crackles from lung sounds," Front. Med. BioI. Eng., ,
no. 3, pp. 245- 257, 1991
[8] L. J. Hadjileontiadis and S. M. Panas, "Nonlinear separation of crackles
and squawks from vesicular sounds using third-order statistics," in 18th
International Conference of the IEEE Engineering in Medicine and Biology
Society, vol. 5, pp. 2217-2219, 1996
[9] L. J. Hadjileontiadis and S.M. Panas, "Separation of discontinuous adven­
titious sounds from vesicular sounds using a wavelet-based filter," IEEE
Trans Biomed Eng, vol. 44, no. 12, pp. 1269-1281, 1997.
[10] YA.Tolias, L. J. Hadjileontiadis, and S. M. Panas, "Realtime separation
of discontinuous adventitious sounds from vesicular sounds using a fuzzy
rule-based filter," IEEE Trans Inf Technol Biomed, vol. 2, no. 3, pp. 204215, 1998
[II] Mohammed Bahoura and Xiaoguang Lu ,"SEPARATION OF CRACK­
LES FROM VESICULAR SOUNDS USINGWAVELET PACKET
TRANSFORM" ,2006
[12] Mohammed Bahoura and Xiaoguang Lu, " An Automatic System For
Crackles Detection And Classification" IEEE CCECE/CCGEI, 2006
[13] Fatma Ayari, Melli Ksouri and Ali Alouani,"A new scheme for auto­
matic classification of pathologic lung sounds", IJCSI International Journal
of Computer Science Issues, Vol, 9, Issue 4, No.1 July 2012.
[14] Martinez-Hernandez, H.G. , Aljama-Corrales, c.T., Gonzalez-Camarena,
R., Charleston-Villalobos Y.S, Chi-Lem, G. ," Computerized Classification
of Normal and Abnormal Lung Sounds by Multivariate Linear Autoregres­
sive Model". Proceedings of the 2005 IEEE Engineering in Medicine and
Biology 27th Annual Conference, Shanghai, China, Sep 1-4,2005
[IS] J.E.Earis, B. M.G. Cheetham.," Current methods used for computerized
respiratory sound analysis ". Eur Respir Rev 10;77,586-590,2000
[16] G. Charbonneau, E. Ademovic, B.M.G. Cheetham, L.P. Malmberg, J.
Vanderschoot, A.R. A. Sovijrvi, "Basic techniques for respiratory sound
analysis", Eur Respir Rev 2000; 10: 77, 625-635
[17] Anssi Klapuri and Manuel Davy (2006). "Signal processing methods for
music transcription". Springer. p. 8. ISBN 978-0-387-30667-4.
271
[18] Ihara, Shunsuke (1993). "Information theory for continuous systems".
World Scientific. p. 2. ISBN 978-981-02-0985-8.
[19] Dodge, Y "The Oxford Dictionary of Statistical Terms", OUP. ISBN
0-19-920613-9,2003
[20] Paul A. Pappas and Venita DePuy ," An Overview of Non-parametric
Tests in SAS: When, Why, and How"
[21] A.R. Ansari and R.A. Bradley ,"Rank-Sum Tests for Dispersions",
Institute of Mathematical Statistics is collaborating with JSTOR to digitize,
preserve, and extend access to T he Annals of Mathematical Statistics
[22] M. Usman Akram, Anam Tariq, M. Almas Anjum, and M. Younus Javed
" Automated detection of exudates in colored retinal images for diagnosis
of diabetic retinopathy". Applied Optics/ Vol. 5I ,No.201l 0 July 2012
[23] M. Bahoura, C. Pelletier , "Respiratory Sounds Classification using
Cepstral Analysis and Gaussian Mixture Models". Proceedings of the 26th
Annual International Conference of the IEEE EMBS San Francisco, CA,
USA 0 September 1-5, 2004
[24] R. O. Duda, P. E. Hart, and D. G. Stork, "Pattern Classification" Wiley,
2001
[25] J.E.Earis and B. M.G. Cheetham ,"Future perspectives for respiratory
sound research". Eur Respir Rev 10:77,641-646,2000
Download