Volume 15 Supplement 6
Knowledge Discovery and Interactive Data Mining in Bioinformatics
Selection of entropy-measure parameters for knowledge discovery in heart rate variability data
- Christopher C Mayer†^{1}Email author,
- Martin Bachler†^{1, 2},
- Matthias Hörtenhuber^{2},
- Christof Stocker^{3},
- Andreas Holzinger^{3} and
- Siegfried Wassertheurer^{1}
https://doi.org/10.1186/1471-2105-15-S6-S2
© Mayer et al.; licensee BioMed Central Ltd. 2014
Published: 16 May 2014
Abstract
Background
Heart rate variability is the variation of the time interval between consecutive heartbeats. Entropy is a commonly used tool to describe the regularity of data sets. Entropy functions are defined using multiple parameters, the selection of which is controversial and depends on the intended purpose. This study describes the results of tests conducted to support parameter selection, towards the goal of enabling further biomarker discovery.
Methods
This study deals with approximate, sample, fuzzy, and fuzzy measure entropies. All data were obtained from PhysioNet, a free-access, on-line archive of physiological signals, and represent various medical conditions. Five tests were defined and conducted to examine the influence of: varying the threshold value r (as multiples of the sample standard deviation σ, or the entropy-maximizing r_{Chon}), the data length N, the weighting factors n for fuzzy and fuzzy measure entropies, and the thresholds r_{ F } and r_{ L } for fuzzy measure entropy. The results were tested for normality using Lilliefors' composite goodness-of-fit test. Consequently, the p-value was calculated with either a two sample t-test or a Wilcoxon rank sum test.
Results
The first test shows a cross-over of entropy values with regard to a change of r. Thus, a clear statement that a higher entropy corresponds to a high irregularity is not possible, but is rather an indicator of differences in regularity. N should be at least 200 data points for r = 0.2 σ and should even exceed a length of 1000 for r = r_{Chon}. The results for the weighting parameters n for the fuzzy membership function show different behavior when coupled with different r values, therefore the weighting parameters have been chosen independently for the different threshold values. The tests concerning r_{ F } and r_{ L } showed that there is no optimal choice, but r = r_{ F } = r_{ L } is reasonable with r = r_{Chon} or r = 0.2σ.
Conclusions
Some of the tests showed a dependency of the test significance on the data at hand. Nevertheless, as the medical conditions are unknown beforehand, compromises had to be made. Optimal parameter combinations are suggested for the methods considered. Yet, due to the high number of potential parameter combinations, further investigations of entropy for heart rate variability data will be necessary.
Keywords
entropy parameter selection comparison knowledge discoveryBackground
Heart rate variability (HRV) is the variation of the time interval between consecutive heartbeats. It highly depends on the extrinsic regulation of the heart rate (HR) and reflects the balance between the sympathetic and the parasympathetic nervous system [1]. Batchinsky et al. [2] develop a collection of methods for using HRV to describe regular periodic oscillations in the heart rate, attributed to the vagal and/or sympathetic branches of the autonomic nervous system.
Research on HRV has attracted considerable attention in the fields of psychology and behavioral medicine. It has its origin in the search for non-invasive correlates of injury severity which can be extracted from available signals in order to discover new cardiac biomarkers [2]. These signals are usually ones that are routinely measured, and include sources like a photoplethysmogram or the electrocardiogram (ECG) [3]. Electrocardiography is an interpretation of the electrical activity of the heart over some period of time. It is a non-invasive procedure, using electrodes attached to the surface of the skin, and is commonly used to measure the heart rate, the regularity of the beats, and characterize properties or injuries in the heart chamber. An R-to-R interval (RRI) describes the latency between two consecutive R peaks in the ECG. The RRI time series are used as input for the determination of HRV parameters.
In studies of HRV, both time- and frequency-domain measures are typically used by practitioners and researchers [1, 4]. Additionally, further knowledge about the subject's status can be discovered by the evaluation of certain patterns and shifts in an "apparent ensemble amount of randomness" of a stochastic process [5]. This randomness, as well as the predictability of this process, can be measured by entropy [6]. Thus, it is a commonly used tool to describe the regularity of large biomedical data sets. The more regulatory inputs a system has, the higher its irregularity is, due to interference of those regulatory systems. This assumption is also true for many biomedical systems such as HRV [7]. Therefore, it is a reasonable hypothesis that a more regular heart rate variability is connected to a defect in a regulatory system. To measure this irregularity, Pincus et al. proposed approximate entropy (ApEn) [8]. Further types of entropy were developed based on this method to improve it [9]. Because of its ability to measure regularity, entropy is widely used as a diagnostic tool in medicine to derive and discover biomarkers in large biomedical data. Its applications range from sudden infant death syndrome [7], to complexity analysis of intracranial pressure dynamics during periods of severe intracranial hypertension [10], to quantification of amplitude variations in mechanomyographic signals [11], to analysis of short gait data sets [12], to automatic detection of normal, pre-ictal, and ictal conditions from recorded electroencephalography signals [13], to the postural sway in stroke patients [14].
The approach to quantify the structural complexity (or, inversely, regularity) of the HRV is called heart rate complexity (HRC) and utilizes methods derived from nonlinear dynamics. Note that complexity and variability are not necessarily the same [15]. A periodical signal, such as a sine wave, is variable but not complex. This property allows complexity measures to ignore the complicated periodic oscillations to at least some extent [16].
To date, numerous entropy types are widely accepted as measures of the HRC. However, since their function is controlled by three, four or six parameters, there are many possible combinations to choose from. The selection of criteria for these parameters is controversial and heavily depends on the intended purpose and the data at hand [17]. The variation of only one parameter often results in highly non-linear behaviour [12]. Therefore, results of calculations with different parameters cannot be easily extrapolated from existing data, but have to be computed individually. Hence, the variation of several parameters in order to optimize the reliability of the results is a very time consuming process. To be used in daily routine, reasonable parameters have to be selected prior to the evaluation of HRC.
The determination of appropriate parameters for entropy in general [11, 12, 18–20] and for HRV applications in particular [16, 17, 21, 22] is the subject of on- going research. A summary of the current knowledge regarding parameter selection is given in the subsection "Parameter selection". In order to verify the results of previous publications and to extend them by the usage of more entropy measures, this work focuses on the parameter selection for approximate, sample, fuzzy and fuzzy measure entropy and their implications for HRV data. The objective of this paper is to provide a reference for choosing parameters for HRV applications based on their influence on the different entropies' ability to distinguish significantly between pathological and non-pathological recordings. The main research questions addressed in this paper are: (1) what does the choice of the threshold value(s) mean for the entropies to be a direct measure of regularity, (2) how does the data length influence significance for different data sets, (3) how should the weighting factor(s) be chosen for fuzzy and fuzzy measure entropy, and finally, (4) what are the constraints for choosing the threshold values?
Methods
In the following sections approximate (ApEn), sample (SampEn), fuzzy (FuzzyEn) and fuzzy measure entropy (FuzzyMEn) are described in detail. They are built on each other and ordered by increasing complexity. The number of parameters increases from three for ApEn to six for FuzzyMEn, with the basic parameters staying the same and being extended by new ones in each step. Afterwards, the challenge of parameter selection and the tests conducted for the parameters to the different entropy types are described. Finally, the data used for all performed tests are briefly described.
Approximate entropy
The main idea behind approximate entropy is that a sequence is regular if a subsequence and an expansion of the subsequence are similar. It was developed by Pincus [8] and is calculated the following way.
The approximate entropy is then defined as $\text{ApEn}\left(m,r\right):={\displaystyle \underset{N\to \infty}{\text{lim}}}\left({\varphi}^{m}-{\varphi}^{m+1}\right)$, which can be estimated by $\text{ApEn}\left(m,r,N\right):={\varphi}^{m}-{\varphi}^{m+1}$.
Sample entropy
Richman and Moorman showed in [23] that approximate entropy is biased towards regularity. Thus, they modified it to sample entropy. The main difference between the two is that sample entropy does not count self-matches, and only compares the first N − m subsequences instead of all N − m + 1, so the same amount of subsequences are used in ϕ^{ m } and ϕ^{m+1}[23]. It is calculated in the following way.
Sample entropy is then defined as $\text{SampEn}\left(m,r\right):={\displaystyle \underset{N\to \infty}{\text{lim}}}\left(\text{ln}{\varphi}^{m}-\text{ln}{\varphi}^{m+1}\right),$ which can be estimated by $\text{SampEn}\left(m,r,N\right):=\text{ln}{\varphi}^{m}-\text{ln}{\varphi}^{m+1}.$
Fuzzy entropy
Fuzzy entropy is defined as $\text{FuzzyEn}\left(m,r,n\right):={\displaystyle \underset{N\to \infty}{\text{lim}}}\left(\text{ln}{\varphi}^{m}-\text{ln}{\varphi}^{m+1}\right)$ and can be estimated by $\text{FuzzyEn}\left(m,r,n,N\right):=\text{ln}{\varphi}^{m}-\text{ln}{\varphi}^{m+1}.$
Fuzzy measure entropy
Liu et al. proposed adding a function for global similarity to the fuzzy entropy and called the combination fuzzy measure entropy [24]. It can be calculated as follows.
Fuzzy measure entropy is then defined as FuzzyMEn(m, r_{ L }, r_{ F }, n_{ L }, n_{ F }) := $\underset{N\to \infty}{\text{lim}}}\left(\text{ln}{\varphi}_{L}^{m}-\text{ln}{\varphi}_{L}^{m+1}+\text{ln}{\varphi}_{F}^{m}-\text{ln}{\varphi}_{F}^{m+1}\right)$, which can be estimated by $\text{FuzzyMEn}\left(m,{r}_{L},{r}_{F},{n}_{L},{n}_{F},N\right):=\left(\text{ln}{\varphi}_{L}^{m}-\text{ln}{\varphi}_{L}^{m+1}+\text{ln}{\varphi}_{F}^{m}-\text{ln}{\varphi}_{F}^{m+1}\right)$.
Parameter selection
As one can see in the description of each entropy, the various entropy types have three, four or six parameters. Thus, there are many possible combinations to choose from. The parameters which are varied in the test cases, their ranges and values mentioned in the literature, and the choice of certain fixed parameters are described here.
The most common choice for the template length is m = 2, as it was recommended by Pincus and Goldberger for ApEn [7], by Yentes et. al for SampEn [12], and confirmed by other studies, e.g. [20]. A false nearest neighbor method is also sometimes used, but according to Chon et al. the standard choice leads to the statistically best solutions for ApEn for human heart rate variability data [25]. In [9, 24], the template length is set to m = 2 for other entropy types as well. For comparability, this value was used for all tests.
Some publications describe a sensitivity of the entropies to the data length N [1, 6, 8]. Therefore, the significance of the entropies calculated of parts of the heart rate variability data has been tested with increasing data set size.
For the threshold parameter r, Pincus suggested in [8] to choose a value between 0.1 σ and 0.25 σ, where σ is the sample standard deviation of the data sequence. This is also the standard range in most publications, with the most common choice of r = 0.2 σ [17, 21, 23, 24, 26]. To examine threshold parameters in the standard range, they were tested with heart rate variability data.
where σ_{1} is the standard deviation of the distances in the data sequence, i.e., the standard deviation of {(u_{1} − u_{2}), . . . , (u_{N−1}− u_{ N })}, and σ_{2} the standard deviation of the complete data sequence. This formula was derived from non-physiological data and Liu et al. showed in [17] that it is not always a good approximation of the maximizing r, but actually leads to more significant results than the maximizing r, when applied to heart rate variability data.
Regarding fuzzy function parameters, Chen et al. [9] used n = 2 and r = 0.2 σ for test signals. They described in [9] that for a larger n, the closer data points are weighted more strongly. Liu et al. [24] used the weighting factors n_{ L } = 3, n_{ F } = 2 and r_{ L } = r_{ F } = 0.2σ for heart rate variability analysis. Their choices for n_{ L } and n_{ F } were given without any motivation.
Test cases
To get further knowledge of the parameters, heart rate variability data were used to compare different choices of r_{ L }, r_{ F } and n, n_{ L } and n_{ F } with respect to the resulting significance of the statistical tests comparing pathological and normal cases.
The following test cases using the data described in the Data section have been conducted to answer the research questions stated in the Background section and to support an "optimal" parameter selection for all entropy types for heart rate variability data. For each research question, one test case has been designed (except for the third, which is covered in two test cases: one for each entropy type under consideration). All tests have been conducted consecutively. Fixed parameters have been taken from literature or based on the outcomes of preceding tests.
Test case 1
Variation of the threshold values r = r_{ L } = r_{ F } within the standard interval [0.1 σ, 0.25 σ] [8] to show its influence on the entropy values and the aforementioned flip-flop effect [21, 27]. Fixed parameters were m = 2, n = n_{ L } = 3 and n_{ F } = 2 [7, 24]. For data length N, the data length of the shortest RR interval sequence of the available data N = 1126 sets was used.
Test case 2
Variation of the data length N = x · 110 with x ∈ [1, 10] to show its influence on the significance. Maximum N was 1100 due to the length of the available test data. Fixed parameters were m = 2, n = n_{ L } = 3, n_{ F } = 2 and r = r_{Chon} or r = 0.2 σ [7, 17, 21, 23–26].
Test case 3
Variation of the weighting factor n for FuzzyEn in the interval [1, 6] to show its influence on the significance. Fixed parameters were m = 2 and r = r_{Chon} or r = 0.2 σ [7, 17, 21, 23–26]. N = 1000 was chosen based on the results of test case 2.
Test case 4
Variation of the weighting factors n_{ L } and n_{ F } for FuzzyMEn in the interval [1, 6] to show their influence on the significance. Fixed parameters were m = 2 and r_{ L } = r_{ F } = r_{Chon} or r_{ L } = r_{ F } = 0.2 σ [7, 17, 21, 23–26]. N = 1000 was chosen based on the results of test case 2.
Test case 5
Variation of the threshold values r_{ L } and r_{ F } for FuzzyMEn in an interval of [0.25 · r_{Chon}, 6 · r_{Chon}] and in an interval of [0.1 σ, 0.25 σ][8] to show their influence on the significance. The parameter m = 2 was fixed [7]. N = 1000, n_{ L } and n_{ F } were chosen based on the results of the previous tests.
In order to test for their statistical significance, the calculated entropies were first tested for normality using Lilliefors' composite goodness-of-fit test [28]. If this test was positive for all results within a subset of a test case (i.e., certain database and/or choice for r), the p-value was calculated with a two sample t-test, and otherwise a Wilcoxon rank sum test was performed. To ensure comparability, the same statistical test was used for each subset of a test case.
Since statistical tests used in this work are based on the same null hypothesis, the same subject groups, the same endpoints and only slight variations of the analysis method, interaction of the observed results is not only possible, but highly probable. On the other hand, p-value adjustments such as the commonly used Bonferroni correction assume uncorrelated endpoints and are therefore considered inappropriate for the tasks in this work [29]. Besides, the aim of this work is not to test whether there is a difference between groups, but to investigate the ability of varied methods to detect those differences.
Due to the high computational complexity of the tests conducted (e.g., the calculation of the variation of the weighting factors n_{ L } and n_{ F } for FuzzyMEn takes several hours for only one r and one comparison of databases), the authors had to refrain from more robust randomized testing strategies (i.e., the permutation of the original data), since the computation time would multiply by at least a thousand times.
Data
All data used for the described tests have been taken from http://Physionet.org[30], a free-access, on-line archive of physiological signals. They are described in detail in this section.
To create a control group all databases described as non-pathological were combined into one database, afterwards called np. This database contains the Normal Sinus Rhythm RR Interval Database, which consists of the beat annotations of 54 long-term ECG recordings, digitized at 128 samples per second, of subjects with normal sinus rhythm (30 men, aged 28.5 to 76, and 24 women, aged 58 to 73). Furthermore, it includes the MIT-BIH Normal Sinus Rhythm Database, which consists of 18 long-term recordings (5 men, aged 26 to 45, and 13 women, aged 20 to 50) digitized at 128 samples per second. The Fantasia Database of 120-minute recordings of twenty young (10 men and 10 women; 21 - 34 years old) and twenty elderly (10 men and 10 women; 68 - 85 years old) healthy subjects with ECG digitized at 250 Hz was also used [31]. The record "fantasia/f1o09" had to be excluded due to its high number of supraventricular premature beats. This results in a total database size of 111 recordings. RR intervals greater than 2.5 seconds were excluded to ignore artifacts.
Two databases were used to search for pathological effects. One was the Congestive Heart Failure RR Interval Database, afterwards referred to as chf2, which includes 29 long-term ECG recordings, with a sampling frequency of 128 Hz, of subjects aged 34 to 79 (8 men and 2 women; gender not known for the remaining subjects) with congestive heart failure (NYHA classes I, II, and III) [32].
The second one was the MIT-BIH Arrhythmia Database, afterwards called mit, which contains 48 half-hour recordings, sampled with a frequency of 360 Hz, from 47 subjects (25 men aged 32 to 89 years and 22 women aged 23 to 89 years) [33]. It contains a set of randomly chosen signals and 25 signals especially chosen to include examples of uncommon but clinically important arrhythmias recorded at the BIH Arrhythmia Laboratory [33].
The two databases chf2 and mit were always evaluated separately to keep the results as homogeneous as possible and to avoid the mutual neutralization of ab- normalities. To ensure comparability, the data length N was equal for all recordings. Longer recordings were cropped at the beginning and the end in equal shares.
In a way, this work can be considered as a pilot study, as only previously recorded data are used. Furthermore, the findings of this work will be incorporated in follow-up studies.
Results
Summary of the best results achieved for each test case.
Test case number | Test case description | Results with | |
---|---|---|---|
r = r_{Chon} | r = 0.2 σ | ||
1 | Variation of the threshold values r = r_{ L } = r_{ F } within the standard interval [0.1 σ, 0.25 σ] | Not applicable | Flip-flop effect verified for ApEn but not for other entropies |
2 | Variation of the data length N = x · 110 with x ∈ [1,10] | Significant with N > 1000 | Significant with N > 200 |
3 | Variation of the weighting factor n for FuzzyEn in the interval [1,6] | Significant with n < 3 | Significant with n = 1 |
4 | Variation of the weighting factors n_{ L } and n_{ F } for FuzzyMEn in the interval [1,6] | Best results with n_{ L } = 2, n_{ F } = 1 | Best results with n_{ L } = 1, n_{ F } = 3 |
5 | Variation of r_{ L } and r_{ F } for FuzzyMEn in an interval of [0.25 · r_{Chon}, 6 · r_{Chon}] and in an interval of [0.1σ, 0.25σ] | inconclusive |
Figure 1 corresponds to test case 1 and shows the effect of different threshold values on the entropy values. One of the defining characteristics of FuzzyEn and FuzzyMEn, their relative insensitivity to changes in r, is clearly visible. The flip-flop effect can be observed for ApEn. For the chf2 database, entropy values are higher for pathological data where r < 0.15 σ, whereas they are lower for r > 0.15 σ. This behavior is reversed for the mit database, with lower entropy values for pathological data for r < 0.18 σ and higher entropy values for pathological data for r > 0.18 σ.
The results for test case 2, analyzing the sensitivity of the entropy types to the data length are represented in figure 2. The more data points evaluated, the higher the separation between pathological and non-pathological data. When using the threshold value r = 0.2 σ, significance is already reached with N ≥ 200 data points, whereas with r = r_{Chon} more data (N ≥ 1000) are needed before significance is reached. Comparing the different methods when using r = 0.2 σ, one can see that they only differ when N is very small. With higher N , their behavior converges.
Figure 3 presents the results of test case 3, showing that the results become insignificant for n > 3 with r = r_{Chon}, and for n > 1 with r = 0.2 σ in the mit database (C, D). The increasing p-values are below the significance level (p < 0.05) for the chf2 database within the observed range of n (A, B).
Figure 4, corresponding to test case 4, shows a higher significance for n_{ L } ≤ 2 and all values of n_{ F } (A), or in the case of r = 0.2 σ for n_{ F } ∈ [2, 3.5] (B). In figure 4 (C and D) the situation is different: the best results are achieved with n_{ F } < 1.5 and n_{ L } ≥ 2 for r = r_{Chon} in case C, whereas for r = 0.2 σ (D) the best performance is reached with n_{ L } < 1.5 and any n_{ F }.
The results for test case 5 are shown in figure 5. They represent the ability of FuzzyMEn to differentiate between chf2 and np (A and B) and mit and np (C and D) for different choices of the threshold values r_{ F } and r_{ L }, when chosen as multiples of r_{Chon} in the range of [0.25 · r_{Chon}, 6 · r_{Chon}] (A, C), and calculated according to the results of test case 4 with n_{ L } = 2 and n_{ F } = 1, or inside the standard range of [0.1 σ, 0.25 σ] (B, D), calculated with n_{ L } = 1 and n_{ F } = 3. In (A), the best results are achieved with ${r}_{L}\lesssim 1\cdot {r}_{\text{Chon}}$ or ${r}_{F}\lesssim 1\cdot {r}_{\text{Chon}}$, whereas they have to be greater than or equal to 1 · r_{Chon} in (C) for good performance. In (B), r_{ L } < 0.12 σ or ${r}_{L}\gtrsim 0.2\sigma $ yield significant results, however rF does not matter that much. In contrast, the best performance in (D) is achieved with ${r}_{F}\gtrsim 0.18\sigma $ and r_{ L }∈ [0.14 σ, 0.22 σ].
Discussion
Recommendations for different combinations of parameters.
r= r_{ L }= r_{ F } | m | N | n= n_{ L } | n _{ F } |
---|---|---|---|---|
r _{Chon} | 2 | > 1000 | 2 | 1 |
0.2 σ | 2 | > 200 | 1 | 3 |
Some of the tests concerning the parameter selection showed no clear results. A couple of evaluations revealed contradicting results depending on the database (chf2 vs. mit ) or on the chosen parameters (e.g., r = r_{ L } = r_{ F } = r_{Chon} vs. r = r_{ L } = r_{ F } = 0.2σ). The latter is easy to overcome by choosing a different set of parameters based on the choice of the threshold value r. However, a compromise must be found in order to find parameters suitable for both databases.
One of the hardest difficulties lies in the choice of the threshold value r due to the flip-flop effect, i.e., for some parameters one data set has a higher entropy compared to another, but this order is reversed for different parameter choices [17, 27]. This can occur for simple signals, but also when analyzing heart rate variability data, as in figure 1. This leads to difficulties with the interpretation of the entropy, i.e., the direct assignment of entropy values to pathological or non-pathological data without a given r. In our tests, the effect occurred for ApEn, but it is also reported to occur for SampEn and FuzzyEn as well by Boskovic et al. [27]. The flip-flop effect does not allow us to make a clear statement, as in [7–9, 17], that a higher entropy corresponds to a higher irregularity. Since this effect can happen for all different entropy values [27], they should not be seen as a direct measure of regularity, but rather as an indicator of differences in regularity with regard to certain time periods.
Compared to the threshold value, the data length N seems to have a smaller effect on the ability of the entropy measures to differentiate pathological from non-pathological data sets. Generally, a larger N leads to a higher probability of significance. If possible, the data length N should be longer than 200 data points when using r = 0.2 σ. This finding is consistent with previous studies, e.g. [12]. Surprisingly, it should even exceed a length of 1000 data points when using r = r_{Chon}, assuming a continuing trend in figure 2. Due to the length of the available test data, the range had to be restricted for this test. In case of HRV data, N is the number of recorded heartbeats and therefore proportional to the duration of the recording. Thus, to increase N, longer measurements are necessary. The Task Force of The European Society of Cardiology and The North American Society of Pacing and Electrophysiology [4] recommends a duration of five minutes for short time recordings, which would result in 300 data points at an average heart rate of 60 beats per second. This would be sufficient when using r = 0.2 σ, but not for r = r_{Chon}. These considerations should be kept in mind when dealing with HRV data.
Due to the different behavior when varying n, n_{ L } and n_{ F } given different threshold parameters r = r_{ L } = r_{ F } = r_{Chon} and r = r_{ L } = r_{ F } = 0.2 σ, the parameters n, n_{ F } and n_{ L } have been chosen independently for the different threshold values. This is no constraint to the method, since the choice of r is known beforehand anyway (and is not based on the potentially unknown medical condition of the subject). The values n_{ L } = 2 and n_{ F } = 1 for r_{ L } = r_{ F } = r_{Chon}, and n_{ L } = 1 and n_{ F } = 3 for r_{ L } = r_{ F } = 0.2 σ showed better results than n_{ L }= 3, n_{ F } = 2 as proposed by Liu et al. in [24]. Unsurprisingly, similar values were found for n, as n for FuzzyEn equals n_{ L } for FuzzyMEn. For consistency, n = n_{ L } = 2 for r = r_{Chon} and n = n_{ L } = 1 for r = 0.2 σ are recommended.
The tests concerning r_{ F } and r_{ L } showed that there is no optimal choice, since the results in figure 5 (A) and (C) as well as (B) and (D) contradict each other. Nevertheless, both r_{ F } = r_{ L } = r_{Chon} and r_{ F } = r_{ L } = 0.2 σ, as described in the literature [9, 24, 25], are the most reasonable compromise between figure 5 (A) and (C), and (B) and (D), respectively.
Finally, a number of important limitations of this study need to be considered. First, this study is limited to previously recorded signals of only two different cardiac diseases due to the availability of data. As already mentioned in the Data section, a separate evaluation is warranted, to avoid the mutual neutralization of abnormalities. Furthermore, the template length was fixed to m = 2 for all calculations, as the number of possible variations of parameters would get too high otherwise and m = 2 seems to be a reasonable choice in all investigated literature, e.g., [8]. In [34] it was reported that spikes due to recording errors in heart rate variability data can disturb ApEn and SampEn. No tests were done to examine this behavior for the included entropies. Finally, the study did not evaluate the dependency of the entropies on age and gender as reported in literature [35, 36].
Conclusions
The results of the presented study clearly stress the need for further investigations of signal entropy for heart rate variability data. Given the wide range of different medical conditions of subjects, the assortment of available methods to calculate entropy, and their customizability with up to six degrees of freedom, it is almost impossible to cover all combinations in a single study. Future work will therefore be focused on overcoming the limitations of the presented work, i.e., extending the evaluations to other cardiac diseases, the variation of the template length m, investigating the robustness with respect to recording errors, and the relation of the parameter choice to age and gender.
Notes
Declarations
Declarations
This research received no grant from any funding agency. Publication for this article has been funded by the research group.
This article has been published as part of BMC Bioinformatics Volume 15 Supplement 6, 2014: Knowledge Discovery and Interactive Data Mining in Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/15/S6.
Authors’ Affiliations
References
- Rajendra Acharya U, Paul Joseph K, Kannathal N, Lim C, Suri J: Heart rate variability: a review. Med Biol Eng Comput. 2006, 44: 1031-1051. 10.1007/s11517-006-0119-0. doi:10.1007/s11517-006-0119-0View ArticlePubMedGoogle Scholar
- Batchinsky AI, Salinas J, Cancio LC: Assessment of the Need to Perform Life-Saving Interventions Using Comprehensive Analysis of the Electrocardiogram and Artificial Neural Networks. RTO-MP-HFM-182: Use of Advanced Technologies and New Procedures in Medical Field Operations. 2010, 39-116.Google Scholar
- Bachler M, Mayer C, Hametner B, Wassertheurer S, Holzinger A: Online and offline determination of qt and pr interval and qrs duration in electrocardiography. Pervasive Computing and the Networked World Lecture Notes in Computer Science. Edited by: Zu, Q., Hu, B., El¸ci, A. 2013, Springer, Berlin Heidelberg, 7719: 1-15. 10.1007/978-3-642-37015-1_1.View ArticleGoogle Scholar
- American Heart Association Inc, European Society of Cardiology: Guidelines - heart rate variability. Eur Heart J. 1996, 17: 354-381. 10.1093/oxfordjournals.eurheartj.a014868.View ArticleGoogle Scholar
- Pincus SM: Assessing serial irregularity and its implications for health. Annals of the New York Academy of Sciences. 2001, 954: 245-267.View ArticlePubMedGoogle Scholar
- Hornero R, Aboy M, Abasolo D, McNames J, Goldstein B: Interpretation of approximate entropy: analysis of intracranial pressure approximate entropy during acute intracranial hypertension. IEEE Trans Biomed Eng. 2005, 52 (10): 1671-1680. 10.1109/TBME.2005.855722.View ArticlePubMedGoogle Scholar
- Pincus SM, Goldberger AL: Physiological time-series analysis: what does regularity quantify?. Am J Physiol. 1994, 266 (4 Pt 2): 1643-1656.Google Scholar
- Pincus SM: Approximate entropy as a measure of system complexity. Proc Natl Acad Sci USA. 1991, 88 (6): 2297-2301. 10.1073/pnas.88.6.2297.PubMed CentralView ArticlePubMedGoogle Scholar
- Chen W, Zhuang J, Yu W, Wang Z: Measuring complexity using FuzzyEn, ApEn, and SampEn. Med Eng Phys. 2009, 31 (1): 61-68. 10.1016/j.medengphy.2008.04.005.View ArticlePubMedGoogle Scholar
- Hornero R, Aboy M, Abasolo D, McNames J, Wakeland W, Goldstein B: Complex analysis of intracranial hypertension using approximate entropy. Crit Care Med. 2006, 34 (1): 87-95. 10.1097/01.CCM.0000190426.44782.F0.View ArticlePubMedGoogle Scholar
- Sarlabous L, Torres A, Fiz JA, Gea J, Martínez-Llorens JM, Morera J, Jané R: Interpretation of the approximate entropy using fixed tolerance values as a measure of amplitude variations in biomedical signals. Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE. 2010, 5967-5970. doi:10.1109/IEMBS.2010.5627570View ArticleGoogle Scholar
- Yentes J, Hunt N, Schmid K, Kaipust J, McGrath D, Stergiou N: The appropriate use of approximate entropy and sample entropy with short data sets. Annals of Biomedical Engineering. 2013, 41 (2): 349-365. 10.1007/s10439-012-0668-3. doi:10.1007/s10439-012-0668-3View ArticlePubMedGoogle Scholar
- Acharya UR, Molinari F, Sree SV, Chattopadhyay S, Ng KH, Suri JS: Automated diagnosis of epileptic eeg using entropies. Biomedical Signal Processing and Control. 2012, 7 (4): 401-408. 10.1016/j.bspc.2011.07.007.View ArticleGoogle Scholar
- Roerdink M, De Haart M, Daffertshofer A, Donker SF, Geurts AC, Beek PJ: Dynamical structure of center-of-pressure trajectories in patients recovering from stroke. Exp Brain Res. 2006, 174 (2): 256-269. 10.1007/s00221-006-0441-7.View ArticlePubMedGoogle Scholar
- Kaplan DT, Furman MI, Pincus SM, Ryan SM, Lipsitz LA, Goldberger AL: Aging and the complexity of cardiovascular dynamics. Biophysical Journal. 1991, 59 (4): 945-949. 10.1016/S0006-3495(91)82309-8. doi:10.1016/S-0006-3495(91)82309-8PubMed CentralView ArticlePubMedGoogle Scholar
- Holzinger A, Stocker C, Bruschi M, Auinger A, Silva H, Gamboa H, Fred ALN, Huang R, Ghorbani AA, Pasi G, Yamaguchi T, Yen NY, Jin B: On applying approximate entropy to ecg signals for knowledge discovery on the example of big sensor data. AMT Lecture Notes in Computer Science. 2012, Springer, Berlin Heidelberg, 7669: 646-657. 10.1007/978-3-642-35236-2_64.Google Scholar
- Liu C, Liu C, Shao P, Li L, Sun X, Wang X, Liu F: Comparison of different threshold values r for approximate entropy: application to investigate the heart rate variability between heart failure and healthy control groups. Physiol Meas. 2011, 32 (2): 167-180. 10.1088/0967-3334/32/2/002.View ArticlePubMedGoogle Scholar
- Ramdani S, Bouchara F, Lagarde J: Influence of noise on the sample entropy algorithm. Chaos: An Interdisciplinary Journal of Nonlinear Science. 2009, 19 (1): doi:10.1063/1.3081406Google Scholar
- Chen W, Wang Z, Xie H, Yu W: Characterization of surface emg signal based on fuzzy entropy. Neural Systems and Rehabilitation Engineering, IEEE Transactions on. 2010, 15 (2): 266-272. doi:10.1109/TNSRE.2007.897025View ArticleGoogle Scholar
- Alcaraz R, Abásolo D, Hornero R, Rieta JJ: Optimal parameters study for sample entropy-based atrial fibrillation organization analysis. Computer Methods and Programs in Biomedicine. 2010, 99 (1): 124-132. 10.1016/j.cmpb.2010.02.009. doi:10.1016/j.cmpb.2010.02.009View ArticlePubMedGoogle Scholar
- Castiglioni P, Di Rienzo M: How the threshold "r" influences approximate entropy analysis of heart-rate variability. IEEE. 2008, 561-564.Google Scholar
- Lake DE, Richman JS, Griffin MP, Moorman JR: Sample entropy analysis of neonatal heart rate variability. American Journal of Physiology - Regulatory, Integrative and Comparative Physiology. 2002, 283 (3): 789-797. doi:10.1152/ajpregu.00069.2002., [http://ajpregu.physiology.org/content/283/3/R789.full.pdf]View ArticleGoogle Scholar
- Richman JS, Moorman JR: Physiological time-series analysis using approximate entropy and sample entropy. Am J Physiol Heart Circ Physiol. 2000, 278 (6): 2039-2049.Google Scholar
- Liu C, Li K, Zhao L, Liu F, Zheng D, Liu C, Liu S: Analysis of heart rate variability using fuzzy measure entropy. Comput Biol Med. 2013, 43 (2): 100-108. 10.1016/j.compbiomed.2012.11.005.View ArticlePubMedGoogle Scholar
- Chon K, Scully CG, Lu S: Approximate entropy for all signals. IEEE Eng Med Biol Mag. 2009, 28 (6): 18-23.View ArticlePubMedGoogle Scholar
- Lu S, Chen X, Kanters JK, Solomon IC, Chon KH: Automatic selection of the threshold value R for approximate entropy. IEEE Trans Biomed Eng. 2008, 55 (8): 1966-1972.View ArticlePubMedGoogle Scholar
- Boskovic A, Loncar-Turukalo T, Japundzic-Zigon N, Bajic D: The flip-flop effect in entropy estimation. IEEE. 2011, 227-230.Google Scholar
- Lilliefors HW: On the kolmogorov-smirnov test for normality with mean and variance unknown. Journal of the American Statistical Association. 1967, 62 (318): 399-402. 10.1080/01621459.1967.10482916. doi:10.1080/01621459.1967.10482916, [http://amstat.tandfonline.com/doi/pdf/10.1080/01621459.1967.10482916]View ArticleGoogle Scholar
- Altman DG, Machin D, Bryant TN, Gardner MJ, Gardner S, Bird S, Campbell M, Daly LE, Morris J: Statistics with confidence: confidence intervals and statistical guidelines. 2000Google Scholar
- Goldberger AL, Amaral LA, Glass L, Hausdorff JM, Ivanov PC, Mark RG, Mietus JE, Moody GB, Peng CK, Stanley HE: PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000, 101 (23): 215-220. 10.1161/01.CIR.101.23.e215.View ArticleGoogle Scholar
- Iyengar N, Peng C, Morin R, Goldberger A, Lipsitz LA: Age-related alterations in the fractal scaling of cardiac interbeat interval dynamics. American Journal of Physiology-Regulatory, Integrative and Comparative Physiology. 1996, 271 (4): 1078-1084.Google Scholar
- Baim DS, Colucci WS, Monrad ES, Smith HS, Wright RF, Lanoue A, Gauthier DF, Ransil BJ, Grossman W, Braunwald E: Survival of patients with severe congestive heart failure treated with oral milrinone. J Am Coll Cardiol. 1986, 7 (3): 661-670. 10.1016/S0735-1097(86)80478-8.View ArticlePubMedGoogle Scholar
- Moody GB, Mark RG: The impact of the MIT-BIH arrhythmia database. IEEE Eng Med Biol Mag. 2001, 20 (3): 45-50. 10.1109/51.932724.View ArticlePubMedGoogle Scholar
- Molina-Picó A, Cuesta-Frau D, Aboy M, Crespo C, Miró-Martínez P, Oltra-Crespo S: Comparative study of approximate entropy and sample entropy robustness to spikes. Artif Intell Med. 2011, 53 (2): 97-106. 10.1016/j.artmed.2011.06.007.View ArticlePubMedGoogle Scholar
- Beckers F, Verheyden B, Aubert AE: Aging and nonlinear heart rate control in a healthy population. American Journal of Physiology-Heart and Circulatory Physiology. 2006, 290 (6): 2560-2570. 10.1152/ajpheart.00903.2005.View ArticleGoogle Scholar
- Ryan SM, Goldberger AL, Pincus SM, Mietus J, Lipsitz LA: Gender- and age-related differences in heart rate dynamics. Are women more complex than men? Journal of the American College of Cardiology. 1994, 24 (7): 1700-1707. doi:10.1016/0735-1097(94)90177-5View ArticlePubMedGoogle Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.