Skip to main content

Tremor assessment using smartphone sensor data and fuzzy reasoning



Tremor severity assessment is an important step for the diagnosis and treatment decision-making of essential tremor (ET) patients. Traditionally, tremor severity is assessed by using questionnaires (e.g., ETRS and QUEST surveys). In this work we assume the possibility of assessing tremor severity using sensor data and computerized analyses. The goal of this work is to assess severity of tremor objectively, to be better able to asses improvement in ET patients due to deep brain stimulation or other treatments.


We collect tremor data by strapping smartphones to the wrists of ET patients. The resulting raw sensor data is then pre-processed to remove any artifact due to patient’s intentional movement. Finally, this data is exploited to automatically build a transparent, interpretable, and succinct fuzzy model for the severity assessment of ET. For this purpose, we exploit pyFUME, a tool for the data-driven estimation of fuzzy models. It leverages the FST-PSO swarm intelligence meta-heuristic to identify optimal clusters in data, reducing the possibility of a premature convergence in local minima which would result in a sub-optimal model. pyFUME was also combined with GRABS, a novel methodology for the automatic simplification of fuzzy rules.


Our model is able to assess tremor severity of patients suffering from Essential Tremor, notably without the need for subjective questionnaires nor interviews. The fuzzy model improves the mean absolute error (MAE) metric by 78–81% compared to linear models and by 71–74% compared to a model based on decision trees.


This study confirms that tremor data gathered using the smartphones is useful for the constructing of machine learning models that can be used to support the diagnosis and monitoring of patients who suffer from Essential Tremor. The model produced by our methodology is easy to inspect and, notably, characterized by a lower error with respect to approaches based on linear models or decision trees.


Millions of people worldwide are affected by neurological diseases that induce tremors. Although tremor is often associated with Parkinson’s disease, it can also be a symptom of other neurological disorders. A patient suffering from tremor shows involuntary oscillations of any body part around any plane [1]. When there is no demonstrable cause of this tremor, the patient is diagnosed with Essential Tremor (ET).

Tremor hinders many daily tasks, such as precision tasks like buttoning shirts or other tasks like holding a cup of hot coffee [2]. For the correct diagnosis and treatment of tremor-related diseases, tremor severity assessment is an important step. Tremor severity assessment is also an important tool for evaluating treatment and monitoring how the disease develops, since essential tremor is a chronic and progressive neurological disease. Up to now, tremor severity is measured with qualitative rating scales, such as the Quality of life in Essential Tremor (QUEST) score or the Essential Tremor Rating Scale (ETRS) score.

The QUEST score [3] is determined by making the patient fill in a 30-item questionnaire that contains questions related to the effect of the tremor on the patient’s quality of life. Examples of such questions are “My tremor interferes with my ability to communicate with others” and “I have lost interest in my hobbies because of my tremor”.

The ETRS [4] is filled by a medically trained assessor and evaluates the severity of the tremor in rest, postural, and during movement. The assessor asks the patient to perform 21 different tasks (such a pouring a cup of water, or drawing a line or spiral) and scores the severity of the tremor during these tasks on a 0–4 scale.

Both the QUEST and the ETRS scores are widely used for diagnostics and treatment of ET, but they are subjective in nature. Using direct measurements from wearable sensors could provide more objectivity of the tremor severity assessment. Therefore, the attention for computerized tremor analysis has increased recently (for example, see [5,6,7,8]).

These previous studies often rely on dedicated devices to measure the tremor. These devices are expensive and only useful in clinical settings since patients do not have these devices at home. In this study, we propose using the relatively cheap and widely available sensors of smartphones to measure tremor severity. To do so, we rely on the smartphone application TREMOR12 [9], an open-source mobile app was developed by clinicians from the Maastricht University Medical Center. The app is freely available for research purposes in the Apple App Store.

In this study, we aim to develop a fuzzy model that maps the relationship between the QUEST and ETRS score and the sensor measurements of a smartphone strapped to the wrist of an ET patient. We use machine learning to determine the optimal parameters of the model. While many machine learning models (such as deep neural networks) behave like black boxes, fuzzy models have the advantage of being transparent. Because of this property, humans can inspect and study the behavior of the model. The resulting model can help clinicians to more objectively assess the severity of the patient’s tremor and can increase the inter-rater reliability among clinicians.

Related work

A literature study by Grimaldi and Manto [10] reveals that four different types of sensors are commonly used to measure tremor for in ET patients:

  1. 1.

    Accelerometers assess the linear acceleration on a specific axis (in g). These measurements are a combination of linear acceleration, gravity, and additive noise. To diagnose tremor, only the linear acceleration component is relevant, but up to now, no analytic model has been validated to separate gravity (and noise) from the linear acceleration. Low-pass filtering is currently used in most studies [10].

  2. 2.

    Gyroscopes measure how fast the angular position or orientation of an object changes over time. Because of this, gyroscopes are used to measure the rotation speed (in radians per second) and the rotation or orientation (in radians).

  3. 3.

    Elektromyograms (EMGs) are used to measure the activity of the muscle fibers. This data is collected by a fine wire, which is inserted into a muscle of the ET patient. In [11], researchers were successful in differentiating Parkinson’s disease from Essential Tremor utilizing an Elektromyogram.

  4. 4.

    Force sensors assess the torque in newton meter and angular motion (in radians per second). The torque is the force applied to rotate an object around a pivot or axis. Force meters are assumed to be promising for assessing tremor severity, but the measuring devices are non-portable and expensive.

In Table 1 the characteristics of each of the before mentioned sensors for assessing tremor severity are summarized. Both the accelerometer and the gyroscope are affordable and non-invasive, which makes them easy to use on a larger scale. Therefore, studies that utilize these types of sensors will be the focus of the remainder of this section.

Table 1 Characteristics of sensors to measure tremor severity

In a pilot study [8], the feasibility of assessing the severity of the symptoms and motor complications for Parkinson’s disease patients by means of accelerometer data is investigated. Twelve patients were asked to wear a body sensor network that consisted of 8 accelerometers: two sensors were located on the upper arms (one right, one left) two on the lower arms, two on the upper legs, and two sensors were located on the lower legs of each patient. The data that was gathered was used to train a support vector machine which classified the severity of tremor, dyskinesia, and bradykinesia. For validation, the output of this classification model was compared with the assessment of medically trained clinicians. The error of the model (compared to the clinician’s estimates) were on average 2.5%.

In another study [12] on the usefulness of accelerometer data, data of 18 Parkinson’s Disease patients and 5 control patients were gathered. During the data measurement of both resting and action/postural tremor, a set of accelerometers was mounted on different patient’s body segments. The estimation of the tremor type (resting or action/postural) and the severity of the tremor are based on features extracted from the acquired accelerometer signals and hidden Markov models. The trained models were able to quantify tremor severity with 87% accuracy, discriminate resting from postural tremor, and discriminates tremor from other Parkinsonian motor symptoms during daily activities.

A similar study [7] data of 7 Parkinson’s Disease patients were collected using wrist-worn accelerometers and gyroscopes. These sensor data were used to train a least-square-estimation model to estimate the severity of rest, postural, and kinetic tremors. The models’ estimation of tremor severity correlated well with the estimations of a neurologist (r = 0.98).

The above-mentioned studies show great potential for automatic tremor severity assessment, but all studies make use of dedicated devices to measure the tremor severity. These devices are often expensive, hard to use, or not widely available. To overcome this problem, some studies propose to make use of smartphone sensors to measure tremor severity (e.g. [13,14,15]). However, these studies (and the smartphone applications used) focus on Parkinson’s disease patients. In Table 2 it can be observed that tremor that stems from Parkinson’s Disease differs significantly from Essential Tremor. To our knowledge, only one study [6] used smartphone sensors for gathering data to estimate the severity of Essential Tremor. However, only accelerometer data is used in this study.

In this work, we aim to build a model that is able to estimate tremor severity (in terms of ETRS or QUEST score) from smartphone sensor data. In contrast to [6], we use both the accelerometer and gyroscope of a smartphone. For data gathering, the freely available TREMOR12 application [9] is used.

Table 2 Characteristics of tremor caused by Parkinson’s disease and essential tremor


Data collection

In this study we make use of data that are recorded using the the TREMOR12 application [9]. This app offers low-cost tremor quantification for research purposes and algorithm development. With TREMOR12 it is possible to record acceleration, rotation, rotation speed, and gravity, all along three axes. The measurements are all time-stamped, and it is possible to sample in a frequency of up to 100 Hz. The raw signal data of the sensors can be exported as a comma-separated file (.csv) for further processing and analysis. The application has been developed by clinicians (from the Maastricht University Medical Center, the Netherlands) to aid in evaluating the effect of the treatment of Essential Tremor patients. TREMOR12 runs on iPhone and iPod Touch (iOS8 or newer) and can be downloaded free of charge from the App Store. The source code is freely available from GitHub at the following address:

The sensor data are collected with a 10-ms sample rate (100 Hz). The measuring device is an iPhone 5s. Samples are collected using the accelerometer and the gyroscope of the phone, and measurements are done along the x, y, and z axes. The recorded variables are acceleration (in g) and rotation speed (in radians per second).

Twenty participants (11 men, 9 women), who were all diagnosed with Essential Tremor, took part in the experiment. At the time of the experiment, the average age of the participants was 67 years, and they were on average diagnosed with Essential Tremor 22 years ago. All participants underwent Deep Brain Stimulation as a treatment for Essential Tremor in the past. The ETRS scores of the participants varied between 10 and 68 and the time of the experiment, their QUEST scores were between 3 and 68.1 (Fig. 1).

Fig. 1
figure 1

(figure adapted from [16])

Body positions for data measuring

At least 95% of the patients who suffer from Essential Tremor have a tremor in the upper limbs [17]. Because of this, the sensor data recording device for this study is strapped around the participant’s wrists. The phone was attached using an arm strap that is generally used for running and other sports activities. The phone was strapped on the dorsal side of the wrist by the trained experimenter to ensure a good fit and non-rigid fixation.

The participants were asked to perform five different tests. All tests were performed twice by each participant: once using the left hand and once using the right hand. The following tests were conducted:

  1. 1.

    Rest The participant was asked to place both forearms and hands resting on a table. The tremor was measured for one minute on both wrists.

  2. 2.

    Postural 1 The participant was asked to stretch both arms forward with the hand palms facing down. This position is depicted in Fig. 1a. The tremor was measured for one minute on both wrists.

  3. 3.

    Postural 2 The participant was asked to hold both hands in front of the chest with the hand palm down and elbows sideways. This position is depicted in Fig. 1b. Again, tremor is measured on both sides for 1 min.

  4. 4.

    Glass The participant was asked to pick up a glass of water from the table, bring the glass towards the mouth, and put it back on the table to measure the kinetic tremor. This test was repeated three times for each side.

  5. 5.

    Finger-nose test The participant was asked to bring his index finger to the nose and subsequently to the index finger of the researcher. This movement was repeated three times to measure the kinetic tremor.

Data preparation

Filtering out noise

To begin and terminate the recording of data, the user must tap a button on the screen of the recording device. This procedure introduces noise at the beginning and at the end of the recording, as highlighted in Fig. 2.

Fig. 2
figure 2

Acceleration for patient 1 during the rest test on the x, y and z axis. Tapping the start and stop recording button on the device and the patient getting in the correct position creates noise in the data

According to our analyses, this type of noise is typically present in the first and the last 50 samples of the data recorded during each test. Hence, in all tests that follow, the first and last 50 samples of each signal file were automatically discarded.

An additional source of noise in the measurements is due to voluntary movement, as evidenced during glass and finger-nose test. An example of such noise is shown in Fig. 3, where the original unprocessed signal is denoted by the blue line. Wide, low-frequency peaks originate from the patient moving his finger from his nose to the researcher’s hand and vice versa, a movement that was repeated three times. This signal is not relevant for tremor assessment and must be removed. In order to filter out the noise due to voluntary movement, we applied an Equiripple Finite Impulse Response filter between 7 and 12 hertz, corresponding to Essential Tremor frequencies [1]. In this way, gravitational and motion components are removed from the signals. The result of this filtering phase is denoted in Fig. 3 by a red line.

Fig. 3
figure 3

Patient 19—finger-nose test—right hand: The signal before (blue) and after (red) equiripple FIR filtering

Feature extraction

When all signals are processed and filtered, we proceed with the feature extraction phase. The goal is to reduce the time-series to features values, i.e., scalar real values encompassing all the information that can be used in the learning phase. Specifically, the following features are extracted:

  • in the time domain, we calculate the root-mean-square (RMS) value of the signal which is a measure for the signal strength, and the signal period, which represents the average duration of one wave in the signal;

  • in the frequency domain, the dominant magnitude and frequency are extracted using Daubechies 8 wavelet [18]. The power growth is also extracted from the time-frequency domain to represent the power increase over time.

These features provide information about the highest intensity peak accumulated over the frequency range and the number of oscillations per second associated with it. Overall, 50 features (left and right wrist \(\times\) five tests \(\times\) five features per signal) must summarize the tremor for every patient.

Feature selection

Not all variables need to be meaningful when mapping the sensor data to the QUEST or ETRS score. We would like to find the smallest possible subset of variables that still provides a good mapping. To identify the most relevant variables for the models, we adopt a wrapper approach (for more information, see [19, 20]). We employ sequential forward selection in combination with cross-validation, which means variables are sequentially added to an empty candidate set until the addition of new variables does not decrease mean square error (MSE) of the model. The underlying models are CART decision trees [21] since these models are easy and fast to train and have been proven to be effective in modeling using sensor data of Essential Tremor patients [22].

Missing data

Not all tests have been performed by all patients since the finger-nose test was added to the experiment only after the data of the first four patients were already collected. Spline interpolation was used to estimate the missing values in case of incomplete data series.

Fuzzy modeling

FST-PSO clustering and fuzzy set inference

The first stage of our methodology consists of performing a fuzzy clustering [23] on the experimental data, in order to map such clusters against the linguistic variables. This step is necessary to automatically derive the fuzzy sets for the fuzzy rules. The conventional fuzzy C-means algorithm can be prone to premature convergence to local minima [24]. For this reason, we developed a novel version of the clustering algorithm powered by the swarm intelligence method Fuzzy Self-Tuning PSO (FST-PSO) [25]. FST-PSO is a settings-free variant of Particle Swarm Optimization, able to adapt its functioning to the problem under investigation. In this case, FST-PSO contributes to determining the optimal cluster centers and the corresponding fuzzy partition matrix. By projecting the elements of the clusters on the linguistic variables, and by calculating the envelope of membership values for each cluster, fuzzy sets can be created and fit automatically. Specifically, in this work we fit Gaussian fuzzy sets. These fuzzy sets are used later by our system to represent linguistic terms.

Takagi–Sugeno fuzzy model

Fuzzy models are “expert systems”, where knowledge is represented in the form of (fuzzy) rules [26]. The knowledge is formalized using fuzzy sets [27] and their membership functions, which are exploited in these fuzzy rules. During the process of fuzzy inference, a given input is mapped to the output. This mapping serves as a basis from which decisions or predictions can be made. Because the fuzzy rules are presented in natural language, they can be inspected and interpreted by humans, so that they can reveal patterns in the data.

In this work, we develop a first-order Takagi–Sugeno fuzzy model. A Takagi–Sugeno fuzzy model [28] consists of a set of fuzzy rules. Each rule characterizes a local relation between the input and output variables. The rules of a first order Takagi–Sugeno fuzzy model are all in the following format:

$$\begin{aligned} {\mathbf {R}}_j:&\text{ IF } x_{1} \text{ is } A_{j1} \text{ and } \dotsc \text{ and } x_{N} \text{ is } A_{jN} \\&\text{ THEN } y_{j} = {\mathbf {a}}_j^{T}{\mathbf {x}} + b_{j} \end{aligned}$$

In this representation \(j = 1, \dotsc J\) denotes the rule number and \({\mathbf {x}} = (x_{1}, \dotsc x_{N})\) is the input vector. N is the number of input features; \(A_{jn}\) denotes the fuzzy set for rule \(R_j\) and \(n^{th}\) feature, and \(y_j\) is the consequent function of rule \(R_j\), that is, a linear combination of the elements of \({\mathbf {x}}\), with coefficients \({\mathbf {a}}_j\) and a constant \(b_{j}\).

To assess to which degree each of the fuzzy rules applies in specific cases, the degree of fulfillment is calculated. The degree of fulfillment of a rule j is defined as:

$$\begin{aligned} \beta _j = \min (\mu _{A_{j1}} ({\mathbf {x}}),\ldots , \mu _{A_{jN}} ({\mathbf {x}})), \ \text {for } j=1,\dots ,N. \end{aligned}$$

The overall output \(y^*\) of the fuzzy model for a specific vector of input variables \(\mathbf{x }\) can then be calculated as the weighted average of the outputs of the individual rules:

$$\begin{aligned} y^*= \frac{\sum _{j=1}^{J} \beta _j y_j}{\sum _{j=1}^{J} \beta _j}. \end{aligned}$$

The creation of a Takagi–Sugeno fuzzy model generally takes place in two steps [29, 30].

  • In the structure identification phase, the number of rules and partition of the feature space is determined. This is often done by employing clustering algorithms, e.g., grid partitioning, subtractive [31] clustering, or fuzzy c-means [32]. In this study, we apply FST-PSO clustering (as described in the previous section);

  • In the step of parameter identification, the model’s parameters (e.g., the membership functions and the linear coefficients) are identified. Often least-square or derivative-based optimization techniques are used for this (for more information see [29]). In this work we rely on least-square optimization techniques.

Simplification of the rule base with GRABS

Fuzzy inference systems (FIS) found application in several fields of science over the last years because of their transparency and interpretability, which is higher than common “black-box” machine learning approaches. In order to estimate a transparent FIS from data, we proposed a new approach, named GRABS (Graph-Based Simplification), for FIS simplification which leverages graph theory to identify and remove similar fuzzy sets from rule bases [33]. GRABS uses a threshold-based similarity index (the Jaccard similarity [34]) to determine which fuzzy sets are overlapping. Then, a graph is created, where the fuzzy sets are the nodes, and similar sets are linked with an edge. GRABS detects the connected sub-components in the graph, whose nodes represent the same linguistic term. It then proceeds to the simplification (i.e., removal) of redundant nodes. Thanks to the similarity threshold hyper-parameter, GRABS can be used to reduce the bloating of rules with a tunable level of error.


For this work, we relied on two Python packages: Simpful and pyFUME. In this section, we give some background on the workings of these packages.


Simpful is a user-friendly, general-purpose, lightweight Python API for the definition of FIS [35]. Simpful’s API was designed in such a way that the modeling phase should resemble, as much as possible, human thinking and natural language, in order to simplify the definition of fuzzy sets, linguistic variables, and fuzzy rules. Simpful also supports fuzzy inference based on Sugeno reasoning of any order. One key feature of Simpful is that fuzzy rules are defined through well-formed strings of text, expressed using natural language, and thus simplifying the definition of the rule base. Thanks to the human-readable format, Simpful facilitates the inspection of the model and the interpretation of results.

The current version of Simpful (v2.0.0) supports both (i) polygonal and functional (e.g., sigmoidal, gaussian or custom shaped) fuzzy sets. Simpful also supports the definition of fuzzy rules with an arbitrary degree of complexity, built using linguistic terms combined with logic operators (e.g., AND, OR, NOT). Simpful supports the definition of FIS with multiple inputs and outputs, with no limitation, and the definition of arbitrary fuzzy networks [36]. pyFUME, described in the next paragraph, creates Simpful models automatically out of data. Simpful is available for download, under GPL license, on GitHub at the following URL: Simpful can also be installed by using the PyPI facility: pip install simpful.


pyFUME [37] is a Python library for the automatic derivation of fuzzy models, designed to provide an easy to use and extensible interface both for practitioners and researchers.

Currently, pyFUME offers facilities to simplify the following operations: loading of the input data, with automatic partitioning between training and test data sets; clustering of the data in the input-output space by means of Fuzzy C-Means (FCM) clustering [32] or the method based on Fuzzy Self-Tuning Particle Swarm Optimization (FST-PSO [24, 25]) described in the previous sections; estimation of the antecedent sets of the fuzzy model, using the method described in [38], using Gaussian (default option), double Gaussian, or sigmoidal membership functions; estimation of the consequent parameters of the first-order TS fuzzy model, implementing the functionalities described in [39]; generation, using the estimated antecedents and consequents, of an executable Simpful model (with the possibility of exporting the source code as a separate, executable file). pyFUME also provides a facility for the testing of the derived model, providing functionalities for the measurement of Root Mean Square Error (RMSE), Mean Square Error (MSE), or Mean Absolute Error (MAE).

pyFUME is implemented in Python, depends on numpy [40], scipy [41] and Simpful [35], and can be downloaded from GITHUB at the following address: pyFUME can also be installed by using the PyPI facility: pip install pyFUME.


We generated two fuzzy models using pyFUME: one to map the sensor data to the ETRS score and one to the QUEST score. The data set at hand for this study is limited in size. Because of this, we limit both models to two rules to avoid overfitting. For structure identification, FST-PSO based clustering was used [24]. The model was simplified using GRABS, with a similarity threshold of 0.90, since this number simplifies the models sufficiently without losing information and compromising on accuracy. All other settings for pyFUME were kept on default. A summary of these settings can be found in Table 3. The pyFUME code to generate the model and calculate the mean absolute error can be found in Fig. 4.

Table 3 Parameter settings used for developing the fuzzy models
Fig. 4
figure 4

The pyFUME code to generate the model that maps the sensor data to the ETRS score (line 3–4) and to calculate the mean square error of the resulting model (line 6–7)

ETRS score

In our previous work [22] we show that following features are related to the ETRS score (see also Table 4 for an overview): the power growth of the accelerometer data in rest, the dominant magnitude of the accelerometer and the signal root mean square (RMS) of the rotation speed as measured during the first postural test, and the dominant frequency (of the accelerometer) and the dominant magnitude (of the rotation speed) as measured during the second postural test. Therefore, these five variables are used to generate the fuzzy model that maps the sensor data to the ETRS score. The following two rules were obtained:

  • RULE 1: IF (dominantFrequency IS high) AND (dominantMagnitude IS any value) AND (signalPeriod IS any value) AND (powerGrowth IS any value) THEN (ETRS score = 60.40 * dominantFrequency − 87.00 * dominantMagnitude + 14.0 * signalPeriod + 7.10 * signalRMS + 57.70 * powerGrowth - 2.19)

  • RULE 2: IF (dominantFrequency IS low) AND (dominantMagnitude IS low) AND (signalPeriod IS medium) AND (powerGrowth IS medium) THEN (ETRS score = − 33.30 * dominantFrequency + 41.30 * dominantMagnitude + 10.80 * signalPeriod + 3.19 * signalRMS + 8.53 * powerGrowth + 55.70)

This model maps the sensor data to the ETRS score with a mean absolute error (MAE) of 2.70. The membership functions belonging to this fuzzy model are plotted in Fig. 5. As can be observed in the model’s rules and Fig. 5, one of the two fuzzy sets for the variable ‘signal RMS’ has been dropped by GRABS. This means the two sets showed a similarity of more than 90%.

Fig. 5
figure 5

The membership functions for the model that maps the sensor data to the ETRS score

Figure 5 seems to imply that the variable ‘Signal RMS’ does not contribute to the the performance of the system. To test this hypothesis, a new model excluding this variable is trained. The resulting rules are as follows:

  • RULE 1: IF (dominantFrequency IS high) AND (dominantMagnitude IS any value) AND (signalPeriod IS any value) AND (powerGrowth IS any value) THEN (ETRS score = 27.8 * dominantFrequency − 42.4 * dominantMagnitude + 22.4 * signalPeriod + 23.3 * powerGrowth + 8.7)

  • RULE 2: IF (dominantFrequency IS low) AND (dominantMagnitude IS low) AND (signalPeriod IS medium) AND (powerGrowth IS medium) THEN (ETRS Score = − 10.8 * dominantFrequency + 53.5 * dominantMagnitude − 18.2 * signalPeriod + 14.0 * powerGrowth + 58.5)

The membership functions belonging to this fuzzy model are plotted in Fig. 6. This model has a mean absolute error (MAE) of 1.85. It can therefore be concluded that this model outperforms the previously presented model.

The model excluding ‘Signal RMS’ performs better than a multiple regression line (\(R^2 = 0.58\), MAE = 8.44). We can also conclude that this model outperforms the model presented in [22], which was trained on the same data set and had an MAE of 6.41. Therefore, compared to the model presented in [22], the error has been reduced by 71.1%. These results are also shown in Table 5.

The computational time required to estimate the ETRS for new patients using these models is negligible . For example, computing the output of the fuzzy model—which is the most complex model of the three models that were developed—requires 0.6 ms (tested on a machine running Windows 10, with a Intel® Core™ i7-7700HQ CPU @ 2.80GHz).

Fig. 6
figure 6

The membership functions for the model that maps the sensor data to the ETRS score, excluding the variable ‘Signal RMS’

QUEST score

Following the findings presented in [22], we select the power growth of the rotation speed during the first postural test and the dominant frequency of the accelerometer as measured during the second postural test (see Table 4). These variables are then used to train the fuzzy model that maps the sensor data to the QUEST score. The trained model consist of the following set of rules:

  • RULE 1: IF (DominantFrequency IS high) AND (PowerGrowth IS any value) THEN (QUEST score = 4.66 * DominantFrequency − 8.52 * PowerGrowth + 11.70)

  • RULE 2: IF (DominantFrequency IS low) AND (PowerGrowth IS low) THEN (QUEST score = − 4.41 * DominantFrequency − 69.50 * PowerGrowth + 47.00)

The membership functions of this fuzzy model are plotted in Fig. 7. None of the fuzzy sets showed high similarity, and therefore none of the sets were dropped by GRABS.

The mean absolute error of the fuzzy model is 2.30. When using the same data to fit a multiple regression line, a low fit is found (\(R^2 = 0.17\)), which also leads to low performance (MAE = 12.10). The fuzzy model therefore clearly outperforms the regression line. The decision tree as presented in [22] has an error of 8.65. This means an improvement of 73.4% in terms of the error rate for the fuzzy model compared to a decision tree when using the same data. These results are summarized in Table 5.

The computational time required to estimate the QUEST score for new patients using the fuzzy model is 0.4 ms (tested on a machine running Windows 10, with a Intel® Core™ i7-7700HQ CPU @ 2.80GHz).

Table 4 Variables used for training the fuzzy models
Fig. 7
figure 7

The membership functions for the model that maps the sensor data to the QUEST score

Table 5 Comparison between the regression models, decision trees and fuzzy models for predicting the ETRS and QUEST score. The fuzzy models outperform the other two models


We presented a data-driven methodology for the automatic creation of interpretable models supporting the diagnosis and monitoring of patients suffering from Essential Tremor. Our methodology relies on pyFUME [37]—a novel tool combining swarm intelligence, fuzzy clustering, and fuzzy reasoning—extended with GRABS, a graph-based approach for rule-base simplification [33].

The tremor data used in this study was collected using the TREMOR12 application, which runs on widely available devices (in this study, iPhones), a characteristic that makes it easy and relatively inexpensive to acquire experimental data.

The developed fuzzy models show a better performance than linear models or the CART decision trees presented in [22]. The mean absolute error of the fuzzy model that maps sensor data to the ETRS score is 78% lower compared to the error of a linear model, and 71% lower then decision trees. The fuzzy models also outperform linear models (with 81%) and decision trees with (with 73%) when mapping the sensor data to the QUEST score. The fuzzy models therefore clearly outperform these other models.

In this work we relied on the feature selection phase performed by the CART algorithm in our previous study [22]. However, we envision that this step should be performed autonomously by pyFUME, either as a pre-processing phase or using a bottom-up approach. Integrating this functionality in pyFUME will be among our prioritary future developments.

All the patients involved in this study were treated with DBS during sample acquisition, leading to relatively low values for both ETRS and QUEST scores. Hence, patients suffering from severe tremors were not represented in our data set. In order to make our methodology able to detect and diagnose such cases, additional data collected from untreated patients should be added to the data set, and puFUME should run again to build a model suitable for the identification of such patients.


The QUEST and the ETRS scores are widely used for diagnostics and treatment of ET, but they are subjective in nature. To overcome this problem, in this study we use direct measurements from wearable (smartphone) sensors to assess the severity of the tremor more objectively. We used the data extracted from these measurements to develop fuzzy models that map the relationship between the QUEST and ETRS score. The models developed in this study outperform both linear models and decision trees, and the running time of model inference is in the order of miliseconds.

Models such as the ones developed in this study can help clinicians to quickly and objectively measure the effect of Deep Brain Simulation (DBS) on the patient’s tremors. For DBS, a medical device called a neurostimulator is implanted in the patient’s chest. The neurostimulator sends electrical impulses trough electrodes to specific brain nuclei, which suppresses the patient’s tremor. The immediate feedback of sensor data and machine learning models could help finding the optimal voltage and the sequence of the stimulation for a specific patient to minimize the patient’s tremors. This could ultimately lead to a situation where the neurostimulator is self-learning, and the settings of the stimulation can be adjusted automatically based on real-time feedback.

The use of sensor data from wearables, combined with models produced by machine learning algorithms, can also be useful for patients treated in different ways. For example, they can be used to find the correct dosage for ET patients treated with medication or measure the effectiveness of magnetic resonance (MR) guided focused ultrasound.

As a final remark, our data set currently contains 20 ET patients: we plan to collect new samples in the future to derive more robust models. This current study shows that using smartphone data can be useful to diagnose and assess the severity of Essential Tremor. The developed models indicate that there is a relation between the smartphone measurements and tremor severity. However, more data is needed to develop models that are suitable for implementation in clinical practice. In the future, we plan to use smaller and lighter devices for data collection in future studies. The size of the measuring device might restrict the movement of the wrist of the patient, which could influence the measurements of the tremor. As a solution to this, we plan to use smartwatches in follow-up studies.

Availability of data and materials

The data sets used for this study are available from the corresponding author on reasonable request.



Classification and regression tree


Deep brain simulation


Essential tremor


Essential Tremor Rating Scale


Fuzzy self-tuning particle swarm optimization


Graph-based simplification


Mean absolute error


Mean square error


Quality of life in essential tremor

RMS value:

Root-mean-squared value


Root mean square error


  1. Elias WJ, Shah BB. Tremor. J Am Med Assoc (JAMA). 2014;311(9):948–54.

    Article  CAS  Google Scholar 

  2. Louis ED, Barnes L, Albert SM, Cote L, Schneier FR, Pullman SL, Yu Q. Correlates of functional disability in essential tremor. Mov Disord. 2001;16(5):914–20.

    Article  CAS  Google Scholar 

  3. Tröster AI, Pahwa R, Fields JA, Tanner CM, Lyons KE. Quality of life in essential tremor questionnaire (QUEST): development and initial validation. Parkinsonism Relat Disord. 2005;11(6):367–73.

    Article  Google Scholar 

  4. Fahn S, Tolosa E, Marín C. Clinical rating scale for tremor. Parkinson’s Dis Mov Disord. 1993;2:271–80.

    Google Scholar 

  5. Pan D, Dhall R, Lieberman A, Petitti DB. A mobile cloud-based Parkinson’s disease assessment system for home-based monitoring. JMIR mHealth uHealth. 2015;3(1):29.

    Article  Google Scholar 

  6. Senova S, Querlioz D, Thiriez C, Jedynak P, Jarraya B, Palfi S. Using the accelerometers integrated in smartphones to evaluate essential tremor. Stereotact Funct Neurosurg. 2015;93(2):94–101.

    Article  Google Scholar 

  7. Dai H, Zhang P, Lueth T. Quantitative assessment of parkinsonian tremor based on an inertial measurement unit. Sensors. 2015;15(10):25055–71.

    Article  Google Scholar 

  8. Patel S, Lorincz K, Hughes R, Huggins N, Growdon J, Standaert D, Akay M, Dy J, Welsh M, Bonato P. Monitoring motor fluctuations in patients with parkinson’s disease using wearable sensors. IEEE Trans Inf Technol Biomed. 2009;13(6):864–73.

    Article  Google Scholar 

  9. Kubben PL, Kuijf ML, Ackermans LP, Leentjes AF, Temel Y. Tremor12: an open-source mobile app for tremor quantification. Stereotact Funct Neurosurg. 2016;94(3):182–6.

    Article  Google Scholar 

  10. Grimaldi G, Manto M. Neurological tremor: sensors, signal processing and emerging applications. Sensors. 2010;10(2):1399–422.

    Article  Google Scholar 

  11. Breit S, Spieker S, Schulz J, Gasser T. Long-term EMG recordings differentiate between parkinsonian and essential tremor. J Neurol. 2008;255(1):103–11.

    Article  CAS  Google Scholar 

  12. Rigas G, Tzallas AT, Tsipouras MG, Bougia P, Tripoliti EE, Baga D, Fotiadis DI, Tsouli SG, Konitsiotis S. Assessment of tremor activity in the Parkinson’s disease using a set of wearable sensors. IEEE Trans Inf Technol Biomed. 2012;16(3):478–87.

    Article  Google Scholar 

  13. LeMoyne R, Mastroianni T. Use of smartphones and portable media devices for quantifying human movement characteristics of gait, tendon reflex response, and Parkinson’s disease hand tremor. Methods Mol Biol. 2015;1256:335–58.

    Article  Google Scholar 

  14. Kostikis N, Hristu-Varsakelis D, Arnaoutoglou M, Kotsavasiloglou C. A smartphone-based tool for assessing Parkinsonian hand tremor. IEEE J Biomed Health Inform. 2015;19(6):1835–42.

    Article  CAS  Google Scholar 

  15. Carignan B, Daneault J-F, Duval C. Measuring tremor with a smartphone. Methods Mol Biol. 2015;1256:359–74.

    Article  Google Scholar 

  16. Rana AQ, Chou KL. Clinical approach to essential tremor. In: Rana AQ, Chou KL, editors. Essential tremor in clinical practice. Cham: Springer; 2015. p. 21–30.

    Google Scholar 

  17. Poston KL, Rios E, Louis ED. Action tremor of the legs in essential tremor: prevalence, clinical correlates, and comparison with age-matched controls. Parkinsonism Relat Disord. 2009;15(8):602–5.

    Article  Google Scholar 

  18. Daubechies I. Ten lectures on wavelets, vol. 61. Philadelphia: SIAM; 1992.

    Book  Google Scholar 

  19. Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3(Mar):1157–82.

    Google Scholar 

  20. Vieira SM, Sousa JMC, Kaymak U. Fuzzy criteria for feature selection. Fuzzy Sets Syst. 2012;189(1):1–18.

    Article  Google Scholar 

  21. Breiman L, Friedman J, Olshen R, Stone C. Classification and regression trees. Wadsworth Int Group. 1984;37(15):237–51.

    Google Scholar 

  22. Zamora G, Fuchs C, Degeneffe A, Kubben P, Kaymak U. A smartphone-based clinical decision support system for tremor assessment. In: Cazzaniga P, Besozzi D, Merelli I, Manzoni L, editors. Lecture notes in bioinformatics. Berlin: Springer; 2020. p. 1–10.

    Google Scholar 

  23. Bezdek JC. Pattern recognition with fuzzy objective function algorithms. New York: Springer; 2013.

    Google Scholar 

  24. Fuchs C, Spolaor S, Nobile MS, Kaymak U. A swarm intelligence approach to avoid local optima in fuzzy c-means clustering. In: 2019 IEEE international conference on fuzzy systems (FUZZ-IEEE), pp. 1–6; 2019; IEEE

  25. Nobile MS, Cazzaniga P, Besozzi D, Colombo R, Mauri G, Pasi G. Fuzzy self-tuning PSO: a settings-free algorithm for global optimization. Swarm Evolut Comput. 2018;39:70–85.

    Article  Google Scholar 

  26. Jackson P. Introduction to expert systems. Boston: Addison-Wesley Longman Publishing Co. Inc; 1998.

    Google Scholar 

  27. Zadeh LA. Fuzzy sets. Inf Control. 1965;8(3):338–53.

    Article  Google Scholar 

  28. Takagi T, Sugeno M. Fuzzy identification of systems and its applications to modeling and control. IEEE Trans Syst Man Cybern. 1985;1:116–32.

    Article  Google Scholar 

  29. Jang J-SR, Sun C-T, Mizutani E. Neuro-fuzzy and soft computing. Upper Saddle River: Prentice-Hall; 1997.

    Google Scholar 

  30. da Costa Sousa JM, Kaymak U. Fuzzy decision making in modeling and control. World scientific series in robotics and intelligent systems, 2002: vol. 27. World Scientific, New Jersey

  31. Chiu SL. Fuzzy model identification based on cluster estimation. J Intell Fuzzy Syst. 1994;2(3):267–78.

    Article  Google Scholar 

  32. Bezdek JC. Models for pattern recognition. In: Pattern recognition with fuzzy objective function algorithms, 1981; pp. 1–13. Springer, New York

  33. Fuchs C, Spolaor S, Nobile MS, Kaymak U. A graph theory approach to fuzzy rule base simplification. In: International conference on information processing and management of uncertainty in knowledge-based systems, 2020;pp. 387–401. Springer

  34. Jaccard P. Etude Comparative de la Distribution Florale dans Une Portion des Alpes et du Jura. Lausanne: Impr. Corbaz; 1901.

    Google Scholar 

  35. Spolaor S, Fuchs C, Cazzaniga P, Kaymak U, Besozzi D, Nobile MS. Simpful: fuzzy logic made simple. Int J Comput Intell Syst. 2020;13(1):1687–98.

    Article  Google Scholar 

  36. Gegov A. Fuzzy networks for complex systems. Berlin: Springer; 2010.

    Google Scholar 

  37. Fuchs C, Spolaor S, Nobile MS, Kaymak U. pyFUME: a python package for fuzzy model estimation. In: 2020 IEEE international conference on fuzzy systems (FUZZ-IEEE), 2020;pp. 1–8. IEEE

  38. Fuchs C, Wilbik A, Kaymak U. Towards more specific estimation of membership functions for data-driven fuzzy inference systems. In: 2018 IEEE international conference on fuzzy systems (FUZZ-IEEE). IEEE; 2018. p. 1–8.

  39. Babuška R. Fuzzy modelling and identification toolbox. Control Engineering Laboratory, Faculty of Information Technology and Systems, Delft University of Technology, Delft, The Netherlands, version 2000:3

  40. Oliphant TE. A guide to NumPy, vol. 1. USA: Trelgol Publishing; 2006.

    Google Scholar 

  41. Jones E, Oliphant TE, Peterson P. et al. SciPy: Open source scientific tools for Python 2001.

Download references


The authors would like to thank Dr. Chiara Gallese for creating the illustration in Fig. 1.

About this supplement

This article has been published as part of BMC Bioinformatics Volume 22, Supplement 2 2021: 15th and 16th International Conference on Computational Intelligence methods for Bioinformatics and Biostatistics (CIBB 2018-19). The full contents of the supplement are available at


The publication charges for this article are covered by the Information Systems group of the department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology. The authors received no additional funding for this work.

Author information

Authors and Affiliations



PK and UK designed and directed the project. AD gathered the data by performing the experiments. GZ pre-processed the data and performed feature selection. CF further processed the data and built and analyzed the models. CF and MSN analyzed the results and prepared the draft manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Caro Fuchs.

Ethics declarations

Ethics approval and consent to participate

The Ethical Review Board of the Maastricht University Medical Center decided that the study did not fall under the scope of the Medical Research Involving Human Subjects Act (WMO) and approved the study design. Written informed consent to participate was obtained from each patient included in this study.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fuchs, C., Nobile, M.S., Zamora, G. et al. Tremor assessment using smartphone sensor data and fuzzy reasoning. BMC Bioinformatics 22 (Suppl 2), 57 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: