 Software
 Open Access
 Published:
The Kendrick modelling platform: language abstractions and tools for epidemiology
BMC Bioinformatics volume 20, Article number: 312 (2019)
 The Correction to this article has been published in BMC Bioinformatics 2019 20:439
Abstract
Background
Mathematical and computational models are widely used to study the transmission, pathogenicity, and propagation of infectious diseases. Unfortunately, complex mathematical models are difficult to define, reuse and reproduce because they are composed of several concerns that are intertwined. The problem is even worse for computational models because the epidemiological concerns are also intertwined with lowlevel implementation details that are not easily accessible to noncomputing scientists. Our goal is to make compartmental epidemiological models easier to define, reuse and reproduce by facilitating implementation of different simulation approaches with only very little programming knowledge.
Results
We achieve our goal through the definition of a domainspecific language (DSL), Kendrick, that relies on a very general mathematical definition of epidemiological concerns as stochastic automata that are combined using tensoralgebra operators. A very large class of epidemiological concerns, including multispecies, spatial concerns, control policies, sex or age structures, are supported and can be defined independently of each other and combined into models to be simulated by different methods. Implementing models does not require sophisticated programming skills any more. The various concerns involved within a model can be changed independently of the others as well as reused within other models. They are not plagued by lowlevel implementation details.
Conclusions
Kendrick is one of the few DSLs for epidemiological modelling that does not burden its users with implementation details or required sophisticated programming skills. It is also currently the only language for epidemiology modelling that supports modularity through clear separation of concerns hence fostering reproducibility and reuse of models and simulations. Future work includes extending Kendrick to support noncompartmental models and improving its interoperability with existing complementary tools.
Background
The complexity of new infections, relying on many interconnected factors [1–3] such as biodiversity decline [4], antibiotics resistance [5] or intensification of worldwide trade [6], makes the anticipation of their propagation and their evolution a challenge. Epidemiological models could help address this challenge. Indeed, mathematical and computational models have become widely used for investigating the mechanisms of infectious disease propagation [7, 8], exploring their evolutionary dynamics [9, 10], or informing control strategies [11, 12]. They largely rely on the socalled SIR framework [7, 13] where the host population is divided into compartments corresponding to the epidemiological status of individuals: those Susceptible to the pathogen (state S) can become Infectious (state I), allowing them to transmit the pathogen and eventually become Recovered (state R), i.e. immunised against the pathogen. Such models aim to characterise the transition between these categories and the consequences on the dynamics of each category, especially the ‘Infectious’ one that contains diseased individuals. Obviously, other categories can be added ad libitum in order to take into account different concerns (factors) such as the aforementioned ones.
Epidemiological models are sometimes analytically tractable and can more generally be implemented and simulated in different ways. The first approach is often to express the life cycle as a system of ordinary differential equations to be deterministically simulated through numerical methods, such as RungeKutta algorithms [14]. While this approach is especially useful to understand the average dynamics without chance, shifting to a stochastic approach (e.g., through Gillespie algorithms [15]) is known to be more realistic compared to real epidemiological data [7], which allows to analyze the impact of random events on the simulated dynamics of infectious diseases (such as their seasonality [7]). Finally, an agentbased implementation is sometimes required to reach a level of details that would not be tractable with other approaches because of combinatorial explosion [16].
Unfortunately, complex mathematical models are difficult to define, reuse and reproduce because their various concerns are intertwined. Secondly, implementing them requires various degrees of programming skills that span from rudimentary for the deterministic implementations to very sophisticated for agentbased models. Thirdly, implementation choices can be very heterogeneous with nonnegligible impacts on the disease dynamics. For instance, despite their simplicity, deterministic models can show very different dynamics according to the algorithm being used [7]. Finally, implementation details are intertwined with the epidemiological concerns making it even harder to define, reuse or reproduce models.
We address these issues through the definition of a domainspecific language (DSL), Kendrick, that relies on a very general mathematical definition of epidemiological concerns. DomainSpecific Languages (DSLs) separate modelling and specification concerns (conceptual model) from implementation aspects (computational model). Contrary to Generalpurpose Programming Languages (GPLs), DSLs are higherlevel languages that provide abstractions and notations that are directlyrelated to the concepts of the studied domain [17–19].
A DSL itself, however, does not separate the domain concerns from each other. In epidemiology, this represents a significant challenge because the various concerns cannot be completely independent of each other. For instance, if a spatial concern is added to a model it is precisely because it is expected that the rate of infection is not uniformly distributed across space. Therefore, the key issue is to provide a way to express the various concerns as independently as possible despite their inevitable interactions.
We solve this problem thanks to a set of mathematical definitions that are general enough to capture a large variety of epidemiological concerns and that support a threestep approach in which models are:

1
defined abstractly and independently of each other (using the KendrickModel keyword),

2
combined in any possible order (using the Composition keyword),

3
instantiated and correlated to each other (using the Scenario keyword).
The interactions between models only occur in the third step so that model definitions can be defined and reused independently of each other.
To illustrate the practical usage of our modelling language, we present in this paper two case studies on measles and mosquitoborne diseases. Our approach is carefully validated through statistical comparisons between our simulation results and wellestablished platform simulations in order to guarantee the link with the mathematical epidemiology literature. While parts of the underlying mathematical models and of the syntax of Kendrick have been presented elsewhere [20, 21], this is the first time the software itself is presented, through a step by step coverage of its implementation and usage.
Implementation
This section sheds light on the mathematical model on which Kendrick relies, on the Kendrick language itself as well as on its implementation (for installation and experimentation details see the Results & Discussion section).
Overview of the underlying mathematical model
We have adopted a stochastic viewpoint in which models are ContinuousTime Markov Chains (CTMCs). Models that are defined deterministically from differential equations can be interpreted from a stochastic viewpoint provided some assumptions about the probability distribution which is often assumed to be Poisson. Conversely, the stochastic models that we consider can be abstracted back to the deterministic ones.
Studying the literature on CTMCs lead us to define models and their concerns as stochastic automata [22]. We have then looked for an operator to combine concerns as freely as possible. The concept of Stochastic Automata Network (SAN) [23] seemed general enough to be reused. The automata of a SAN are combined by a tensor sum operator when the underlying Markov processes are CTMCs as in our case.
There are two ways in which stochastic automata may interact [23]:

The rate at which a transition occurs may be a function of the state of a set of automata. These transitions are called functional transitions and their rates functional rates.

A transition in one automaton may force one to occur in one or several other automata. The latter are called triggered transitions.
In epidemiological models, it is crucial to support functional rates while we have not met examples that required triggered transitions. We have focused on functional rates because they allow the description of the heterogeneities that, as it was mentioned previously, motivate the use of different concerns in the first place. It is crucial to be able to express, for instance, that the force of infection depends on the region where individuals live. The point is that this information is deferred, in Kendrick models, to a special phase (introduced by the Scenario keyword) that is separated from the definition of concerns. These definitions can then be more easily reused.
With our plattform, any given model can be run as a deterministic or stochastic simulation. The default deterministic solver is RK4 [14]. Gillespie’s direct and tauleap methods are both available for stochastic simulations. Stochastic individualbased simulations can also be run by triggering events at the level of individuals [16].
The Kendrick DSL
The goal of the DSL is threefold: a) provide an easytouse concise and readable syntax, b) hide implementation details as much as possible, and c) provide a way to keep the concerns separated (i.e. avoid textual dependencies between them).
With these goals in mind, our DSL defines the following linguistic (syntactical and semantic) entities: Epidemiological Models, Compositions, Scenarios, Simulations and Visualizations, which are exemplified in Table 1. Each of these entities can be defined as a separate component (with its own file if necessary) and reused several times in multiple projects by simply referring to its name. Moreover, all of the aforementioned entities can be extended (i.e. specialized). It is possible to introduce new properties (see the SEIR model in case study I below), as well as overriding some of them.
First, Models are defined as independently as possible from each other. They are characterized by epidemic concerns (such as SIR or similar schema, spatial aspects, age or sex structure...). These models can then be combined (through the Composition entity). Scenarios describe the initial conditions of experiments and can themselves be combined in different ways to provide inputs to Simulation entities. These entities define algorithmic properties for experiments, such as solvers and timing constraints. Finally, the Visualization entities define the desired visual outputs (such as epidemiological figures and maps) of the experimental results.
Functional rates that define the heterogeneities of the various concerns, and thus bind some of them, are introduced in the Scenario entities only. This is crucial to be able to define concerns as independently as possible. Note that the Kendrick syntax was very carefully designed to make it very convenient to name compartments of any size while the typical Matlab implementations heavily rely on lowlevel matrix operations. This notation is quite convenient to initialize parameters and especially functional rates whose initial values may depend on several concerns. See for instance the initialization of the mu and rho parameters in the Mosquitoborne disease case study further in this document. In this case, mu and rho are functional rates because their value depends on the species. More complex functional rates involving several concerns can also be easily initialized by assigning values to compartments of various sizes.
Implementation details
Kendrick is implemented on top of the Pharo [24] programming environment using the Moose metamodelling platform [25]. The numerical analysis backend is based on the PolyMath framework [26] while the visualisation subsystem relies on Roassal [27, 28] and the UI components on GT Tools and the Moldable Inspector [29]. In addition to that, the DSL extensively relies on Pharo’s reflective subsystem [30], with extensions based on the PetitParser [31] framework and on the Helvetia parsing workbench [32].
Kendrick supports all major desktop platforms (Gnu/Linux, MacOS and MS Windows) and adopts an open continuous integration process (based on github^{Footnote 1} and Travis CI^{Footnote 2}). It has been extensively tested using the SUnit [33] framework, readily available through a dedicated website^{Footnote 3} and well documented through a collaborative wiki^{Footnote 4}.
Results
In this section, we first provide practical information on how to install and start using Kendrick. Then we illustrate in detail how to simulate the evolution of two models of infectious diseases. The first one is measles, a childhood disease that has been extensively studied through epidemiological modelling while the second model is a vectorborne disease, which allows us to illustrate the use of a multihost concern.
Installing & running Kendrick
The easiest way to install Kendrick is to download a precompiled bundle for your platform from Github^{Footnote 5}. After unzipping the corresponding compressed archive file, simply run one of the Kendrick launchers (KendrickUI for Mac and Linux, or KendrickWinLauncher for Windows).
Alternatively, a very straightforward method to compile Kendrick from sources is provided on all systems with a bash commandline (including Linux, Mac and Windows with Cygwin and/or the Windows 10 Bash subsystem), by issuing the following command:
This command will automatically retrieve all the required dependencies, compile Kendrick from sources and set up all the execution scripts for normal or development use.
Kendrick can be used in three different modes. First, the UI mode relies on a specific IDE to edit and run Kendrick models. This UI follows the Cascading Lists^{Footnote 6} pattern of navigation: each selection reveals a new column of actionable information (on the right), with a horizontal bar on the bottom that controls the position and size of the viewport. This mode is launched by invoking the following command.
Second, Kendrick can be used from the commandline. Kendrick models are edited using any source code editor and stored in the Sources directory of the Kendrick installation, along with all the other models. A model is run using the following command and the results are then stored in the Output directory.
Third, an advanced UI mode is available where Kendrick is run with the full development environment (allowing to use both the DSL and the Pharo API of Kendrick). This mode is run this way:
Case study I: Measles
We consider the transmission of measles through a SEIR model with demography [34]. Individuals are born in the Susceptible (S) status with a birth rate μ, then may become Exposed (E) (i.e. infected but not yet infectious) with transmission rate βI. After an average latent period of 1/σ, they may become Infectious (I) and transmit the pathogen. Finally, they may move to the Recovered (R) class after an average infectious period of 1/γ.
The model, shown below, is available from the Kendrick UI by navigating either to Scripts/Measles.kendrick (as a single file script, see Fig. 1) or to Projects/Measles.
Lines 1 to 10 define the core epidemiological concern, SEIR, including the transition parameters. Complete models (such as Measles) are produced by a Composition which computes a tensor sum of one or several modelling concerns (lines 1213). In this simple example, the Measles model includes a single concern: SEIR.
Finally, from lines 15 to 27, two Scenarios are declared separately to foster reuse even though both are required to run the Measles model. The first one assigns a value to each parameter of the model while the second one sets the initial value of each compartment of the core concerns (S, E, I, and R).
This version of the Measles model relies on the transition syntax of Kendrick, but ODEs (Ordinary Differential Equations) can also be used to express the same model:
A core epidemiological concern (e.g., SIR, SEIR etc.) is required in all epidemiological models. Defining them as separated entities fosters their reuse in different models. The most frequently used entities are gathered in the standard Kendrick library (available through our editor by navigating to Library/KendrickModel.
The initial size of the population is 100,000 individuals. The value of the other parameters are taken from the related literature [7, 34]. Figure 2 shows the result of the deterministic simulation (using the RK4 solver) produced by the Simulation and Visualization entities shown below:
The Simulation entity (from lines 1 to 6) uses both the scenarios that were previously defined, and specifies which algorithm to use (here RungeKutta) and the timing characteristics (time step and duration of the simulation). Finally, the Visualization entity specifies the desired output for a specific Simulation and by default plots the dynamics of the infected compartment.
Case study II: A mosquitoborne disease with three host species
Our second example is an SIR model with demography of a mosquitoborne disease with three host species. These species are a vector (mosquito) and two potential hosts: reservoir1 and reservoir2. So the population that is always partitioned using at least the status attribute is here also partitioned using the species one leading to 3x3 compartments.
All the species have the same six transitions: birth, deaths (for each one of the three status compartments), infection and recovery. Given a transition, the transition function is the same for all states. Only 4 generic transitions need to be defined with Kendrick, which will then automatically generate the remaining ones for each species of the model.
As previously, the model is available through the Kendrick distribution by navigating through our editor either to Scripts/Mosquitos.kendrick (single file script, seen in Fig. 3) or to Projects/Mosquito.
The script of the model is shown below.
The SIR concern is defined (from lines 1 to 9) including transitions and parameters. The multispecies concern (lines 1113) includes the three aforementioned species and has no transitions since individuals do not change their species. The Mosquito model combines the epidemiological and multispecies concerns (lines 15 to 17).
The Mosquito model is then complemented by two scenarios. The (MosquitoPopulation) scenario (lines 19 to 23) sets the initial value of the compartments while the second scenario (MosquitoParameters) sets the value of the parameters (lines 25 to 35). When the initial size of a compartment depends on the species a compound name is used such as "mu_species" or "beta_species" and a vector is then provided with a value for each species rather than a single value for all the species. R is not mentioned because its value is the default one (zero).
As mentioned earlier, when the value of a parameter not only depends on the state of its own automaton (i.e. concern) but on the state of another one it is expressed as a functional rate which is the case for mu and beta (lines 2833). Both parameters are defined by the SIR concern but depend on the species. Compound names are used in this case too to specify vectors or even matrices. Mu is initialized by a mere vector because In order to specify functional rates, parameters are followed by attribute keys to denote how their values vary with such attributes, i.e. line 28 means that the value of parameter mu varies with species, the first value represents mu of mosquito, the second one is of reservoir1 and the last one is of reservoir2.
The simulation (lines 37 to 42) runs the Gillespie algorithm for the given timeframe and step using both the aforementioned scenarios. The visualization (lines 44 to 48) plots the dynamics of the infectious (I) compartment (Fig. 4).
More example models are available
The first casestudy focused on a basic model with a single (core) concern: SEIR. The second casestudy included a core concern (SIR) and a multispecies one. More example models combining more concerns (such as spatial, control policies, sex or age structures, multistrain, etc.) are available in the Kendrick distribution and on the wiki^{Footnote 7}.
For example, the Influenza6 model (see Scripts/Influenza6.kendrick in the distribution or^{Footnote 8} on the wiki) includes SEIRS, a multispecies concern, a quarantine concern, and a spatial concern of 5 countries of Southeast Asia. More complex spatial graphs can be constructed by importing data from external sources (e.g., GIS data, flight connections etc.).
The diversity of these examples suggest that Kendrick can support the most common epidemiological concerns.
Batch analysis & integration with external tools
In order to support batch exploration of epidemiological models (for sensitivity analysis for example), epidemiological experiments can be defined with different parameter values to explore a grid of the parameter space.
These experiments can be defined using either (a) the dedicated Experiment entity in Kendrick Projects (as shown in the codesnippet below and in Fig. 5) or (b) commandline arguments.
In both cases, the results of the experiments can be analysed by thirdparty software. In the case of the commandline experiments, the setup allows Kendrick to be driven by and integrated with platforms such as R^{Footnote 9} or OpenMole^{Footnote 10} for further analysis.
An experiment, PopRateAnalysis is defined on the Measles model using the MeaslesDiagramViz visualization as an output template (lines 13). A series of parameter intervals of the form [min, max] @ step are then introduced (lines 48). populationSize will thus get 2 values: 100000 and 150000, while the initial infected population as well as γ and β will get 3 values each. Finally, line 9 runs the experiment.
Kendrick runs a simulation for each of the 54 (2x3x3x3) combinations. The results can then be explored and analysed as seen Fig. 5. The results are also exported automatically in the Output folder of the project for further analysis with external tools.
Model fitting to observed data is possible by running a Kendrick model from the command line. An example is given on our development website^{Footnote 11} where a complete Kendrick model is built in Matlab/Octave^{Footnote 12}. The space of the possible parameter values is then explored, driven by a minimization function written in Octave. More direct support by Kendrick is planned to minimize the need of external tools.
Validating the implementation
To validate the core functionalities of Kendrick as a modelling and simulation platform, the output of deterministic Kendrick models have been compared to a reference implementation of the same models on Scilab [35]. The simulation results (see Fig. 2) suggest that for deterministic models, Kendrick results are identical to those of the reference implementation.
The dynamics of deterministic simulations have also been compared to those of Gillespie and individualbased simulations (Fig. 6).
The results for the measles model can be seen in the upper part of Fig. 6, with the lower part displaying the results of the mosquitoborne model for each individual host species. The deterministic dynamics can be superimposed on the stochastic and individualbased ones, validating our implementation of the simulation logic.
Finally, outputs of individualbased and stochastic simulations using the same configuration have been crossexamined. Key properties of the epidemiological dynamics, such as the duration and the peak of the epidemic, have been extracted from the results of 200 executions of each individualbased and stochastic model. A KolmogorovSmirnov test on each pair of samples has showed no statistical difference between the individualbased simulations and the stochastic ones. The results can be seen in Table 2, where all P−values are greater than 0.05, validating that the resulting distributions are statistically indistinguishable. This suggests that our individualbased and stochastic algorithms are correctly implemented and provide compatible results.
Discussion
Although DSLs have been used before in the context of bioinformatics [36–38], only a small number of them focused on epidemiological modelling [39, 40]. For example, Ronald [39] is a DSL for studying the interactions between malaria infections and drug treatments, but has unfortunately been discontinued. Schneider and al. [40] have also proposed a DSL for epidemics, but their solution only support agentbased models. Mathematical modelling languages (MMLs) such as Scilab [35], Modelica [41], Matlab [42] or JSim [43] do allow easier definition of mathematical models as sets of ODEs, but are too broad in scope to properly cover the domainspecific needs of epidemiology. On the other hand, individual libraries targeting epidemiological modelling, such as Epipy  a visualisation data tool for epidemiology written in Python [44], or GillespieSSA  an R package for generating stochastic simulation using Gillespie’s algorithms [45], cover only specific epidemiological needs.
Closer to our approach are computational modelling tools for epidemiology such as FluTe [46], GLEAMviz [47], STEM [48] and FRED [49]. These solutions use dedicated approaches to model the transmission of infectious disease and provide a graphical user interface (GUI) to specify and visualise an epidemiological model. The main features of these tools are summarised in Table 3 and compared to Kendrick.
Contrary to solutions that only focus on stochastic simulations (such as GLEAMviz [47] and STEM [48]) our platform can provide more modelling options thanks to the availability of individualbased simulation. The largescale epidemic simulations that can be carried out with GLEAMviz [47], STEM [48] or FRED [49] can also be considered with Kendrick since it provides all the necessary building blocks (disease spread, airflight graph linking the airports) but it has not been attempted yet. A case study of metapopulation model can be found in the Additional file 1 provided with this publication. This particular case is also an example of spatial visualisation integrated within the platform.
Given the above comparison, Kendrick can be considered as a higherlevel disciplined solution that aims to cover as many specificities of epidemiological modelling as possible. The fact that Kendrick chooses the right level of abstraction for each case is also expected to help modellers focus on what is essential and avoid irrelevant or inconsistent definitions. This allows easier crossexamination of different modelling simulation schemes (such as deterministic, demographicstochastic and individualbased). In particular, letting modellers directly manipulate epidemiological concepts, such as compartments, is expected to reduce the burden of having to deal with implementation details that typically involve programminglanguagedependent matrix manipulations.
Currently, our platform has nevertheless several limitations. At the moment, we only support deterministic, stochastic and individualbased simulations. More highresolution models like agentbased simulation where agents can have complex behaviours or contactnetworks will be defined in the future. Spatialbased models based on partial differential equations (PDE) are also beyond the scope of this paper.
Furthermore, there is also an inherent tradeoff in our pervasive use of a DSL, especially regarding simulations. While we do allow for the configuration of basic simulation parameters, there is currently no way to adapt simulations in more elaborate ways, without getting back to development in the host language (Pharo in our case). This is an acceptable limitation for a DSL in general, which aims to be userfriendly and reusable, forfeiting the ability to control every little detail of the implementation by its endusers.
Conclusion
This paper has introduced Kendrick, a language and a tool to make it easier to define, reuse and reproduce compartmental epidemiological models. Kendrick relies on a very general mathematical definition of epidemiological concerns as stochastic automata that are combined using tensoralgebra operators. A large class of epidemiological concerns can be defined this way, including multispecies, spatial concerns, control policies and sex or age structures. Concerns can be defined independently of each other, and combined into models that are simulated by different methods with very little programming knowledge.
Kendrick features have been highlighted using two classic examples of epidemiological models. The results produced using Kendrick are equivalent to wellestablished but hardertouse, programming platforms. Kendrick supports batchanalysis and experimentation, allowing its combination with external tools for statistical analysis.
We hope that Kendrick will be further adopted and extended to support even more facets of epidemiology. It is available as open source software under the MIT Licence: http://ummisco.github.io/kendrick/.
Availability and requirements
Project name: KENDRICK
Project home page: http://ummisco.github.io/kendrick/
Operating System: multiplatform (Linux/MacOS/Windows)
Programming environment: Pharo 6.1: http://www.pharo.org/
Programming language: Smalltalk
Requirements: All the required tools for the installation of KENDRICK are described on the project home page
License: MIT License
Any restrictions to use by nonacademics: no restrictions
Availability of data and materials
Not applicable.
Change history
27 August 2019
Following publication of the original article [1], the author noticed that the following lines were missing from the published article. The original article has been corrected.
Notes
 1.
 2.
 3.
 4.
 5.
 6.
 7.
 8.
 9.
 10.
 11.
 12.
Abbreviations
 DSLs:

Domain specific languages
 GPLs:

Generalpurpose programming languages
 GUI:

Graphical user interface
 MMLs:

Mathematical modelling languages
 ODEs:

Ordinary differential equations
 UI:

User interface
References
 1
Cohen ML. Changing patterns of infectious disease. Nature. 2000; 406:762–8.
 2
Eisenberg JNS, Cevallos W, Ponce K, Levy K, Bates SJ, Scott JC, Hubbard A, Vieira N, Endara P, Espinel M, Trueba G. Environmental change and infectious disease: how new roads affect the transmission of diarrheal pathogens in rural Ecuador. Proc Natl Acad Sci. 2006; 103(51):19460–5.
 3
Ostfeld RS, Keesing F. Effects of host diversity on infectious disease. Ann Rev Ecol Evol Syst. 2012; 43(1):157–82.
 4
Keesing F, et al. Impacts of biodiversity on the emergence and transmission of infectious diseases. Nature. 2010; 468:647–52.
 5
Jones KE, Patel NG, Levy MA, Storeygard A, Balk D, Gittleman JL, Daszak P. Global trends in emerging infectious diseases. Nature. 2008; 451:990–4.
 6
Laxminarayan R, et al. Antibiotic resistance  the need for global solutions. Lancet Infect Dis. 2013; 13(12):1057–98.
 7
Keeling MJ, Rohani P. Modeling infectious diseases. Princeton: Princeton University Press; 2008.
 8
Xia Y, Bjornstad ON, Grenfell BT. Measles metapopulation dynamics: a gravity model for epidemiological coupling and dynamics. Am Nat. 2004; 164(2):267–81.
 9
Gandon S, Mackinnon MJ, Nee S, Read aF. Imperfect vaccines and the evolution of pathogen virulence. Nature. 2001; 414(6865):751–6.
 10
Read AF, Huijben S. Perspective: Evolutionary biology and the avoidance of antimicrobial resistance. Evol Appl. 2009; 2(1):40–51.
 11
Bauch CT, Szusz E, Garrison LP. Scheduling of measles vaccination in lowincome countries: Projections of a dynamic model. Vaccine. 2009; 27(31):4090–8.
 12
Levin A, Burgess C, Garrison LP, Bauch C, Babigumira J, Simons E, Dabbagh A. Global eradication of measles: an epidemiologic and economic evaluation. J Infect Dis. 2011; 204 Suppl (Suppl 1):98–106.
 13
Anderson RM, May RM. Infectious Diseases of Humans: Dynamics and Control. Oxford: Oxford Science Publications; 1991.
 14
Griffiths DF, Higham DJ. Numerical Methods for Ordinary Differential Equations. Springer Undergraduate Mathematics Series: Springer; 2010.
 15
Gillespie DT. Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977; 81:2340–61.
 16
Roche B, Drake JM, Rohani P. An agentbased model to study the epidemiological and evolutionary dynamics of influenza viruses. BMC Bioinformatics. 2011; 12:87.
 17
Van Deursen A, Klint P, Viser J. Domainspecific languages: An annotated bibliography. ACM SIGPLAN Not. 2000; 35(6):26–36.
 18
Mernik M, Heering J, Sloane AM. When and how to develop domainspecific languages. ACM Comput Surv. 2005; 37(4):316–44.
 19
Fowler M. Domainspecific Languages. USA: Pearson Education; 2010.
 20
BUI TMA, Stinckwich S, Ziane M, Roche B, HO TV. KENDRICK: A Domain Specific Language and platform for mathematical epidemiological modelling. In: the 11th IEEE RIVF International Conference on Computing & Communication TechnologiesResearch, Innovation, and Vision for Future (RIVF). IEEE: 2015. p. 132–7.
 21
Bui TMA, Papoulias N, Ziane M, Stinckwich S. Explicit composition constructs in dsls: The case of the epidemiological language kendrick. In: Proceedings of the 11th Edition of the International Workshop on Smalltalk Technologies, IWST’16. New York: ACM: 2016. p. 20–12011.
 22
Bui TMA, Ziane M, Stinckwich S, Ho TV, Roche B, Papoulias N. Separation of concerns in epidemiological modelling. In: Companion Proceedings of the 15th International Conference on Modularity, MODULARITY Companion 2016. New York: ACM: 2016. p. 196–200.
 23
Plateau B, Stewart WJ. Stochastic automata networks. In: Computational Probability. Boston: Springer: 2000. p. 113–51.
 24
Black AP, Ducasse S, Nierstrasz O, Pollet D, Cassou D, Denker M. Pharo by Example. Kehrsatz: Square Bracket Associates; 2009, p. 333. http://pharobyexample.org/.
 25
Girba T. The Moose Book. Switzerland: Self Published; 2010. http://www.themoosebook.org/book.
 26
PolyMath. Open Source Software for Numerical Computation in Pharo. https://github.com/PolyMathOrg/PolyMath.
 27
Araya VP, Bergel A, Cassou D, Ducasse S, Laval J. Agile visualization with Roassal. Deep Into Pharo. 2013;:209–39.
 28
Bergel A. Agile visualization. Chile: Lulu; 2016.
 29
Chris A. Moldable tools. PhD thesis. University of Bern. 2016.
 30
Foote B, Johnson RE. Reflective facilities in Smalltalk80. In: ACM Sigplan Notices. Vol. 24, No. 10. ACM: 1989. p. 327–35.
 31
Renggli L, Ducasse S, Gîrba T, Nierstrasz O. Practical dynamic grammars for dynamic languages. In: The 4th Workshop on Dynamic Languages and Applications (DYLA 2010). New York: ACM: 2010.
 32
Renggli L, Gîrba T, Nierstrasz O. Embedding languages without breaking tools. In: European Conference on ObjectOriented Programming. Berlin: Springer: 2010. p. 380–404.
 33
Ducasse S. SUnit Explained. http://www.iam.unibe.ch/~ducasse/Programmez/OnTheWeb/SUnitEnglish2.pdf.
 34
Anderson RM, May RM. Infectious Diseases of Humans, vol. 1. UK: Oxford university press Oxford; 1991.
 35
Scilab. Open Source Software for Numerical Computation. http://www.scilab.org/.
 36
Fall A, Fall J. A domainspecific language for models of landscape dynamics. Ecol Model. 2001; 141:1–18.
 37
Degenne P, Lo Seen D, Parigot D, Forax R, Tran A, Ait Lahcen A, Curé O, Jeansoulin R. Design of a domain specific language for modelling processes in landscapes. Ecol Model. 2009; 220:3527–35.
 38
van Engelen RA. Atmol: A domainspecific language for atmospheric modelling. CIT J Comput Inf Technol. 2001; 9:289–303. Special Issue on DomainSpecific Languages Part I.
 39
Antao T, Hastings IM, McBurney P. Ronald: A DomainSpecific Language to Study the Interactions Between Malaria Infections and Drug Treatments. In: Proceedings of the International Conference on Bioinformatics & Computational Biology (BIOCOMP 2008), Vol 2. CSREA Press: 2008. p. 747–52.
 40
Schneider O, Dutchyn C, Osgood N. Towards frabjous: a twolevel system for functional reactive agentbased epidemic simulation. In: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium. ACM: 2012. p. 785–90.
 41
Modelica Language. https://www.modelica.org/.
 42
Matlab. the Language of Technical Computing. http://www.mathworks.com/products/matlab/.
 43
Introduction of JSim Framework. http://www.physiome.org/jsim/.
 44
Epipy. Python tools for epidemiology. http://cmrivers.github.io/epipy/.
 45
GillespieSSA. Gillespie’s Stochastic Simulation Algorithm (SSA). http://cran.rproject.org/web/packages/GillespieSSA/index.html.
 46
Chao DL, Halloran ME, Obenchain VJ, Longini JrIM. FluTE, a publicly available stochastic influenza epidemic simulation model. PLoS Comput Biol. 2010; 6(1):e1000656.
 47
Van den Broeck W, Gioannini C, Gonçalves B, Quaggiotto M, Colizza V, Vespignani A. The gleamviz computational tool, a publicly available software to explore realistic epidemic spreading scenarios at the global scale. BMC Infect Dis. 2011; 11:37.
 48
Falenski A, Filter M, Thöns C, Weiser AA, Wigger JF, Davis M, Douglas JV, Edlund S, Hu K, Kaufman JH, et al. A generic opensource software framework supporting scenario simulations in bioterrorist crises. Biosecurity bioterrorism: biodefense Strateg Pract Sci. 2013; 11(S1):134–45.
 49
Grefenstette JJ, Brown ST, Rosenfeld R, DePasse J, Stone NTB, Cooley PC, Wheaton WD, Fyshe A, Galloway DD, Sriram A, et al. Fred (a framework for reconstructing epidemic dynamics): an opensource software system for modeling infectious diseases and control strategies using censusbased populations. BMC Public Health. 2013; 13(1):940.
Acknowledgements
This work was supported by Alexandre Bergel (Chile University) and his team Object Profile (http://objectprofile.com/ObjectProfile.html) with the visualisation tool ROASSAL. We would like to acknowledge the financial support from Agence Nationale de la Recherche (ANR) PANIC project and European Smalltalk User Group (ESUG). We also gratefully acknowledge the financial support from Hanoi University of Science and Technology through the "Spatial Modelling Language for Epidemiology" project  grant number T2017TT001.
Funding
This work was funded by Agence Nationale de la Recherche (ANR) PANIC project, European Smalltalk User Group (ESUG) and Hanoi University of Science and Technology through the "Spatial Modelling Language for Epidemiology" project with grant number T2017TT001.
The funders had no role in the work that is presented in this manuscript nor in the decision to publish or in the writing or editing process.
Author information
Affiliations
Contributions
BTMA participated in the definition of the mathematical model and of the case studies, in the implementation of the software, in the analysis of the simulation results and drafted the manuscript. NP participated in the definition of the case studies, defined and implemented the higher level of the DSL, the UI and the batchanalysis and improved the manuscript. BR participated in the definition of the case studies, in the analysis of the results and improved the manuscript. SS participated in the definition of the case studies, in the implementation of the software and improved the manuscript. MZ lead the definition of the mathematical model, helped design the case studies and improved the manuscript. All the authors read and approved the final manuscript.
Corresponding author
Correspondence to Mai Anh BUI T..
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file
Additional file 1
A spatial SIR model specified with Kendrick. (PDF 348 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
BUI T., M.A., Papoulias, N., Stinckwich, S. et al. The Kendrick modelling platform: language abstractions and tools for epidemiology. BMC Bioinformatics 20, 312 (2019). https://doi.org/10.1186/s1285901928430
Received:
Accepted:
Published:
Keywords
 Domainspecific language
 Modularity
 Mathematical modelling
 Epidemiological modelling
 Compartmental models