QuickPed: an online tool for drawing pedigrees and analysing relatedness

Vigeland, Magnus D.

doi:10.1186/s12859-022-04759-y

Research
Open access
Published: 07 June 2022

QuickPed: an online tool for drawing pedigrees and analysing relatedness

Magnus D. Vigeland¹

BMC Bioinformatics volume 23, Article number: 220 (2022) Cite this article

5073 Accesses
8 Citations
5 Altmetric
Metrics details

Abstract

Background

The ubiquity of pedigrees in many scientific areas calls for versatile and user-friendly software. Previously published online pedigree tools have limited support for complex pedigrees and do not provide analysis of relatedness between pedigree members.

Results

We introduce QuickPed, a web application for interactive pedigree creation and analysis. It supports complex inbreeding and comes with a rich built-in library of common and interesting pedigrees. The program calculates all standard coefficients of relatedness, including inbreeding, kinship and identity coefficients, and offers specialised plots for visualising relatedness. It also implements a novel algorithm for describing pairwise relationships in words.

Conclusion

QuickPed is a user-friendly pedigree tool aimed at researchers, case workers and teachers. It contains a number of features not found in other similar tools, and represents a significant addition to the body of pedigree software by making advanced relatedness analyses available for non-bioinformaticians.

Peer Review reports

Background

Drawing and analysing genealogical relationships are indispensable tasks in fields like medical genetics, forensic genetics, ecology and animal breeding, creating a demand for easily accessible software. Several free online tools for creating pedigrees are currently available, including ped_draw [1], HaploForge [2], pedigreejs [3], and Progeny [4]. However, these are geared towards clinical applications and have limited support for complex pedigrees commonly seen in areas like forensic genetics and animal breeding. For instance, all of the mentioned programs struggle with cross-generational mating (see Additional file 1: Fig. S1 for a simple example). Another limitation pertains to importing and exporting ped files describing pedigrees in text format. Such files are widely used to store pedigree data, both for purposes of reproducibility and for communication between software. Of the listed programs, ped_draw and pedigreejs import, but do not export, ped files. Conversely, HaploForge can save pedigrees as ped files after creation, but cannot import such files. The Progeny pedigree tool has no ped file support.

To the best of our knowledge, no online pedigree programs offer analysis of relatedness, like coefficients of kinship and gene identity. Such coefficients play an important role in many fields, as exemplified by recent studies in quantitative genetics [5], forensic genetics [6, 7] and ancient DNA [8]. Despite their widespread use, there is a serious lack of user-friendly software for computing relatedness coefficients, particularly for users without specialised bioinformatic skills.

X-chromosomal counterparts of the standard (autosomal) coefficients are easily defined and have a long history of applications, for instance in medical genetics [9] and forensic genetics [10]. However, it may be argued that the X-chromosomal coefficients remain considerably understudied, possibly due to the practical difficulties of computing them.

Here we introduce QuickPed, an interactive web tool for building and editing pedigrees, which also computes a wide variety of relatedness coefficients, both autosomal and X-chromosomal. In addition, QuickPed implements the relatedness triangle for visualising relatedness, and a novel algorithm producing verbal descriptions of pairwise relationships.

Implementation

QuickPed is written in R using the Shiny package, and is powered by the ped suite packages for pedigree analysis in R [11]. In particular, the relatedness coefficients are computed with the ribd package [12], while the algorithm for describing relationships verbally descriptions, discussed in detail below, is implemented in verbalisr. Pedigrees are created with pedtools and plotted by importing kinship2 [13], following standard pedigree nomenclature [14].

Interactive pedigree creation

To create a pedigree in QuickPed, the user can either choose one from the extensive built-in list or load an existing ped file. Malformed ped files are detected and generate informative error messages. Loaded pedigrees may be modified by selecting individuals and using appropriate buttons, as seen in Fig.1. The final result can be stored as an image (png or pdf) or as a ped file. Further instructions and information can be found at the QuickPed home page (see link below under Availability and requirements).

Relatedness coefficients

Once a pedigree is created, a series of relatedness coefficients between its members can be computed. The following coefficients are supported, where A and B denote any members of the pedigree:

The inbreeding coefficient $f_{\!A}$, defined as the kinship coefficient (see below) of the parents of A, or 0 if A is a founder [15].
The kinship coefficient $\varphi _{\!AB}$, defined as the probability that a random allele from A and a random allele from B at the same autosomal locus, are identical by descent (IBD), i.e., that they have the same ancestral origin within the pedigree [15].
The IBD coefficients $\kappa _{\!AB} = (\kappa _0, \kappa _1, \kappa _2)$, defined (for non-inbred individuals only) as the probability of sharing respectively 0, 1, or 2 alleles IBD at a random autosomal locus [16].
The condensed identity coefficients $\Delta _{\!AB} = (\Delta _1, \dots , \Delta _9)$ of Jacquard [17].
The detailed identity coefficients $\delta _{\!AB} = (\delta _1, \dots , \delta _{15})$ of Jacquard [17].
X-chromosomal versions of all the above coefficients. Details about these can be found in the user manual.

For an introduction to these relatedness coefficients and their applications, see e.g., Thompson [18]. Lange [15] gives a more rigorous treatment with detailed algorithms, while Vigeland [11] focuses on calculations in R.

In addition to the standard coefficients described above, QuickPed also reports the relationship degree, as popularized by KING [19] and similar software for relatedness inference. In simple cases the degree equals the number of pedigree steps separating the individuals (e.g., 1 for parent-child and 2 for half siblings). More generally the degree is defined as a discretisation of the kinship coefficient $\varphi$, by rounding $\log _2(1/\varphi ) - 1$ to the nearest integer. This yields, for instance, degree 0 if $\varphi \in [\tfrac{1}{2}^{3/2}, 1] \approx [0.354, 1]$, degree 1 if $\varphi \in [\tfrac{1}{2}^{5/2}, \tfrac{1}{2}^{3/2}) \approx [0.177, 0.354)$, and degree 2 if $\varphi \in [\tfrac{1}{2}^{7/2}, \tfrac{1}{2}^{5/2}) \approx [0.088, 0.177)$.

For noninbred relationships, QuickPed implements a visualisation device known as the relatedness triangle, or IBD triangle. The IBD coefficients $(\kappa _0, \kappa _1, \kappa _2)$ of any such relationship can be viewed as a point $(\kappa _0, \kappa _2)$ in the plane triangle defined by $\kappa _0 \ge 0$, $\kappa _2 \ge 0$ and $\kappa _0 + \kappa _2 \le 1$ [11, 12]. The location of the most common relationships are indicated on the figure, as well as the inadmissible region established by Thompson [20], as a visual guide to the user.

Relationship descriptions

QuickPed implements a novel algorithm for describing pairwise relationships, inspired by Wright’s path formula for the kinship coefficient [21]:

$$\begin{aligned} \varphi _{\!AB} = \sum _ {C, v_1, v_2} \frac{1}{2^{l_1 + l_2 + 1}}(1+f_C). \end{aligned}$$

(1)

The sum is over all common ancestors C of A and B, and all pairs $(v_1, v_2)$ of non-intersecting paths from C to A and B, respectively, with path lengths $l_1 = |v_1|$ and $l_2 = |v_2|$. Note that C may coincide with A or B, in which case the corresponding path has length 0.

To describe the relationship between A and B, the program first identifies all connecting paths, represented in the form $(C, v_1, v_2)$ as above, and classifies them as either lineal (if $l_1 = 0$ or $l_2 = 0$), sibling ($l_1 = l_2 = 1$), avuncular ($l_1 > l_2 = 1$ or vice versa) or cousin ($l_1, l_2 > 1$). Pairs of paths $(C, v_1, v_2)$, $(C', v_1, v_2)$ that are identical except that $C'$ is a spouse of C, are unified and tagged as full, while the remaining are half. The path degree is $l_1 + l_2 - \gamma$, where $\gamma$ is 1 if the path is full and 0 otherwise. For cousin paths we also define the cousinship degree as $\min (l_1, l_2) - 1$ and removal $|l_1 - l_2|$. Finally, the information about each path is translated to a human-readable statement in standardised format. Sets of paths with identical data $(l_1, l_2, \gamma )$ are reported together as double (or triple, etc.) relationships.

Results

To illustrate the description algorithm, we consider the relationship between individuals 6 and 7 in Fig.1. They have four connecting paths, namely 6-[4]-7, 6-[5]-7, 6-4-[2]-5-7 and 6-5-[2]-4-7. In this notation, the ancestor C of each path is shown in brackets between $v_1$ and $v_2$. The first two paths merge into one full path, classified as full siblings. The two remaining paths both have $l_1 = l_2 = 2$, corresponding to half cousins of degree 1 with no removal. Being numerically equal they constitute a double relationship. The complete QuickPed output is as follows:

Full siblings
- 6-[4,5]-7
Double half first cousins
- 6-4-[2]-5-7
- 6-5-[2]-4-7

For a more interesting demonstration, we applied the description feature to the famously complex pedigree of the Habsburg royalties. The inbreeding coefficient of King Charles II of Spain (1661–1700) has been estimated to approximately 0.25 [22], i.e., similar to that of a child produced by brother-sister incest. The ancestry of Charles II is included as one of the built-in pedigrees in QuickPed and reproduced in Fig.2. For Philip IV and Mariana (the parents of Charles II) the program reports that they are, simultaneously,

Uncle-niece
First cousins once removed
Second cousins once removed
Triple second cousins twice removed
Triple third cousins
Septuple third cousins once removed
Sextuple third cousins twice removed
Triple 4th cousins
Septuple 4th cousins once removed

The complete pedigree paths are included in the output.

QuickPed offers a numerical summary of the selected relationship, by listing the standard relatedness coefficients. In the case of Philip IV and Mariana, we find:

Inbreeding coefficients $f = 0.082$ and $f = 0.136$, respectively
Kinship coefficient $\varphi = 0.231$
Relationship degree $= 1$
Identity coefficients $\Delta = (0.011, 0.004, 0.044, 0.024, 0.077, 0.044, 0.071, 0.498, 0.228)$

Since Philip IV and Mariana are both inbred, their $\kappa$ coefficients are undefined. To exemplify the relatedness triangle, we therefore look at two other members of the Habsburg family, namely the second cousins William V and Renata (rightmost in the 4th generation). Fig.3 shows the point corresponding to their coefficients, $\kappa = (0.9375, 0.0625, 0)$, in comparison with other common relationships.

Discussion

QuickPed aims to fill three gaps in the pedigree software literature. Firstly, it provides a quick, easy-to-use pedigree builder with robust support for import/export of ped files. Powered by the plotting abilities of kinship2 [13], QuickPed supports many pedigrees which are poorly handled by comparable programs (Additional file 1: Fig. S1). Moreover, the interactive process is often accelerated by the many built-in templates, which includes both common pedigrees (e.g., aunt-nephew, first cousins), historic examples (e.g., Habsburg, Tutankhamun) and theoretically important relationships that are challenging to create from scratch (e.g., quadruple half first cousins). One limitation of QuickPed as a pedigree drawing program pertains to pedigree size. There is no hard-coded size limit, but in practice the plot window cannot comfortably display more than about 100 individuals. Another limitation is the set of annotation tools. For users requiring comprehensive clinical symbols we recommend pedigreejs [3] or Progeny [4].

Secondly, QuickPed is to our knowledge the first online calculator of pedigree coefficients. Particularly in the case of identity coefficients, existing programs like IdCoefs [23] demand nontrivial bioinformatic skills of the user, including a separate preparation of ped files. In QuickPed the entire process is interactive, making it more convenient for many users. Regarding X-chromosomal coefficients, we believe this to be an area of untapped potential, hindered by lack of software. It is our hope that QuickPed’s ability to calculate X-chromosomal versions of all available coefficients, including condensed and detailed identity coefficients, may stimulate some attention in this direction.

Finally, QuickPed introduces standardised descriptions of pairwise relationships. Although this feature was originally conceived for pedagogical purposes, we find that it has substantial practical merit. In the Habsburg family (Fig.2) it would be a daunting task to untangle the pedigree paths by hand. But also in much simpler cases, for example that in Fig.1, it is our experience that relationships are often specified imprecisely, even by specialists. As such, our algorithm provides a practical method to avoid misunderstanding and improve communication.

Conclusion

QuickPed is a free, online pedigree tool primarily aimed at researchers, case workers and teachers. In addition to an intuitive pedigree builder, the program contains a variety of features for relatedness analysis, that are either novel or for the first time made accessible to non-bioinformaticians.

Availability of data and materials

The Habsburg dataset analysed in this study is available as a built-in pedigree in QuickPed, and also in the source code repository: https://github.com/magnusdv/quickped/tree/master/data.

Project name: QuickPed.

Project home page (live app): https://magnusdv.shinyapps.io/quickped.

Project information (user manual): https://magnusdv.github.io/pedsuite/articles/web_only/quickped.html.

Project repository (source code): https://github.com/magnusdv/quickped.

Operating system(s): Platform independent.

Programming language: R.

Other requirements: None.

License: GPL3.

Any restrictions to use by non-academics: Licence needed.

Abbreviations

IBD:: Identical by descent

References

Velinder M, Lee D, Marth G. ped_draw: pedigree drawing with ease. BMC Bioinform. 2020;21:569.
Article Google Scholar
Tekman M, Medlar A, Mozere M, Kleta R, Stanescu H. HaploForge: a comprehensive pedigree drawing and haplotype visualization web application. Bioinformatics. 2017;33:3871–7.
Article CAS PubMed Google Scholar
Carver T, Cunningham AP, BabbdeVilliers C, Lee A, Hartley S, Tischkowitz M, et al. Pedigreejs a web based graphical pedigree editor. Bioinformatics. 2018;34:1069–71.
Article CAS PubMed Google Scholar
Progeny Genetics LLC. Aliso Viejo, CA, USA. https://pedigree.progenygenetics.com/.
Kaplanis J, Gordon A, Shor T, Weissbrod O, Geiger D, Wahl M, et al. Quantitative analysis of population-scale family trees with millions of relatives. Science. 2018;360(6385):171–5.
Article CAS PubMed PubMed Central Google Scholar
Green PJ, Mortera J. Inference about complex relationships using peak height data from DNA mixtures. J R Stat Soc Ser C (App Stat). 2021;70(4):1049–82.
Article Google Scholar
Slooten KJ, Egeland T. Exclusion probabilities and likelihood ratios with applications to kinship problems. Int J Leg Med. 2014;128:415–25.
Article Google Scholar
Fowler C, Olalde I, Cummings V, Armit I, Büster L, Cuthbert S, et al. A high-resolution picture of kinship practices in an early Neolithic tomb. Nature. 2022;601:584–7.
Article CAS PubMed Google Scholar
Weeks DE, Valappil TI, Schroeder M, Brown DL. An X-linked version of the affected-pedigree-member method of linkage analysis. Hum Hered. 1995;45:25–33.
Article CAS PubMed Google Scholar
Pinto N, Gusmão L, Amorim A. X-chromosome markers in kinship testing: a generalisation of the IBD approach identifying situations where their contribution is crucial. Forensic Sci Int Genet. 2011;5:27–32.
Article CAS PubMed Google Scholar
Vigeland MD. Pedigree analysis in R. Cambridge: Academic Press; 2021.
Google Scholar
Vigeland MD. Relatedness coefficients in pedigrees with inbred founders. J Math Biol. 2020;81:185–207.
Article PubMed PubMed Central Google Scholar
Sinnwell JP, Therneau TM, Schaid DJ. The kinship2 R package for pedigree data. Hum Hered. 2014;78(2):91–3.
Article PubMed Google Scholar
Bennett RL, Steinhaus KA, Uhrich SB, O’Sullivan CK, Resta RG, Lochner-Doyle D, et al. Recommendations for standardized human pedigree nomenclature. Pedigree standardization task force of the national society of genetic counselors. Am J Hum Genet. 1995;56:745–52.
CAS PubMed PubMed Central Google Scholar
Lange K. Mathematical and statistical methods for genetic analysis. 2nd ed. New York: Springer; 2002.
Book Google Scholar
Thompson EA. The estimation of pairwise relationships. Ann Hum Genet. 1975;39(2):173–88.
Article CAS PubMed Google Scholar
Jacquard A. The genetic structure of populations. Krickeberg K, Lewontin RC, Neyman J, Schreiber M, editors. Berlin, Heidelberg, New York: Springer-Verlag; 1974.
Thompson EA. Statistical inference from genetic data on pedigrees. In: Institute of mathematical statistics: NSF-CBMS regional conference series in probability and statistics; 2000.
Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen WM. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26:2867–73.
Article CAS PubMed PubMed Central Google Scholar
Thompson EA. A restriction on the space of genetic relationships. Ann Hum Genet. 1976;40(2):201–4.
Article CAS PubMed Google Scholar
Wright S. Coefficients of inbreeding and relationship. Am Nat. 1922;56(645):330–8.
Article Google Scholar
Alvarez G, Ceballos FC, Quinteiro C. The role of inbreeding in the extinction of a European royal dynasty. PLOS One. 2009;4:e5174.
Article PubMed PubMed Central Google Scholar
Abney M. A graphical algorithm for fast computation of identity coefficients and generalized kinship coefficients. Bioinformatics. 2009;25(12):1561–3.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

I am grateful to all those who have given feedback during the development of QuickPed. In particular I thank Prof. Thore Egeland and Tuva Fjære for extensive testing and constructive comments.

Funding

This work has been supported by the Norwegian Research Council (project no. 321043).

Author information

Authors and Affiliations

Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
Magnus D. Vigeland

Authors

Magnus D. Vigeland
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.D.V. conceived and executed the study, created the software and wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Magnus D. Vigeland.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The author declares that he has no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Fig. S1

A pedigree with cross-generational mating, as displayed in various pedigree tools. A A ped file describing a pedigree with 5 individuals: Father (1), mother (2), daughter (3), son (4), and a child (5) resulting from father-daughter incest. B The pedigree as rendered by ped_draw [1], HaploForge [2], pedigreejs [3], and Progeny [4], respectively. For ped_draw and pedigreejs, the pedigree was loaded from the ped file, while HaploForge and Progeny required manual creation. In all cases, the result is inadequate. C The pedigree as shown in QuickPed.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Vigeland, M.D. QuickPed: an online tool for drawing pedigrees and analysing relatedness. BMC Bioinformatics 23, 220 (2022). https://doi.org/10.1186/s12859-022-04759-y

Download citation

Received: 11 February 2022
Accepted: 27 May 2022
Published: 07 June 2022
DOI: https://doi.org/10.1186/s12859-022-04759-y

QuickPed: an online tool for drawing pedigrees and analysing relatedness