Skip to main content

KidneyGPS: a user-friendly web application to help prioritize kidney function genes and variants based on evidence from genome-wide association studies

Abstract

Background

Genome-wide association studies (GWAS) have identified hundreds of genetic loci associated with kidney function. By combining these findings with post-GWAS information (e.g., statistical fine-mapping to identify independent association signals and to narrow down signals to causal variants; or different sources of annotation data), new hypotheses regarding physiology and disease aetiology can be obtained. These hypotheses need to be tested in laboratory experiments, for example, to identify new therapeutic targets. For this purpose, the evidence obtained from GWAS and post-GWAS analyses must be processed and presented in a way that they are easily accessible to kidney researchers without specific GWAS expertise.

Main

Here we present KidneyGPS, a user-friendly web-application that combines genetic variant association for estimated glomerular filtration rate (eGFR) from the Chronic Kidney Disease Genetics consortium with annotation of (i) genetic variants with functional or regulatory effects (“SNP-to-gene” mapping), (ii) genes with kidney phenotypes in mice or human (“gene-to-phenotype”), and (iii) drugability of genes (to support re-purposing). KidneyGPS adopts a comprehensive approach summarizing evidence for all 5906 genes in the 424 GWAS loci for eGFR identified previously and the 35,885 variants in the 99% credible sets of 594 independent signals. KidneyGPS enables user-friendly access to the abundance of information by search functions for genes, variants, and regions. KidneyGPS also provides a function (“GPS tab”) to generate lists of genes with specific characteristics thus enabling customizable Gene Prioritisation (GPS). These specific characteristics can be as broad as any gene in the 424 loci with a known kidney phenotype in mice or human; or they can be highly focussed on genes mapping to genetic variants or signals with particularly with high statistical support. KidneyGPS is implemented with RShiny in a modularized fashion to facilitate update of input data (https://kidneygps.ur.de/gps/).

Conclusion

With the focus on kidney function related evidence, KidneyGPS fills a gap between large general platforms for accessing GWAS and post-GWAS results and the specific needs of the kidney research community. This makes KidneyGPS an important platform for kidney researchers to help translate in silico research results into in vitro or in vivo research.

Peer Review reports

Background

Genome-wide association studies (GWAS) have successfully identified genetic loci associated with complex traits [1]. Human genetic evidence has been shown to improve the success rate in drug discovery, especially when the drug target is supported by a Mendelian trait or a GWAS association linked to a missense variant or an otherwise deleterious coding variant [2].

Chronic kidney disease is one of the top 15 causes of death in industrial countries and a poses a large individual and public health burden [3]. Impaired glomerular filtration rate (GFR) < 60 ml/1.73m2/min is a hallmark of CKD. GWAS have identified hundreds of genetic loci associated with GFR estimated from serum creatinine (eGFR) in the general population [4, 5]. However, linking the significant GWAS loci to causal variants and genes pinpointing molecular disease mechanisms relevant for kidney function is challenging.

Post-GWAS fine-mapping of identified loci aims to identify independent signals within loci and to narrow down each signal to the genetic variants that are likely the driving variant of an association signal (e.g., 99% credible sets of variants containing the causal variant with 99% probability, [6]). A 99% credible set of variants is generated for each signal. However, only the signals narrowed down to a limited number of variants in the credible set (e.g., small sets of five or less variants), or sets of any size that include a variant with a high posterior probability of association (e.g., PPA ≥ 50%), might be considered to have strong statistical support for pinpointing the rather likely causal variants. A signal with one variant in the credible set can be considered as narrowed down to single variant resolution: this variant has a 99% probability to be the causal variant for the association signal, given the causal variant is among the analysed variants. Such variant prioritizations have been adopted by Mahajan and colleagues (for Type 2 Diabetes) [7], Wuttke and colleagues (for kidney function) [5] and by Fritsche and colleagues (for Age-related macular degeneration) [8]. Another post-GWAS aim is then the mapping of the statistically supported genetic variants to genes by a variant’s predicted effect on the protein or regulatory function in relevant tissue. The relevant annotation data will typically depend on the disease or phenotype of interest.

The kidney researcher will typically want to focus on kidney tissue, if gene regulatory information is accessed, but also to have a look at other tissue to see whether gene regulation is kidney-specific or ubiquitous. A kidney researcher might also want to see whether a gene has a kidney phenotype in mice or human, or whether the gene is already a drug target in registered clinical trials for kidney disease. Yet, a gene targeted by clinical trials for other diseases might also be interesting for potential re-purposing. Thus, the relevant information should be kidney-centered, but not necessarily kidney-specific. A kidney researcher might also want to see genetic association evidence for kidney function biomarker other than creatinine-based eGFR, for eGFR in individuals with diabetes mellitus (DM), or for eGFR decline over time.

For a kidney researcher interested in selecting targets for functional follow-up or in judging pre-existing targets about GWAS support, the abundance of information from GWAS loci associated with kidney function and annotation data to map genes with kidney function relevance can be overwhelming. The data resulting from GWAS and post-GWAS is typically made publicly available either as online database or as supplementary tables of research articles. Supplementary tables are very specific, but cumbersome to search through. Online platforms like “GWAS catalog” [9], “Open Targets” [10] or “HugeAMP” [11] provide a broad view across numerous traits in parallel and integrate as many available GWAS results as possible. While this is comprehensive across traits, it can be challenging to extract the information relevant for a specific disease or trait. Bridging the gap between GWAS-based evidence and ready information for laboratory researchers is still an unmet need, particularly for kidney function genetics.

Our KidneyGPS, is a web-based application to query the evidence from GWAS results on eGFR joined with multiple annotation datasets and post-GWAS analyses that are relevant to the kidney. KidneyGPS also provides extended kidney-relevant genetic association results: (i) for alternative kidney function biomarkers, (ii) for eGFR in individuals with diabetes, and (iii) for eGFR decline. KidneyGPS can thus help the kidney-interested laboratory researcher to obtain an overview of existing genetic association support of specific genes and variants and to generate lists of genes with specific characteristics (customizable Gene PrioritSation, GPS).

Construction and content

The primary backbone of KidneyGPS are the 424 loci identified from GWAS on eGFR from the Chronic Kidney Disease Genetics (CKDGen) consortium, the largest consortium dedicated to genetic research on kidney function, and its expansion by UK Biobank totalling 1,201,909 individuals of primarily European ancestry [4]. Each locus was fine-mapped to identify independent signals within the locus using approximate conditional analyses (GCTA [12]); for each signal, the 99% credible set of variants was generated using Bayesian fine-mapping [13] (i.e., smallest set of variants that contains the causal variant with 99% probability). Due to the lack of an appropriate trans-ethnic linkage disequilibrium (LD) reference panel (required for the GCTA analysis), the GCTA and Bayesian fine-mapping analyses were based on Europeans-only (i.e., on eGFR summary statistics from 1,004,040 Europeans and on a LD reference panel that consisted of 20,000 unrelated Europeans from the UK Biobank). Given the recently expressed concern regarding the limited robustness of fine-mapping results upon different rounding of GWAS association statistics [6, 14], we conducted sensitivity analyses. We found some of the signals and credible sets observed in the original analysis [4] as less robust when increasing the number of decimal digits in the per-variant association statistics from 4 to 6, but no relevant change when further increasing to 8 decimal digits (Additaional file 1: Note S1, Figs. S1, S2). We thus updated the fine-mapping results (independent signal identification, credible sets) using 6 decimal digits for the association statistics yielding 594 independent signals across the 424 loci with a total of 35,885 variants in the 99% credible sets. Among the 594 signals, 172 signals showed strong statistical support for pinpointing likely causal variants by a 99% credible set that was small (≤ 5 variants) or contained a high-PPA (≥ 50%) variant (Fig. 1). Comparison of variants’ PPA with the size of the credible set that contained the variant supported the idea that small credible sets were enriched for high PPA variants (Additaional file 1: Fig. S3).

Fig. 1
figure 1

The 594 eGFR signals by strength of statistical support of pinpointing the likely causal variants. The 594 independent eGFR association signals were separated by size of their 99% credible set, stating the number of signals and number of variants in the corresponding 99% credible sets. In the 3rd row, we distinguished signals further by the highest PPA of a variant in a set: (i) one variant in the credible set (i.e. > 99% PPA, light green), (ii) at least one variant with PPA 50%-99% (dark green), or (iii) small credible set with all variants’ PPA < 50% (turquoise). Blue and turquoise boxes under dark green boxes contain the remaining variants in signals with one high PPA variant. The last row shows the number of respective variants that were mapped to a gene (protein-relevant or eQTL/sQTL in kidney tissue), also stating the number of mapped genes and number of signals with at least one credible set variant mapped to a gene

For each of the 35,855 credible set variants, we queried annotation data to infer whether variants were protein-relevant or regulatory for any of the 5906 genes located the 424 eGFR loci (“variant-to-gene mapping”): (1) whether the variant resided within a gene and was protein-relevant by any of the three categories (using Ensembl Variant Effect Predictor, VEP [15], and Combined Annotation Dependent Depletion, CADD, [16] for the categories): (i) “stop-gained”, “stop-lost”, “non-synonymous (providing CADD-Phred score) (ii) “canonical-splice”, “noncoding-change”, “synonymous”, “splice-site” (providing CADD-Phred score) and (iii) other consequences restricted to CADD-Phred score ≥ 15 to ascertain protein-relevance; (2) whether the variant had an association with gene expression (eQTL, false discovery rate < 0.05) in kidney-tissue (tubulo-interstitium or glomerulus from NEPTUNE [17] or Susztaklab [18], kidney cortex from GTEx v7 [19]) or any tissue (GTEx v7); (3) whether the variant was associated with expression levels of exon junctions or variation in the relative abundances of gene transcript isoforms (splice quantitative trait locus, sQTL, FDR < 0.05) in any tissue (GTEx V7).

We also queried annotation data for the 5,906 genes (“gene-to-phenotype mapping”): (4) whether the gene had kidney-relevant phenotypes in mice (Mouse Genome Informatics, MGI [20]); (5) whether the gene was known for human genetic disorders with kidney phenotype (Online Mendelian Inheritance in Man, OMIM, [21]) or with evidence from sequencing patients (CKD patients [22] or autosomal dominant tubulointerstitial kidney disease patients [23]); (6) whether the gene was known for drugability or drug-interaction from registered clinical trials (Therapeutic Target Database [24]) for kidney-related indications (ICD-11 codes GB4“X” to GB9“X”, Additional file 1: Table S1), also providing the information on other indications (relevant for re-purposing).

Additionally, for each of the 594 eGFR signals, we queried further genetic association data relevant to the kidney researcher: (7) To highlight the relevance of a genetic association with creatinine-based eGFR for kidney function rather than creatinine metabolism, we included information on whether the locus association was directionally consistent and nominally significant for blood urea nitrogen (BUN) or cystatin-based eGFR (eGFRcys; i.e. locus lead variant P < 0.05; opposite or same direction of effect for BUN or eGFRcys, respectively; n = 852,678 or 460,826, respectively; yielding 491 of 594 signals validated); (8) Since genetic effects with steeper decline versus more stable eGFR over time might point to particularly deleterious mechanisms for the kidney, we included information on whether the signal showed significant association on eGFR decline (N = 343,339 [25], yielding 8 decline signals). (9) Since eGFR etiology in DM might differ from eGFR generally, we included information on the eGFR association for DM and noDM individuals separately and whether the eGFR signals were more or less pronounced in DM versus noDM (NDM = 178,691, NnoDM = 1,296,113 [26], yielding 6 signals associated with eGFR only or more strongly in DM and 1 signal only associated in noDM).

 The specific aspects and annotation data integrated in KidneyGPS as of now (version 2.3.0) and how this compares to the previous publication [4] are shown in Additaional file 1: Table S2 and the KidneyGPS online documentation. By these annotations (of the 35,855 variants, 5906 genes), KidneyGPS contains (a) 13,716 credible set variants that mapped to any of 940 genes as being protein-relevant or eQTL/sQTL in kidney tissue (Table 1A) and (b) 381 genes with known kidney phenotype in mouse or human (including drug targets for kidney diseases; Table 1B).

Table 1 Annotation of credible set variants and genes in the 594 signals associated with eGFR

The architecture of the KidneyGPS database is structured hierarchically based on the 424 eGFR loci and annotation data, where each of these above outlined steps are considered additional layers of evidence that are partly locus-specific, signal-specific, variant-specific, or gene-specific (Fig. 2). KidneyGPS is implemented as RShiny web application (https://kidneygps.ur.de/gps/).

Fig. 2
figure 2

Overview of the architecture of KidneyGPS. We integrated data from multiple GWAS and post-GWAS approaches and derived evidence for variant and gene prioritization regarding their impact on kidney function. Shown are the integrated datasets by the various levels (with the respective input and output of the query within the software) and the front end (input and output of the query of the user); implemented as web application using RShiny

Utility and discussion

User interface

The user interface of KidneyGPS is organized into five tabs: Three tabs enable the specific search for genes, variants and regions (underlying data structure shown in Additional file 1: Fig. S4): (1) “gene search” tab: search for genes using their gene names (synonyms automatically mapped to their official HGNC gene name [18]), (2) “variant search” tab: search variants using their rs-identifiers or genetic positions (chr:pos from GRCh37), and (3) “region search” tab: search for start and end of a genomic regions using chromosome and base-positions. (4) The GPS tab enables to generate lists of genes with specific characteristics and thus enables customized gene prioritization. (5) The last tab, labelled “Documentation & Help”, includes step-by-step-guides for search and GPS features, data source descriptions, and contact options, and data privacy statement.

The GPS tab summarizes the integrated data on all 5906 genes and 594 signals in a comprehensive “GPS Table”. It enables gene prioritization by generating lists of genes with specific characteristics, using various filter options: for example, all genes in the 594 eGFR signals that have a known kidney phenotype in mice or human, or all genes mapping to signals with specific properties regarding the strength of statistical support for pinpointing the likely causal variant. There are three types of filter options: (1) “Signal filtering” allows to restrict the view of the GPS-table to signals with small credible sets (≤ 5 variants) or to signals with a high PPA variant in the credible set (10%, 50% or 99%) or any combination of the two filters. Additionally, one can restrict the GPS-view to signals where the genetic association with creatinine-based eGFR is eGFRcys/BUN-validated, modulated in individuals with DM, or with a genetic association also for eGFR decline over time. (2) “Variant-to-gene mapping” allows to further restrict the GPS-view to genes that are mapped by credible set variants with specific characteristics, e.g., protein-altering variants, or regulatory variants in kidney tissue. (3) “Gene-to-phenotype mapping” allows to restrict the GPS-view to genes of specific characteristics, e.g., known for human or mouse kidney phenotypes, or for genes that are drug targets in registered trials for kidney disease. Case examples of how to use the GPS tab are provided in Additional file 1: Note S2.

The “gene search” tab enables the user to search for up to 2,000 genes simultaneously, either as a list or a text-file input. The generated output provides an extract of the GPS-Table for the respective gene(s) as well as all underlying gene- and variant-based information. The variant-based information can be restricted to variants with PPA above a user-defined threshold. The “variant search” is similarly structured allowing for simultaneous querying of up to 2000 variants. The output yields the eGFR association statistics (all ancestries) for any variant that was genome-wide significantly associated with eGFR (P < 5 × 10–8). For credible set variants (regardless of their association p value), it yields eGFR association statistics in European-ancestry (with and without fully conditioning on independent signal index variants of the locus) and functional or regulatory annotation, if applicable. The “region search” enables the user to identify eGFR associated loci that overlap their genomic region of interest and provides the GPS Table for all genes within these loci. For the gene and variant search, we also provide regional association plots of the respective eGFR locus (LocusZoom [27]). All search results are available for download as Excel tables or as plain csv files.

Limitations and potential misinterpretations

Understanding the limitations of the methods and data employed in KidneyGPS is crucial to avoid misinterpretations.

There is some uncertainty in the robustness of independent signal identification and generated credible sets of variants [6]. The signal identification uses the variants’ correlations based on variant reference panels. Due to the lack of an appropriate trans-ethnic reference panel, here, a reference panel of 20,000 unrelated Europeans from UK Biobank was used, which fits very well to the integrated UK Biobank GWAS, but with some uncertainties regarding its fit to other European or non-European GWAS studies included in CKDGen. The generation of credible sets of variants for each signal was performed based on European-only eGFR summary statistics using the method by Wakefield [13], which assumes one causal variant per signal and thus depends on successfully distinguishing the independent signals up front. Alternative approaches allowing for multiple causal variants per signal like SuSiE [28, 29] or FINEMAP [30] are currently investigated, but have not been widely applied to large consortia data so far and might even more strongly depend on the variant reference panel and power. Any approach for generating credible sets of variants will successfully narrow down only a part of all GWAS signals due to power and regions with large stretches of jointly inherited variants (i.e., high LD). Credible sets as wide as > 100 or even > 1000 genetic variants or credible set variants with low PPA provide limited credibility in the statistical support of any of these variants and users might want to use the “signal filter” of KidneyGPS to focus on signals and/or variants with high statistical support. Currently emerging methods to investigate fine-mapping robustness, e.g., SLALOM [14], might help quantify the uncertainties in fine-mapping results. Integrating more individuals of African ancestry can help dissect large stretches of high LD in the genome and sharpen GWAS signals in the future.

It is a limitation that the GWAS association statistics for eGFR are based mostly and the fine-mapping solely on European populations. Therefore, KidneyGPS results cannot be generalized to non-European populations when genetic variant frequencies differ or when the causal variants even vary between ancestries. For example, multiple variants in APOL1 have high impact on kidney function in African Americans [31], but are not associated in European ancestry due to low allele frequencies. Currently emerging larger GWAS on kidney function with improved coverage of non-European populations and emerging multi-population fine-mapping methods will provide the opportunity to expand KidneyGPS to represent individuals more globally.

With regards to the phenotype used for the GWAS, some genetic effects identified for creatinine-based eGFR may not necessarily impact kidney function but creatinine metabolisms. Our filter option for eGFRcys or BUN validated signals can be used to provide more certainty on kidney function relevance.

Finally, genetic effects on eGFR might differ by sex, in the presence of hypertension, or have other interactions with lifestyle factors, but GWAS within these subgroups or GWAS incorporating gene-lifestyle interactions are currently lacking. We do provide genetic association of eGFR in DM [26] and highlight the—surprisingly few—differences of these with general eGFR.

Comparison with existing tools

While other post-GWAS tools like GWAScatalog [9], HugeAMP [11] and Open Target Genetics [10] are more comprehensive across diseases and traits, KidneyGPS stands out due to its focus on kidney function, its direct link to other kidney-related GWAS, and due to its customizable approach to generate lists of genes of specific characteristics (Table 2). Although all platforms provide variant- and gene-search functionalities, KidneyGPS outperforms the other platforms in terms of the depth of kidney-related results and the options available to extract kidney function specific information (real showcase example in Additional file 1: Note S3). KidneyGPS is unique in its customizable approach to gene prioritization in the sense that the user can generate lists of genes of specific characteristics depending on personal preference or research interest, rather than prioritizing genes by a specified algorithm that typically implies assumptions. An advantage of GWAS Catalog [9], Open Targets Genetics [10] or HugeAMP [11] is that users can extract information across many different phenotypes. However, due to the focus on data related to kidney function, KidneyGPS is a practical tool for the kidney researcher enabling easy access and easy information extraction.

Table 2 Feature comparison across GWAS online platforms

Future developments

We have developed KidneyGPS within a cooperation with the CKDGen consortium and we will continue to cooperate there. This will enable timely updates with novel loci and signals for eGFR, more diverse populations included in the GWAS, improved resolution at already existing signals, and further expansion to other kidney function traits or subgroup-specific GWAS. Other updates will include information on open chromatin in kidney tissue, kidney cell-type specific gene expression and enriched pathway.

Conclusion

KidneyGPS summarizes the statistical and biological evidence identified by GWAS for eGFR and post-GWAS approaches with a special focus on kidney-relevant annotation data as well as complementing genetic associations for other kidney-related traits. KidneyGPS displays this information in a clear and comprehensive way to make it easily accessible to a broader interdisciplinary scientific community. KidneyGPS is designed to support kidney researchers to help translate in silico research results into in vitro or in vivo research.

Availability of data and materials

The KidneyGPS can be accessed at https://kidneygps.ur.de/gps/ and the source code is available at https://github.com/kjstanzick/KidneyGPS. KidneyGPS integrates data from various sources of information including kidney function loci [4] gene expression data [17,18,19], functional annotation and CADD [15, 16], kidney phenotypes [20,21,22,23], diabetes-specific kidney function [26], kidney function decline [24, 25].

References

  1. Loos RJF. 15 years of genome-wide association studies and no signs of slowing down. Nat Commun. 2020;11:5900. https://doi.org/10.1038/s41467-020-19653-5.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  2. King EA, Davis JW, Degner JF. Are drug targets with genetic support twice as likely to be approved? Revised estimates of the impact of genetic support for drug mechanisms on the probability of drug approval. PLoS Genet. 2019;15:e1008489. https://doi.org/10.1371/journal.pgen.1008489.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  3. Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, et al. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. 2017;390:1151–210. https://doi.org/10.1016/S0140-6736(17)32152-9.

    Article  Google Scholar 

  4. Stanzick KJ, Li Y, Schlosser P, Gorski M, Wuttke M, Thomas LF, et al. Discovery and prioritization of variants and genes for kidney function in 1.2 million individuals. Nat Commun. 2021;12:4350. https://doi.org/10.1038/s41467-021-24491-0.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  5. Wuttke M, Li Y, Li M, Sieber KB, Feitosa MF, Gorski M, et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat Genet. 2019;51:957–72. https://doi.org/10.1038/s41588-019-0407-x.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  6. Schaid DJ, Chen W, Larson NB. From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat Rev Genet. 2018;19:491–504. https://doi.org/10.1038/s41576-018-0016-z.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  7. Mahajan A, Taliun D, Thurner M, Robertson NR, Torres JM, Rayner NW, et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat Genet. 2018;50:1505–13. https://doi.org/10.1038/s41588-018-0241-6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  8. Fritsche LG, Igl W, Bailey JNC, Grassmann F, Sengupta S, Bragg-Gresham JL, et al. A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants. Nat Genet. 2016;48:134–43. https://doi.org/10.1038/ng.3448.

    Article  PubMed  CAS  Google Scholar 

  9. Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 2023;51:D977–85. https://doi.org/10.1093/nar/gkac1010.

    Article  PubMed  CAS  Google Scholar 

  10. Ghoussaini M, Mountjoy E, Carmona M, Peat G, Schmidt EM, Hercules A, et al. Open targets genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res. 2021;49:D1311–20. https://doi.org/10.1093/nar/gkaa840.

    Article  PubMed  CAS  Google Scholar 

  11. Common Metabolic Diseases Knowledge Portal (cmdkp.org). https://hugeamp.org/dinspector.html?dataset=Stanzick2021_CKD_mixed&phenotype=eGFRcrea. Accessed 16 Feb 2023.

  12. Yang J, Ferreira T, Morris AP, Medland SE, Madden PAF, Heath AC, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–75. https://doi.org/10.1038/ng.2213.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  13. Wakefield J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am J Hum Genet. 2007;81:208–27. https://doi.org/10.1086/519024.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. Kanai M, Elzur R, Zhou W, Daly MJ, Finucane HK. Meta-analysis fine-mapping is often miscalibrated at single-variant resolution. Cell Genom. 2022. https://doi.org/10.1016/j.xgen.2022.100210.

    Article  PubMed  PubMed Central  Google Scholar 

  15. McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, et al. The ensembl variant effect predictor. Genome Biol. 2016. https://doi.org/10.1186/s13059-016-0974-4.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2018;47:D886–94. https://doi.org/10.1093/nar/gky1016.

    Article  PubMed Central  CAS  Google Scholar 

  17. Gillies CE, Putler R, Menon R, Otto E, Yasutake K, Nair V, et al. An eQTL landscape of kidney tissue in human nephrotic syndrome. Am J Hum Genet. 2018;103:232–44. https://doi.org/10.1016/j.ajhg.2018.07.004.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  18. Sheng X, Guan Y, Ma Z, Wu J, Liu H, Qiu C, et al. Mapping the genetic architecture of human traits to cell types in the kidney identifies mechanisms of disease and potential treatments. Nat Genet. 2021;53:1322–33. https://doi.org/10.1038/s41588-021-00909-9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. Aguet F, Brown AA, Castel SE, Davis JR, He Y, Jo B, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–13. https://doi.org/10.1038/nature24277.

    Article  Google Scholar 

  20. Bult CJ, Blake JA, Smith CL, Kadin JA, Richardson JE, Anagnostopoulos A, et al. Mouse genome database (MGD) 2019. Nucleic Acids Res. 2018;47:D801–6. https://doi.org/10.1093/nar/gky1056.

    Article  PubMed Central  CAS  Google Scholar 

  21. Hamosh A, Scott AF, Amberger J, Valle D, McKusick VA. Online mendelian inheritance in man (OMIM). Hum Mutat. 2000;15:57–61. https://doi.org/10.1002/(SICI)1098-1004(200001)15:1%3c57::AID-HUMU12%3e3.0.CO;2-G.

    Article  PubMed  CAS  Google Scholar 

  22. Groopman EE, Marasa M, Cameron-Christie S, Petrovski S, Aggarwal VS, Milo-Rasouly H, et al. Diagnostic utility of exome sequencing for kidney disease. N Engl J Med. 2019;380:142–51. https://doi.org/10.1056/NEJMoa1806891.

    Article  PubMed  CAS  Google Scholar 

  23. Wopperer FJ, Knaup KX, Stanzick KJ, Schneider K, Jobst-Schwan T, Ekici AB, et al. Diverse molecular causes of unsolved autosomal dominant tubulointerstitial kidney diseases. Kidney Int. 2022;102:405–20. https://doi.org/10.1016/j.kint.2022.04.031.

    Article  PubMed  CAS  Google Scholar 

  24. Zhou Y, Zhang Y, Lian X, Li F, Wang C, Zhu F, et al. Therapeutic target database update 2022: facilitating drug discovery with enriched comparative data of targeted agents. Nucleic Acids Res. 2022;50:D1398–407. https://doi.org/10.1093/nar/gkab953.

    Article  PubMed  CAS  Google Scholar 

  25. Gorski M, Rasheed H, Teumer A, Thomas LF, Graham SE, Sveinbjornsson G, et al. Genetic loci and prioritization of genes for kidney function decline derived from a meta-analysis of 62 longitudinal genome-wide association studies. Kidney Int. 2022;102:624–39. https://doi.org/10.1016/j.kint.2022.05.021.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  26. Winkler TW, Rasheed H, Teumer A, Gorski M, Rowan BX, Stanzick KJ, et al. Differential and shared genetic effects on kidney function between diabetic and non-diabetic individuals. Commun Biol. 2022;5:580. https://doi.org/10.1038/s42003-022-03448-z.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  27. Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, Gliedt TP, et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26:2336–7. https://doi.org/10.1093/bioinformatics/btq419.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. Wang G, Sarkar A, Carbonetto P, Stephens M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J R Stat Soc B. 2020;82:1273–300. https://doi.org/10.1111/rssb.12388.

    Article  Google Scholar 

  29. Zou Y, Carbonetto P, Wang G, Stephens M. Fine-mapping from summary data with the “Sum of Single Effects” model. PLoS Genet. 2022;18:e1010299. https://doi.org/10.1371/journal.pgen.1010299.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  30. Benner C, Spencer CCA, Havulinna AS, Salomaa V, Ripatti S, Pirinen M. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics. 2016;32:1493–501. https://doi.org/10.1093/bioinformatics/btw018.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Parsa A, Kao WHL, Xie D, Astor BC, Li M, Hsu C, et al. APOL1 risk variants, race, and progression of chronic kidney disease. N Engl J Med. 2013;369:2183–96. https://doi.org/10.1056/NEJMoa1310345.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Liu H, Doke T, Guo D, et al. Epigenomic and transcriptomic analyses define core cell types, genes and targetable mechanisms for kidney disease. Nat Genet. 2022;54(7):950–62. https://doi.org/10.1038/s41588-022-01097-w.

    Article  PubMed  CAS  Google Scholar 

Download references

Acknowledgements

We thank the CKDGen Consortium and its Analyst Group for fruitful discussion and feedback on the project.

Funding

Open Access funding enabled and organized by Projekt DEAL. Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 387509280, SFB 1350; Project-ID 509149993, TRR 374.

Author information

Authors and Affiliations

Authors

Contributions

KJ Stanzick developed KidneyGPS, analysed and interpreted the underlying data and was a major contributor in writing the manuscript. KJ Stark contributed to the conception and design of KidneyGPS and the interpretation of the underlying data. MG contributed to the data analysis and interpretation. JS, RK, FK and RW beta tested KidneyGPS and contributed to the final conception and design of KidneyGPS. TWW and IMH contributed to the conception and design of KidneyGPS, the interpretation of the underlying data and substantially revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Iris M. Heid or Thomas W. Winkler.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file1

: Note S1. Sensitivity Analyses regarding the impact of decimal digits on secondary signal identification and fine-mapping results. Note S2. Case example on how to use the “GPS tab”. Note S3. Detailed comparison of gene-search and results from KidneyGPS to GWAS Catalog, Open Targets Genetics and HugeAMP. Table S1. ICD-11 codes. Table S2. Comparison of data. Fig. S1. Comparison of the identifying p-values of the 634 signal index variants in analyses with different numbers of digits analysis. Fig. S2. Comparison of credible set sizes for all 634 signals. Fig. S3. Comparison of variant’s PPA and 99% credible set size. Fig. S4. Architecture of KidneyGPS search functions.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Stanzick, K.J., Stark, K.J., Gorski, M. et al. KidneyGPS: a user-friendly web application to help prioritize kidney function genes and variants based on evidence from genome-wide association studies. BMC Bioinformatics 24, 355 (2023). https://doi.org/10.1186/s12859-023-05472-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12859-023-05472-0

Keywords