- Research Article
- Open Access
Three-dimensional spatial analysis of missense variants in RTEL1 identifies pathogenic variants in patients with Familial Interstitial Pneumonia
© The Author(s). 2018
- Received: 6 September 2017
- Accepted: 3 January 2018
- Published: 23 January 2018
Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease.
To illustrate this approach in a clinical application, we analyzed 13 candidate missense variants in regulator of telomere elongation helicase 1 (RTEL1) identified in patients with Familial Interstitial Pneumonia (FIP). We curated pathogenic and neutral RTEL1 variants from the literature and public databases. We then used homology modeling to construct a 3D structural model of RTEL1 and mapped known variants into this structure. We next developed a pathogenicity prediction algorithm based on proximity to known disease causing and neutral variants and evaluated its performance with leave-one-out cross-validation. We further validated our predictions with segregation analyses, telomere lengths, and mutagenesis data from the homologous XPD protein. Our algorithm for classifying RTEL1 VUS based on spatial proximity to pathogenic and neutral variation accurately distinguished 7 known pathogenic from 29 neutral variants (ROC AUC = 0.85) in the N-terminal domains of RTEL1. Pathogenic proximity scores were also significantly correlated with effects on ATPase activity (Pearson r = −0.65, p = 0.0004) in XPD, a related helicase. Applying the algorithm to 13 VUS identified from sequencing of RTEL1 from patients predicted five out of six disease-segregating VUS to be pathogenic. We provide structural hypotheses regarding how these mutations may disrupt RTEL1 ATPase and helicase function.
Spatial analysis of missense variation accurately classified candidate VUS in RTEL1 and suggests how such variants cause disease. Incorporating spatial proximity analyses into other pathogenicity prediction tools may improve accuracy for other genes and genetic diseases.
The use of next-generation sequencing to study families with pulmonary diseases has led to the identification of novel genes and mechanisms associated with the inherited forms of pulmonary arterial hypertension [1–5] and pulmonary fibrosis [6–8]. Genetic variation in telomere-related genes is the predominant cause of pulmonary disease (when genetic etiology is known). Even when the genetic cause is unknown, such as with idiopathic pulmonary fibrosis, telomere shortening in peripheral blood mononuclear cells [9–11] and type II alveolar epithelial cells [6, 11] is commonly observed in patients and families. The mechanism through which telomere dysfunction leads to lung fibrosis is not clear, but may involve premature senescence of progenitor cells in the distal lung [12–14]. Among families with pulmonary fibrosis (Familial Interstitial Pneumonia, FIP), whole exome sequencing (WES) studies have identified that variation in a few genes is responsible for disease risk. The most commonly mutated genes in FIP patients are TERT (10–15% of cases) [15, 16], RTEL1, and PARN (3–4% of cases each) [6, 7]. Most FIP mutations identified to date are very rare or novel. Rare variation presents challenges when using genetic information in clinical practice, since most newly identified variants in FIP-associated genes are considered variants of unknown significance (VUS).
Predicting the effects of rare missense VUS on protein function is particularly challenging; some variants are tolerated while others lead to dramatic alterations in protein structure, trafficking/localization, or function . Classical genetic approaches, including linkage analysis, are often limited by small family size, disease onset late in life, and in the case of telomere-related genes such as RTEL1, may also be confounded by the inheritance of short telomeres (and thus increased disease risk) without inheritance of the causal allele. Assigning pathogenicity to VUS has important implications for genetic testing and family counseling, and may soon impact treatment decisions. While functional testing of variants remains the gold standard, in many cases this is not feasible in a sufficiently timely manner to impact clinical care. Numerous in-silico algorithms have been developed to predict VUS pathogenicity by analyzing evolutionary conservation patterns and/or biochemical characteristics of amino-acid substitutions (e.g., SIFT , PolyPhen , VAAST , GERP , CADD , VIPUR ). However, these methods frequently present discordant classifications  and rarely provide specific mechanistic hypotheses about the functional effects of VUS. Novel approaches are required that incorporate RTEL1-specific information to improve pathogenicity prediction.
We screened FIP families from our registry for rare variants in RTEL1 and identified 13 rare missense VUS. We hypothesized that pathogenic RTEL1 variants likely affect critical functions and/or protein interactions and thus would co-localize in three-dimensional space. To test this hypothesis, we used homology modeling to predict the tertiary structure of RTEL1 and identified a spatial cluster of variants with known disease-association in RTEL1’s helicase domains. We then developed an algorithm to classify missense VUS based on their spatial proximity to known pathogenic and neutral variants with the expectation that VUS near the pathogenic cluster are more likely contribute to disease. The approach outperformed two common pathogenicity prediction methods in cross-validation and predicted the pathogenicity of disease-segregating VUS with high accuracy. Our study supports the likely pathogenicity of novel FIP-associated rare variants, generates a new homology model of RTEL1’s 3D structure, supports quantitative spatial analysis in protein structure as a powerful approach to classify VUS in RTEL1, and suggests this technique may have broad applicability to other genes and genetic diseases.
Subjects and samples
We trained our spatial proximity prediction algorithm using putatively neutral RTEL1 missense variants from the 1000 Genomes Project  that were not otherwise associated with disease and pathogenic missense variants causing severe pediatric, autosomal recessive Hoyeraal-Hreidarsson syndrome collected from previous literature [25–31]. We evaluated the performance of our prediction algorithm using rare missense variants of unknown significance from patients with Familial Interstitial Pneumonia (FIP). Subjects were identified from the Familial Interstitial Pneumonia (FIP)/Familial Pulmonary Fibrosis (FPF) registries at Vanderbilt University, the University of Colorado, and National Jewish Hospital . FIP was defined by the presence of Idiopathic Interstitial Pneumonia (IIP) in two or more family members, including interstitial pulmonary fibrosis (IPF) in at least one individual. Phenotypes of subjects selected for sequencing were ascertained using ATS/ERS criteria for IIP . The affected status of deceased individuals was determined by review of available medical records, autopsy material, or by death certificates. DNA was isolated from blood and/or paraffin-embedded lung tissue using a PureGene Kit (Gentra Systems, Minneapolis, MN). Rare missense variants (MAF < 0.001) in RTEL1 were curated from whole-exome sequencing data as previously reported  (n = 189 families) or targeted modified Sanger sequencing of RTEL1 (n = 184 families) (Additional file 1: Figure S1). Co-segregation and telomere length measurements were performed as previously described . VUS co-segregation with disease and short telomeres were considered evidence for pathogenicity and represent true-positives in our analysis.
Protein structural analysis
We evaluated the predictive power of the pathogenic proximity score using leave-one-out cross-validation on the known pathogenic and neutral variants ; each variant was predicted to be pathogenic or neutral by its proximity to all other variants. We quantified the performance of each prediction method using the area under the receiver operating characteristic curve (ROC AUC). The ROC curve plots true positive rate, the proportion of true positives (pathogenic variants) predicted to be positive, versus false positive rate, the proportion of true negatives (neutral variants) predicted to be positive, as a function of prediction rank. The ROC AUC is equivalent to the probability that a randomly selected positive is ranked higher than a randomly selected negative; thus, perfect separation of positives and negatives produces a ROC AUC of 1.0 and random ordering produces a ROC AUC of 0.5. We compared the performance of the pathogenic proximity score with other pathogenicity prediction methods, including ConSurf evolutionary conservation scores , SIFT , and PolyPhen2 . A brief description of each approach is provided in the Additional file 1: Supplemental Methods.
Constructing a structural model of RTEL1
The protein structure for RTEL1 has not yet been experimentally determined, so we constructed a computationally derived homology model. To begin, we applied nine computational modeling algorithms to the protein sequence: GeneSilico , HHpred , I-TASSER , M4T , Pcons5 , Phyre2 , RaptorX , Robetta , and SWISS-MODEL . RaptorX produced the highest-coverage model, which consisted of two well-folded domains spanning residues 1–769 and 881–1151. This model was based on seven PDB structures: 4a15 , 3crv , 2fi7 , 2gm7 , 4pjq , 2vrw , 4a64 . To improve quality, the model was relaxed using Rosetta version 2015.19 , and then subjected to 1000 rounds of loop_modeling  using perturb_kic_with_fragments. This new structural model of RTEL1 is available as Additional file 2.
Known pathogenic missense variants in RTEL1 cluster in 3D structure
Spatial proximity analysis accurately classifies pathogenic and neutral RTEL1 variants
Based on the observed differences between neutral and pathogenic variant distributions, we hypothesized that candidate VUS could be classified by their relative spatial proximity to known pathogenic and neutral variants. To evaluate this, we used leave-one-out cross-validation to calculate pathogenic proximity scores (ΔP) for each known pathogenic and neutral variant in the N-terminal model of RTEL1 (Table S1) and then plotted ROC and PR curves to measure how accurately the proximity score predicts pathogenicity. Classifying variants by their pathogenic proximity score performed well (Fig. 1c); the approach yielded a ROC AUC of 0.85.
To estimate the sensitivity of the proximity-based prediction method to the number of known pathogenic variants, we recomputed pathogenic proximity scores using all possible subsets of pathogenic variants and then calculated the ROC and PR AUC for each subset (Additional file 1: Figure S1). As expected, performance increases as the number of known pathogenic mutations considered increases; the mean ROC AUC is 0.62 when only two pathogenic variants are known and 0.82 when six variants are considered. This suggests that performance will increase as more pathogenic variants are identified. However, we caution that the number of known pathogenic variants required will likely vary substantially based on the structure and function of the protein of interest.
We then compared the performance of our pathogenic proximity score to a representative set of current methods for in silico pathogenicity prediction: ConSurf evolutionary conservation , SIFT , PolyPhen2  (Fig. 1c). The pathogenic proximity score outperformed PolyPhen2 (ROC AUC = 0.81) and SIFT (ROC AUC = 0.80); evolutionary conservation had the best performance (ROC AUC = 0.89). The competitive ROC AUC with current methods and the relatively strong performance obtained with small numbers of known pathogenic variants demonstrates the predictive potential of spatial statistics, which are not currently used for variant pathogenicity prediction.
The pathogenic proximity score identifies nearly all disease-segregating VUS as pathogenic
Pathogenicity predictions for RTEL1 missense VUS from FIP patients
RTEL1 pathogenic proximity scores correlate with decreased ATPase activity in XPD mutants
Genetic variation in RTEL1 is a common cause of FIP in families with known genetic etiology. Most disease-causing RTEL1 variants are private or very rare mutations and appear to reduce RTEL1 levels and/or activity [6, 26]. Determining the pathogenicity of newly identified candidate VUS, particularly missense variants, presents a significant challenge in the diagnosis and treatment of patients and their family members that may be at risk . A number of algorithms provide predictions for missense pathogenicity, but disagreement between algorithms is frequent; in one report, the correlation between SIFT and PolyPhen2 scores was only 0.4 . Missense RVs in RTEL1 are potentially actionable, so improved approaches to predicting pathogenicity could have a substantial clinical impact. In this report, we describe a novel, quantitative structural approach to predicting VUS pathogenicity, applied to 13 rare missense VUS in RTEL1.
We constructed a homology model of the structure of RTEL1 and analyzed missense VUS relative to the spatial distribution of known pathogenic and neutral variation. Five of six VUS that segregated with FIP in families were predicted to be pathogenic by our method, as well as one VUS without disease co-segregation or telomere length data. Below, we outline potential structural mechanisms of action – ranging from disruption of protein-protein or protein-DNA interactions to destabilization of the tertiary structure of the protein – for each segregating VUS.
V516 is a moderately conserved, hydrophobic residue buried in the interior of the helicase II domain. It forms a small well-packed hydrophobic core, which lies under a patch of positively charged surface residues (R518, H713, R729, H731). Insertion of a leucine residue in this position is predicted to be destabilizing because of the additional steric bulk. Moreover, the structural rearrangement could disrupt the conformation of the basic surface patch, presumably affecting interaction with DNA.
S540 is a polar residue predicted to lie on a surface-exposed alpha helix in the helicase II domain. Mutation of the hydroxyl group to an isopropyl group is predicted to have one of two effects. Either the character of the protein surface will be changed from polar to hydrophobic at that location, or, by altering the amphipathic nature of that helix, the mutation could affect the helix packing and positioning, resulting in a larger structural change such as rotation of the helix. Either of these two effects could explain the functional consequence of the variant.
F559 is a bulky aromatic residue found on the interior of the protein model, within 9 Å of the predicted DNA-binding interface (Fig. 3b). Replacement of the large volume of the phenylalanine side chain with the smaller volume of isoleucine could alter the geometry of the DNA-binding cavity sufficiently to disrupt that interaction. Notably, while F559 is in the second shell of residues responsible for DNA contact, it is predicted to be directly adjacent to two first-shell residues, E591 and A621, which have been previously reported as disease-associated .
S688 is located on a buried helix one turn (5.9 Å) away from disease-associated residue R684. The mutation of serine to cysteine does not result in major changes in bulk, branching, charge, or hydrophobicity. However, the presence of the sulfhydryl group in the cysteine could potentially promote misfolding and aggregation upon incorrect formation of disulfide bonds, if exposed to oxidation.
D719 is located on a surface-exposed helix near the pathogenic cluster (Fig. 3c). Replacing the large charged aspartate sidechain with the single hydrogen of a glycine removes a bulky charge from the protein surface and likely disrupts the helix in that region.
T55 is a polar residue predicted to lie at the interface between alpha helices 1 and 2 (Fig. 3d). Relative to the other segregating variants, T55S is distal to the pathogenic cluster and is relatively equidistant to pathogenic and neutral variation. Both threonine and serine are unusual residues to find in a helix-helix interface, and suggest that this position may be functionally important. Replacement of a threonine sidechain with that of serine does not alter the hydroxyl character of the residue, though it reduces the steric bulk by one methyl group. This is not a major volumetric change, but the removal of a beta-branching amino acid could affect inter-helical packing. This steric change could result in a relative repacking of the helix-helix interface, or could change the strength of interaction between the helices. Another mutation in this helix (K48R) has been shown to abolish ATPase activity when mutated to arginine , though this mutation is also physically closer to the ATP-binding cleft. Although T55 is evolutionarily conserved, SIFT and PolyPhen2 each confidently predict the serine substitution to be benign. Ultimately, there is no obvious structural basis for the pathogenicity of T55S and its distance from the pathogenic cluster suggests that any functional effects are likely impacting alternative mechanisms.
In comparison to general pathogenicity-prediction algorithms, this approach makes use of dense population and disease-association data for variants specifically in RTEL1 using conservative assumptions of pathogenicity. Consequently, the availability of well-characterized pathogenic and neutral variants in the protein-of-interest is essential. The incorporation of variants and mutagenesis data from functional homologs may help to overcome this limitation. For example, the spatial distribution of disease-causing missense variants in RTEL1 suggests that the ATP-binding cleft between helicase domains I and II and the DNA-binding pore along helicase domain II are functionally critical regions of RTEL1. This finding is consistent with observed patterns of missense variants associated with Xeroderma pigmentosum (XP) in the homologous protein XPD . While variants in XPD have different phenotypic presentations than those in RTEL1, the overlapping regions of pathogenicity suggest similar functional effects, with higher-order phenotypes driven by cellular context or unique functional domains (e.g. RTEL1 harmonin-N-like domains). This hypothesis is supported by the significant correlation between RTEL1-derived pathogenic proximity scores and reduced ATPase activity in XPD. This algorithm can be iteratively enhanced as additional disease-associated variants and primary/homologous mutagenesis data become available.
Assigning pathogenicity to missense variants in RTEL1 presents unique challenges. An ideal biomarker/assay of RTEL1 activity has not been defined, and likely differs based on the specific mutation. Short PBMC telomeres appear to be a common feature associated with RTEL1 mutations, but it is not yet clear whether this is a uniform feature; telomere length in RTEL null mouse embryonic stem cells appears stable , so preserved telomere length alone may not sufficiently exclude deleterious function of RTEL1 variants. In light of these complexities, for algorithm training, we conservatively defined variants as pathogenic only if they had been reported to be associated with severe pediatric disease in a recessive genetic model. For testing on novel VUS, we considered segregation with disease and telomere length in defining likely pathogenic variants. Our method classified five of the six VUS that co-segregated with FIP as pathogenic, but it also misclassified three VUS. This may demonstrate a lack of specificity when considering only the location of variants within protein structure. Spatial information demonstrates predictive potential, but it does not directly capture the impact of specific amino acid substitutions, evolutionary conservation, or biochemical information critical for interpretation. However, the specificity of our approach is comparable with other prediction methods, nearly all of which also misclassified the three VUS. It is also possible that these “misclassified” variants do adversely affect RTEL1 function without leading to a direct effect on telomere length ; comprehensive evaluation of these variants and others over-time should lend more clarity. At present, technical issues have limited the ability to perform in-vitro studies in overexpression systems . In addition, it is possible that more than one dominant risk mutation could be found in a family; in this case, lack of co-segregation would not exclude a pathogenic effect.
We have focused our analysis on disease-causing variants in RTEL1 with a particular interest in predicting variants that increase risk for FIP. However, the methodology is dependent only on the availability of protein structural information (whether experimentally derived or computationally predicted) and the assumption that disease-causing variants are spatially clustered within the protein structure. The tendency for cancer-associated somatic mutations to form spatial clusters in protein sequence and structure is well established , and initial evidence for spatial clustering has likewise been observed for germline disease-causing variants [60, 61]. Thus, the methodology proposed here will likely be broadly useful in the identification of disease regions of interest within protein structure and variant pathogenicity prediction.
Our results demonstrate that considering the 3D spatial landscape of missense variation in RTEL1 has the potential to improve pathogenicity prediction and identify functional regions of protein structure important to the development of disease. We implicate the ATP-binding cleft between helicase domains I and II as well as the DNA-binding pore along helicase domain II as functional regions of RTEL1 contributing to the development of FIP. The similar distributions of disease-associated variants and a significant correlation with ATPase activity in the homologous protein XPD support this finding and suggest that including additional variants from homologous proteins may improve predictive power and discover shared biochemical etiology. More generally, we propose incorporating the spatial distributions of known pathogenic and neutral variation into pathogenicity prediction methods to complement existing predictive features, particularly for proteins in which pathogenic variants appear to form clusters within protein structure. Ultimately, the use of this information has the potential to enhance the utility of genetic data in elucidating the etiology of FIP and other heritable diseases.
RMS was supported by NIH T32 EY021453 and a SPORE grant from the Vanderbilt-Ingram Cancer Center. JM was supported by NIH (R01 GM080403, R01 GM099842, R01 HL122010). JAC was supported by institutional funds and a Vanderbilt Ambassadors Discovery Grant in Cancer Research. JAK was supported by NIH (K08HL130595, U54HL127672), Francis Family Foundation, Pulmonary Fibrosis Foundation, and Vanderbilt Faculty Research Scholars. JAP was supported by NIH U01HG007674. TSB was supported by NIH (P01HL92870, R01HL085317). TSB was supported by Department of Veterans Affairs. These funding bodies had no part in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Conceptualization: RMS, JAK, JC, TSB, JP, JAC; Software: RMS; Investigation: RMS, JHS, JAK; Resources: JAK, JC, TSB, JAP, WSB, JM; Writing – Original Draft: RMS, JHS, JAK, JAC; Writing – Review & Editing: All Authors; Visualization: RMS, JHS, JAK; Supervision: TSB, JAP, JM, JAC; Funding Acquisition: JAK, TSB, JAP, JM, JAC. All authors read and approved the final manuscript.
Ethics approval and consent to participate
The Institutional Review Boards from Vanderbilt University, Duke University, University of Colorado and National Jewish Hospital approved this investigation and subjects or their surrogates provided written informed consent prior to enrollment in the study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Hemnes AR, Zhao M, West J, Newman JH, Rich S, Archer SL, et al. Critical genomic networks and vasoreactive variants in idiopathic pulmonary arterial hypertension. Am J Respir Crit Care Med. 2016;194:464.View ArticlePubMedPubMed CentralGoogle Scholar
- de Jesus Perez VA, Yuan K, Lyuksyutova MA, Dewey F, Orcholski ME, Shuffle EM, et al. Whole-exome sequencing reveals TopBP1 as a novel gene in idiopathic pulmonary arterial hypertension. Am J Respir Crit Care Med. 2014;189:1260–72.View ArticlePubMedPubMed CentralGoogle Scholar
- Eyries M, Montani D, Girerd B, Perret C, Leroy A, Lonjou C, et al. EIF2AK4 mutations cause pulmonary veno-occlusive disease, a recessive form of pulmonary hypertension. Nat Genet. 2014;46:65–9.View ArticlePubMedGoogle Scholar
- Ma L, Roman-Campos D, Austin ED, Eyries M, Sampson KS, Soubrier F, et al. A novel channelopathy in pulmonary arterial hypertension. N Engl J Med. 2013;369:351–61.View ArticlePubMedPubMed CentralGoogle Scholar
- Austin ED, Ma L, LeDuc C, Berman Rosenzweig E, Borczuk A, Phillips JA, et al. Whole exome sequencing to identify a novel gene (caveolin-1) associated with human pulmonary arterial hypertension. Circ Cardiovasc Genet. 2012;5:336–43.View ArticlePubMedPubMed CentralGoogle Scholar
- Cogan JD, Kropski J a, Zhao M, Mitchell DB, Rives L, Markin C, et al. Rare variants in RTEL1 are associated with Familial Interstitial Pneumonia. Am J Respir Crit Care Med. 2015;191:646–55.View ArticlePubMedPubMed CentralGoogle Scholar
- Stuart BD, Choi J, Zaidi S, Xing C, Holohan B, Chen R, et al. Exome sequencing links mutations in PARN and RTEL1 with familial pulmonary fibrosis and telomere shortening. Nat Genet. 2015;47:512–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Kannengiesser C, Borie R, Ménard C, Réocreux M, Nitschké P, Gazal S, Mal H, Cadranel J, Nunes H, Valeyre D, Cordier JF, Callebaut I, Boileau C, Cottin V, Grandchamp B, Revy P, Crestani B. Heterozygous RTEL1 mutations is a major cause of familial pulmonary fibrosis. Eur Respir J. 2015;46:474.View ArticlePubMedGoogle Scholar
- Diaz de Leon A, Cronkhite JT, Katzenstein ALA, Godwin JD, Raghu G, Glazer CS, et al. Telomere lengths, pulmonary fibrosis and telomerase (TERT) mutations. PLoS One. 2010;5:e10680.View ArticlePubMedPubMed CentralGoogle Scholar
- Cronkhite JT, Xing C, Raghu G, Chin KM, Torres F, Rosenblatt RL, et al. Telomere shortening in familial and sporadic pulmonary fibrosis. Am J Respir Crit Care Med. 2008;178:729–37.View ArticlePubMedPubMed CentralGoogle Scholar
- Armanios M, Alder JK, Chen JJ-L, Lancaster L, Danoff S, Su S, et al. Short telomeres are a risk factor for idiopathic pulmonary fibrosis. Proc Natl Acad Sci U S A. 2008;105:13051–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Alder JK, Barkauskas CE, Limjunyawong N, Stanley SE, Kembou F, Tuder RM, et al. Telomere dysfunction causes alveolar stem cell failure. Proc Natl Acad Sci. 2015;112:201504780.View ArticleGoogle Scholar
- Povedano JM, Martinez P, Flores JM, Mulero F, Blasco M a. Mice with pulmonary fibrosis driven by telomere dysfunction. Cell Rep. 2015;12:286–99.View ArticlePubMedGoogle Scholar
- Chen R, Zhang K, Chen H, Zhao X, Wang J, Li L, et al. Telomerase deficiency causes alveolar stem cell senescence-associated low-grade inflammation in lungs. J Biol Chem. 2015;290:30813–29.View ArticlePubMedPubMed CentralGoogle Scholar
- Armanios M, Chen JJ-L, Cogan JD, Alder JK, Ingersoll RG, Markin C, et al. Telomerase mutations in families with idiopathic pulmonary fibrosis. N Engl J Med. 2007;356:1317–26.View ArticlePubMedGoogle Scholar
- Tsakiri KD, Cronkhite JT, Kuan PJ, Xing C, Raghu G, Weissler JC, et al. Adult-onset pulmonary fibrosis caused by mutations in telomerase. Proc Natl Acad Sci U S A. 2007;104:7552–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Thusberg J, Olatubosun A, Vihinen M. Performance of mutation pathogenicity prediction methods on missense variants. Hum Mutat. 2011;32:358–68.View ArticlePubMedGoogle Scholar
- Ng PC, Henikoff S. SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31:3812–4.View ArticlePubMedPubMed CentralGoogle Scholar
- Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7:248–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Hu H, Huff CD, Moore B, Flygare S, Reese MG, Yandell M. VAAST 2.0: improved variant classification and disease-gene identification using a conservation-controlled amino acid substitution matrix. Genet Epidemiol. 2013;37:622–34.View ArticlePubMedPubMed CentralGoogle Scholar
- Cooper GM, Stone EA, Asimenos G, Green ED, Batzoglou S, Sidow A. Distribution and intensity of constraint in mammalian genomic sequence. Genome Res. 2005;15:901–13.View ArticlePubMedPubMed CentralGoogle Scholar
- Kircher M. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 2014;46:310–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Baugh EH, Simmons-Edler R, Mueller CL, Alford RF, Volfovsky N, Lash A, et al. Robust classification of protein variation using structural modeling and large-scale data integration. Preprint. 2015;XX:1–6.Google Scholar
- Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65.View ArticlePubMedGoogle Scholar
- Walne AJ, Vulliamy T, Kirwan M, Plagnol V, Dokal I. Constitutional mutations in RTEL1 cause severe dyskeratosis congenita. Am J Hum Genet. 2013;92:448–53.View ArticlePubMedPubMed CentralGoogle Scholar
- Deng Z, Glousker G, Molczan A, Fox AJ, Lamm N, Dheekollu J, et al. Inherited mutations in the helicase RTEL1 cause telomere dysfunction and Hoyeraal-Hreidarsson syndrome. Proc Natl Acad Sci U S A. 2013;110:E3408–16.View ArticlePubMedPubMed CentralGoogle Scholar
- Ballew BJ, Joseph V, De S, Sarek G, Vannier JB, Stracker T, et al. A recessive founder mutation in regulator of telomere elongation helicase 1, RTEL1, underlies severe immunodeficiency and features of Hoyeraal Hreidarsson syndrome. PLoS Genet. 2013;9:e1003695.View ArticlePubMedPubMed CentralGoogle Scholar
- Ballew BJ, Yeager M, Jacobs K, Giri N, Boland J, Burdett L, et al. Germline mutations of regulator of telomere elongation helicase 1, RTEL1, in dyskeratosis congenita. Hum Genet. 2013;132:473–80.View ArticlePubMedPubMed CentralGoogle Scholar
- Hanna S, Béziat V, Jouanguy E, Casanova JL, Etzioni A. A homozygous mutation of RTEL1 in a child presenting with an apparently isolated natural killer cell deficiency. J Allergy Clin Immunol. 2015;136:1113–4.View ArticlePubMedGoogle Scholar
- Moriya K, Niizuma H, Rikiishi T, Yamaguchi H, Sasahara Y, Kure S. Novel compound heterozygous RTEL1 gene mutations in a patient with Hoyeraal-Hreidarsson syndrome. Pediatr Blood Cancer. 2016;63:1683–4.View ArticlePubMedGoogle Scholar
- Le Guen T, Jullien L, Touzot F, Schertzer M, Gaillard L, Perderiset M, et al. Human RTEL1 deficiency causes Hoyeraal-Hreidarsson syndrome with short telomeres and genome instability. Hum Mol Genet. 2013;22:3239–49.View ArticlePubMedGoogle Scholar
- Travis WD, Costabel U, Hansell DM, King TE, Lynch DA, Nicholson AG, et al. An official American Thoracic Society/European Respiratory Society statement: update of the international multidisciplinary classification of the idiopathic interstitial pneumonias. Am J Respir Crit Care Med. 2013;188:733–48.View ArticlePubMedGoogle Scholar
- Durham E, Dorr B, Woetzel N, Staritzbichler R, Meiler J. Solvent accessible surface area approximations for rapid and accurate protein structure prediction. J Mol Model. 2009;15:1093–108.View ArticlePubMedPubMed CentralGoogle Scholar
- Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. First edit. Springer; 2009.View ArticleGoogle Scholar
- Goldenberg O, Erez E, Nimrod G, Ben-Tal N. The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures. Nucleic Acids Res. 2009;37:323–7.View ArticleGoogle Scholar
- Kurowski MA, Bujnicki JM. GeneSilico protein structure prediction meta-server. Nucleic Acids Res. 2003;31:3305–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005;33:W244–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Yang J, Yan R, Roy A, Xu D, Poisson J, Zhang Y. The I-TASSER suite: protein structure and function prediction. Nat Methods. 2014;12:7–8.View ArticleGoogle Scholar
- Fernandez-Fuentes N, Madrid-Aliste CJ, Rai BK, Fajardo JE, Fiser A. M4T: a comparative protein structure modeling server. Nucleic Acids Res. 2007;35:W363–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Wallner B, Elofsson A. Pcons5: combining consensus, structural evaluation and fold recognition scores. Bioinformatics. 2005;21:4248–54.View ArticlePubMedGoogle Scholar
- Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc. 2015;10:845–58.View ArticlePubMedPubMed CentralGoogle Scholar
- Källberg M, Wang H, Wang S, Peng J, Wang Z, Lu H, et al. Template-based protein structure modeling using the RaptorX web server. Nat Protoc. 2012;7:1511–22.View ArticlePubMedPubMed CentralGoogle Scholar
- Kim DE, Chivian D, Baker D. Protein structure prediction and analysis using the Robetta server. Nucleic Acids Res. 2004;32:W526–31.View ArticlePubMedPubMed CentralGoogle Scholar
- Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 2014;42:W252–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Kuper J, Wolski SC, Michels G, Kisker C. Functional and structural studies of the nucleotide excision repair helicase XPD suggest a polarity for DNA translocation. EMBO J. 2011;31:494–502.View ArticlePubMedPubMed CentralGoogle Scholar
- Fan L, Fuss J, Cheng Q, Arvai A, Hammel M. XPD helicase structures and activities: insights into the cancer and aging phenotypes from XPD mutations. Cell. 2008;133:789.View ArticlePubMedPubMed CentralGoogle Scholar
- Kim K, Oh J, Han D, Kim E, Lee B, Kim Y. Crystal structure of PilF: functional implication in the type 4 pilus biogenesis in Pseudomonas aeruginosa. Biochem Biophys Res. 2006;340:1028.View ArticleGoogle Scholar
- Sawaya MR, Chan S, Han GW, Perry LJ. Crystal structure of a ten a homolog/Thi-4 Thiaminase from Pyrobaculum Aerophilum. Protein data Bank. 2006;Google Scholar
- Coquille S, Filipovska A, Chia T, Rajappa L. An artificial PPR scaffold for programmable RNA recognition. Nat Commun. 2014;5:5729.View ArticlePubMedGoogle Scholar
- Rapley J, Tybulewicz V, Rittinger K. Crucial structural role for the PH and C1 domains of the Vav1 exchange factor. EMBO Rep. 2008;9:655.View ArticlePubMedPubMed CentralGoogle Scholar
- Vollmar M, Ayinampudi V, Cooper C, Guo K, Krojer T, Muniz JRC, et al. Crystal structure of the N-terminal domain of human Cul4B at 2.57A resolution. Protein Data Bank. 2012;Google Scholar
- Tyka M, Keedy D, André I, DiMaio F, Song Y. Alternate states of proteins revealed by detailed energy landscape mapping. J Mol. 2011;405:607.View ArticleGoogle Scholar
- Mandell D, Coutsias E, Kortemme T. Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling. Nat Methods. 2009;6:551.View ArticlePubMedPubMed CentralGoogle Scholar
- Uringa EJ, Youds JL, Lisaingo K, Lansdorp PM, Boulton SJ. RTEL1: an essential helicase for telomere maintenance and the regulation of homologous recombination. Nucleic Acids Res. 2011;39:1647–55.View ArticlePubMedGoogle Scholar
- Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–23.View ArticlePubMedPubMed CentralGoogle Scholar
- Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34:D535–9.View ArticlePubMedGoogle Scholar
- Barber LJ, Youds JL, Ward JD, McIlwraith MJ, O’Neil NJ, Petalcorin MIR, et al. RTEL1 maintains genomic stability by suppressing homologous recombination. Cell. 2008;135:261–71.View ArticlePubMedPubMed CentralGoogle Scholar
- Uringa E-J, Lisaingo K, Pickett H a, Brind’Amour J, Rohde J-H, Zelensky A, et al. RTEL1 contributes to DNA replication and repair and telomere maintenance. Mol Biol Cell. 2012;23:2782–92.View ArticlePubMedPubMed CentralGoogle Scholar
- Porta-Pardo E, Kamburov A, Tamborero D, Pons T, Grases D, Valencia A, et al. Comparison of algorithms for the detection of cancer drivers at subgene resolution. Nat Methods. 2017;14:782.View ArticlePubMedGoogle Scholar
- Turner TN, Douville C, Kim D, Stenson PD, Cooper DN, Chakravarti A, et al. Proteins linked to autosomal dominant and autosomal recessive disorders harbor characteristic rare missense mutation distribution patterns. Hum Mol Genet. 2015;24:5995–6002.View ArticlePubMedGoogle Scholar
- Meyer MJ, Lapcevic R, Romero AE, Yoon M, Das J, Beltrán JF, et al. Mutation3D: cancer gene prediction through atomic clustering of coding variants in the structural proteome. Hum Mutat. 2016;37:447.View ArticlePubMedPubMed CentralGoogle Scholar