Abe N, Mamitsuka H: **Prediction of beta-sheet structures using stochastic tree grammars.** In *Proceedings Genome Informatics Workshop V*. Universal Academy Press; 1994:19–28.

Google Scholar

Alexandersson M, Cawley S, Pachter L: **SLAM cross-species gene finding and alignment with a generalized pair hidden Markov model.** *Genome Research* 2003, **13**(3):496–502. 10.1101/gr.424203

Article
PubMed Central
CAS
PubMed
Google Scholar

Arvestad L, Bruno WJ: **Estimation of reversible substitution matrices from multiple pairs of sequences.** *Journal of Molecular Evolution* 1997, **45**(6):696–703. 10.1007/PL00006274

Article
CAS
PubMed
Google Scholar

Baum LE: **An equality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes.** *Inequalities* 1972, **3:** 1–8.

Google Scholar

Birney E, Durbin R: **Dynamite: a flexible code generating language for dynamic programming methods used in sequence comparison.** In *Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology*. Edited by: Gaasterland T, Karp P, Karplus K, Ouzounis C, Sander C, Valencia A. Menlo Park, CA, AAAI Press; 1997:56–64.

Google Scholar

Bockhorst J, Qiu Y, Glasner J, Liu M, Blattner F, Craven M: **Predicting bacterial transcription units using sequence and expression data.** In *Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology*. Menlo Park, CA, AAAI Press; 2003:34–43.

Google Scholar

Branden C, Tooze J: *Introduction to Protein Structure*. Garland, New York; 1991.

Google Scholar

Brown M, Hughey R, Krogh A, Mian IS, Sjölander K, Haussler D: **Using Dirichlet mixture priors to derive hidden Markov models for protein families.** In *Proceedings of the First International Conference on Intelligent Systems for Molecular Biology*. Edited by: Hunter L, Searls DB, Shavlik J. Menlo Park, CA, AAAI Press; 1993:47–55.

Google Scholar

Bruno WJ: **Modelling residue usage in aligned protein sequences via maximum likelihood.** *Molecular Biology and Evolution* 1996, **13**(10):1368–1374.

Article
CAS
PubMed
Google Scholar

Burge C, Karlin S: **Prediction of complete gene structures in human genomic DNA.** *Journal of Molecular Biology* 1997, **268**(1):78–94. 10.1006/jmbi.1997.0951

Article
CAS
PubMed
Google Scholar

Churchill GA: **Stochastic models for heterogeneous DNA sequences.** *Bulletin of Mathematical Biology* 1989, **51:** 79–94.

Article
CAS
PubMed
Google Scholar

Dayhoff MO, Eck RV, Park CM: **A model of evolutionary change in proteins.** In *Atlas of Protein Sequence and Structure*. *Volume 5*. Edited by: Dayhoff MO. National Biomedical Research Foundation, Washington, DC; 1972:89–99.

Google Scholar

Dayhoff MO, Schwartz RM, Orcutt BC: **A model of evolutionary change in proteins.** In *Atlas of Protein Sequence and Structure*. *Volume 5*. Edited by: Dayhoff MO. National Biomedical Research Foundation, Washington, DC; 1978:345–352.

Dempster AP, Laird NM, Rubin DB: **Maximum likelihood from incomplete data via the EM algorithm.** *Journal of the Royal Statistical Society* 1977, **B39:** 1–38.

Google Scholar

Dimmic MW, Mindell DP, Goldstein RA: **Modeling evolution at the protein level using an adjustable amino acid fitness model.** *Proceedings of the Fifth Pacific Symposium on Biocomputing* 2000, 18–29.

Google Scholar

Durbin R, Eddy S, Krogh A, Mitchison G: *Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids*. Cambridge University Press, Cambridge, UK; 1998.

Book
Google Scholar

Eddy SR: **A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure.** *BMC Bioinformatics* 2002., **3**(18):

Eddy SR, Durbin R: **RNA sequence analysis using covariance models.** *Nucleic Acids Research* 1994, **22:** 2079–2088.

Article
PubMed Central
CAS
PubMed
Google Scholar

Eddy SR, Mitchison GJ, Durbin R: **Maximum discrimination hidden Markov models of sequence consensus.** *Journal of Computational Biology* 1995, **2:** 9–23.

Article
CAS
PubMed
Google Scholar

Engelhardt BE, Jordan MI, Muratore KE, Brenner SE: **Protein molecular function prediction by Bayesian phylogenomics.** *PLoS Computational Biology* 2005., **1**(5):

Felsenstein J: **Evolutionary trees from DNA sequences: a maximum likelihood approach.** *Journal of Molecular Evolution* 1981, **17:** 368–376. 10.1007/BF01734359

Article
CAS
PubMed
Google Scholar

Felsenstein J: *Inferring Phylogenies*. Sinauer Associates, Inc; 2003. ISBN 0878931775. ISBN 0878931775.

Google Scholar

Felsenstein J, Churchill GA: **A hidden Markov model approach to variation among sites in rate of evolution.** *Molecular Biology and Evolution* 1996, **13:** 93–104.

Article
CAS
PubMed
Google Scholar

Friedman N, Ninio M, Pe'er I, Pupko T: **A structural EM algorithm for phylogenetic inference.** *Journal of Computational Biology* 2002, **9:** 331–353. 10.1089/10665270252935494

Article
CAS
PubMed
Google Scholar

Gilks W, Richardson S, Spiegelhalter D: *Markov Chain Monte Carlo in Practice*. Chapman & Hall, London, UK; 1996.

Google Scholar

Goldman N, Thorne JL, Jones DT: **Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses.** *Journal of Molecular Biology* 1996, **263**(2):196–208. 10.1006/jmbi.1996.0569

Article
CAS
PubMed
Google Scholar

Goldman N, Yang Z: **A codon-based model of nucleotide substitution for protein-coding DNA sequences.** *Molecular Biology and Evolution* 1994, **11:** 725–735.

CAS
PubMed
Google Scholar

Gonnet GH, Cohen MA, Benner SA: **Exhaustive matching of the entire protein sequence database.** *Science* 1992, **256**(5062):1443–1445. 10.1126/science.1604319

Article
CAS
PubMed
Google Scholar

Gribskov M, McLachlan AD, Eisenberg D: **Profile analysis: detection of distantly related proteins.** *Proceedings of the National Academy of Sciences of the USA* 1987, **84:** 4355–4358. 10.1073/pnas.84.13.4355

Article
PubMed Central
CAS
PubMed
Google Scholar

Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR: **Rfam: an RNA family database.** *Nucleic Acids Research* 2003, **31**(1):439–441. 10.1093/nar/gkg006

Article
PubMed Central
CAS
PubMed
Google Scholar

Halpern AL, Bruno WJ: **Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies.** *Molecular Biology and Evolution* 1998, **15**(7):910–917.

Article
CAS
PubMed
Google Scholar

Hasegawa M, Kishino H, Yano T: **Dating the human-ape splitting by a molecular clock of mitochondrial DNA.** *Journal of Molecular Evolution* 1985, **22:** 160–174. 10.1007/BF02101694

Article
CAS
PubMed
Google Scholar

Hein J: **An algorithm for statistical alignment of sequences related by a binary tree.** In *Pacific Symposium on Biocomputing*. Edited by: Altman RB, Dunker AK, Hunter L, Laud-erdale K, Klein TE. Singapore, World Scientific; 2001:179–190.

Google Scholar

Hein J, Wiuf C, Knudsen B, Moller MB, Wibling G: **Statistical alignment: computational properties, homology testing and goodness-of-fit.** *Journal of Molecular Biology* 2000, **302:** 265–279. 10.1006/jmbi.2000.4061

Article
CAS
PubMed
Google Scholar

Hobolth A, Jensen JL: **Statistical inference in evolutionary models of DNA sequences via the EM algorithm.** *Statistical applications in Genetics and Molecular Biology* 2005., **4**(1):

Google Scholar

Holmes I: **A probabilistic model for the evolution of RNA structure.** *BMC Bioinformatics* 2004., **5**(166):

Google Scholar

Holmes I, Bruno WJ: **Evolutionary HMMs: a Bayesian approach to multiple alignment.** *Bioinformatics* 2001, **17**(9):803–820. 10.1093/bioinformatics/17.9.803

Article
CAS
PubMed
Google Scholar

Holmes I, Rubin GM: **An Expectation Maximization algorithm for training hidden substitution models.** *Journal of Molecular Biology* 2002, **317**(5):757–768. 10.1006/jmbi.2002.5405

Article
Google Scholar

Jojic V, Jojic N, Meek C, Geiger D, Siepel A, Haussler D, Heckerman D: **Efficient approximations for learning phylogenetic HMM models from data.** *Bioinformatics* 2004, **20**(Supplement 1):161–168. 10.1093/bioinformatics/bth917

Article
Google Scholar

Joshi A, Schabes Y: **Tree-adjoining grammars.** 1997.

Chapter
Google Scholar

Jukes TH, Cantor C: **Evolution of protein molecules.** In *Mammalian Protein Metabolism*. Academic Press, New York; 1969:21–132.

Chapter
Google Scholar

Kall L, Krogh A, Sonnhammer EL: **A combined transmembrane topology and signal peptide prediction method.** *Journal of Molecular Biology* 2004, **338**(5):1027–1036. 10.1016/j.jmb.2004.03.016

Article
CAS
PubMed
Google Scholar

Karlin S, Taylor H: *A First Course in Stochastic Processes*. Academic Press, San Diego, CA; 1975.

Google Scholar

Kimura M: **A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.** *Journal of Molecular Evolution* 1980, **16:** 111–120. 10.1007/BF01731581

Article
CAS
PubMed
Google Scholar

Klosterman PS, Tamura M, Holbrook SR, Brenner SE: **SCOR: a structural classification of RNA database.** *Nucleic Acids Research* 2002, **30:** 392–394. 10.1093/nar/30.1.392

Article
PubMed Central
CAS
PubMed
Google Scholar

Knudsen B, Hein J: **RNA secondary structure prediction using stochastic context-free grammars and evolutionary history.** *Bioinformatics* 1999, **15**(6):446–454. 10.1093/bioinformatics/15.6.446

Article
CAS
PubMed
Google Scholar

Knudsen B, Hein J: **Pfold: RNA secondary structure prediction using stochastic context-free grammars.** *Nucleic Acids Research Evaluation Studies* 2003, **31**(13):3423–3428. 10.1093/nar/gkg614

Article
CAS
Google Scholar

Koshi JM, Goldstein RA: **Context-dependent optimal substitution matrices.** *Protein Engineering* 1995, **8:** 641–645.

Article
CAS
PubMed
Google Scholar

Krogh A, Brown M, Mian IS, Sjölander K, Haussler D: **Hidden Markov models in computational biology: applications to protein modeling.** *Journal of Molecular Biology* 1994, **235:** 1501–1531. 10.1006/jmbi.1994.1104

Article
CAS
PubMed
Google Scholar

Kschischang FR, Frey BJ, Loeliger H-A: **Factor graphs and the sum-product algorithm.** *IEEE Transactions on Information Theory* 1998, **47**(2):498–519. 10.1109/18.910572

Article
Google Scholar

Lari K, Young SJ: **The estimation of stochastic context-free grammars using the inside-outside algorithm.** *Computer Speech and Language* 1990, **4:** 35–56. 10.1016/0885-2308(90)90022-X

Article
Google Scholar

Lichtarge O, Bourne HR, Cohen FE: **An evolutionary trace method defines binding surfaces common to protein families.** *Journal of Molecular Biology* 1996, **257:** 342–358. 10.1006/jmbi.1996.0167

Article
CAS
PubMed
Google Scholar

Liò P, Goldman N: **Using protein structural information in evolutionary inference: transmembrane proteins.** *Molecular Biology and Evolution* 1999, **16:** 1696–1710.

Article
PubMed
Google Scholar

Lunter G, Ponting CP, Hein J: **Genome-wide identification of human functional DNA using a neutral indel model.** *PLoS Computational Biology* 2006., **2**(1):

Lunter GA, Hein J: **A nucleotide substitution model with nearest-neighbour interactions.** *Bioinformatics* 2004, **20**(Suppl 1):I216-I223. 10.1093/bioinformatics/bth901

Article
CAS
PubMed
Google Scholar

McCarthy JL: **Recursive functions of symbolic expressions and their computation by machine.** *Communications of the ACM* 1960, **3**(4):184–195. 10.1145/367177.367199

Article
Google Scholar

McLachlan GJ, Krishnan T: *The EM Algorithm and Extensions*. Wiley Interscience; 1996.

Google Scholar

Meyer IM, Durbin R: **Gene structure conservation aids similarity based gene prediction.** *Nucleic Acids Research* 2004, **32**(2):776–783. 10.1093/nar/gkh211

Article
PubMed Central
CAS
PubMed
Google Scholar

Michalek S, Timmer J: **Estimating rate constants in hidden Markov models by the EM algorithm.** *IEEE Transactions in Signal Processing* 1999, **47:** 226–228. 10.1109/78.738259

Article
Google Scholar

Miklós I, Lunter G, Holmes I: **A long indel model for evolutionary sequence alignment.** *Molecular Biology and Evolution* 2004, **21**(3):529–540. 10.1093/molbev/msh043

Article
PubMed
Google Scholar

Mizuguchi K, Deane CM, Blundell TL, Overington JP: **HOMSTRAD: a database of protein structure alignments for homologous families.** *Protein Science* 1998, **7:** 2469–2471.

Article
PubMed Central
CAS
PubMed
Google Scholar

Moses AM, Chiang DY, Pollard DA, Iyer VN, Eisen MB: **MONKEY: identifying conserved transcription-factor binding sites in multiple alignments using a binding site-specific evolutionary model.** *Genome Biology* 2004., **5**(12):

Muller T, Vingron M: **Modeling amino acid replacement.** *Journal of Computational Biology* 2000, **7**(6):761–776. 10.1089/10665270050514918

Article
CAS
PubMed
Google Scholar

Neyman J: **Molecular studies of evolution: a source of novel statistical problems.** In *Statistical Decision Theory and Related Topics*. Edited by: Gupta SS, Yackel J. Academic Press, New York; 1971.

Google Scholar

Ng PC, Henikoff S: **SIFT: Predicting amino acid changes that affect protein function.** *Nucleic Acids Research* 2003, **31**(13):3812–3814. 10.1093/nar/gkg509

Article
PubMed Central
CAS
PubMed
Google Scholar

Pearl J: *Probabilistic Reasoning in Intelligent Systems*. Morgan Kaufmann Publishers, San Mateo, California; 1988.

Google Scholar

Pedersen JS, Bejerano G, Siepel A, Rosenbloom K, Lindblad-Toh K, Lander ES, Kent J, Miller W, Haussler D: **Identification and classification of conserved RNA secondary structures in the human genome.** *PLoS Computational Biology* 2006, **2**(4):e33. 10.1371/journal.pcbi.0020033

Article
PubMed Central
CAS
PubMed
Google Scholar

Pedersen JS, Hein J: **Gene finding with a hidden Markov model of genome structure and evolution.** *Bioinformatics* 2003, **19**(2):219–227. 10.1093/bioinformatics/19.2.219

Article
CAS
PubMed
Google Scholar

Pedersen JS, Meyer IM, Forsberg R, Simmonds P, Hein J: **A comparative method for finding and folding RNA secondary structures within protein-coding regions.** *Nucleic Acids Research* 2004, **32**(16):4925–4923. 10.1093/nar/gkh839

Article
PubMed Central
CAS
PubMed
Google Scholar

Pollard KatherineS, Salama SofleR, Lambert Nelle, Lambot Marie-Alexandra, Coppens Sandra, Pedersen JakobS, Katzman Sol, King Bryan, Onodera Courtney, Siepel Adam, Kern AndrewD, Dehay Colette, Igel Haller, Ares Manuel Jr, Vanderhaeghen Pierre, Haussler David: **An RNA gene expressed during cortical development evolved rapidly in humans.** *Nature* 2006, **443**(7108):167–172. 10.1038/nature05113

Article
CAS
PubMed
Google Scholar

Pollock DD, Taylor WR, Goldman N: **Coevolving protein residues: maximum likelihood identification and relationship to structure.** *Journal of Molecular Biology* 1999, **287**(1):187–198. 10.1006/jmbi.1998.2601

Article
CAS
PubMed
Google Scholar

Rabiner LR, Juang BH: **An introduction to hidden Markov models.** *IEEE ASSP Magazine* 1986, **3**(1):4–16.

Article
Google Scholar

Rivas E, Eddy SR: **The language of RNA: a formal grammar that includes pseudoknots.** *Bioinformatics* 2000, **16**(4):334–340. 10.1093/bioinformatics/16.4.334

Article
CAS
PubMed
Google Scholar

Rivas E, Eddy SR: **Noncoding RNA gene detection using comparative sequence analysis.** *BMC Bioinformatics* 2001., **2**(8):

Rivest R: **S-expressions. Internet Draft.**1997. [http://theory.lcs.mit.edu/~rivest/sexp.txt]

Google Scholar

Rohl CA, Strauss CE, Misura KM, Baker D: **Protein structure prediction using Rosetta.** *Methods in Enzymology* 2004, **383:** 66–93.

Article
CAS
PubMed
Google Scholar

Saitou N, Nei M: **The neighbor-joining method: a new method for reconstructing phylogenetic trees.** *Molecular Biology and Evolution* 1987, **4:** 406–425.

CAS
PubMed
Google Scholar

Sakakibara Y, Brown M, Hughey R, Saira Mian I, Kimmen Sjölander, Underwood RC, Haussler D: **Stochastic context-free grammars for tRNA modeling.** *Nucleic Acids Research* 1994, **22:** 5112–5120.

Article
PubMed Central
CAS
PubMed
Google Scholar

Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D: **Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.** *Genome Research* 2005, **15**(8):1034–1050. 10.1101/gr.3715005

Article
PubMed Central
CAS
PubMed
Google Scholar

Siepel A, Haussler D: **Combining phylogenetic and hidden Markov models in biosequence analysis.** *Journal of Computational Biology* 2004, **11**(2–3):413–428. 10.1089/1066527041410472

Article
CAS
PubMed
Google Scholar

Siepel A, Haussler D: **Phylogenetic estimation of context-dependent substitution rates by maximum likelihood.** *Molecular Biology and Evolution* 2004, **21**(3):468–488. 10.1093/molbev/msh039

Article
CAS
PubMed
Google Scholar

Soyer OS, Goldstein RA: **Predicting functional sites in proteins: site-specific evolutionary models and their application to neurotransmitter transporters.** *Journal of Molecular Biology* 2004, **339**(1):227–242. 10.1016/j.jmb.2004.03.025

Article
CAS
PubMed
Google Scholar

Thorne JL, Goldman N, Jones DT: **Combining protein evolution and secondary structure.** *Molecular Biology and Evolution* 1996, **13:** 666–673.

Article
CAS
PubMed
Google Scholar

Thorne JL, Kishino H, Felsenstein J: **An evolutionary model for maximum likelihood alignment of DNA sequences.** *Journal of Molecular Evolution* 1991, **33:** 114–124. 10.1007/BF02193625

Article
CAS
PubMed
Google Scholar

Wasserman WW, Fickett JW: **Identification of regulatory regions which confer muscle-specific gene expression.** *Journal of Molecular Biology* 1998, **278**(1):167–181. 10.1006/jmbi.1998.1700

Article
CAS
PubMed
Google Scholar

Whelan S, de Bakker PI, Goldman N: **Pandit: a database of protein and associated nucleotide domains with inferred trees.** *Bioinformatics* 2003, **19**(12):1556–1563. 10.1093/bioinformatics/btg188

Article
CAS
PubMed
Google Scholar

Whelan S, Goldman N: **A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.** *Molecular Biology and Evolution* 2001, **18**(5):691–699.

Article
CAS
PubMed
Google Scholar

**The xgram file format**[http://biowiki.org/XgramFormat]

**Information on xrate, xgram, xprot, xfold and related tools**[http://biowiki.org/XgramSoftware]

Yang Z: **Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites.** *Molecular Biology and Evolution* 1993, **10:** 1396–1401.

CAS
PubMed
Google Scholar

Yang Z: **Estimating the pattern of nucleotide substitution.** *Journal of Molecular Evolution* 1994, **39:** 105–111.

PubMed
Google Scholar

Yang Z: **Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods.** *Journal of Molecular Evolution* 1994, **39:** 306–314. 10.1007/BF00160154

Article
CAS
PubMed
Google Scholar

Yang Z, Nielsen R, Goldman N, Pedersen A-M: **Codon-substitution models for heterogeneous selection pressure at amino acid sites.** *Genetics* 2000, **155:** 432–449.

Google Scholar

Yap VB, Speed TP: *Statistical Methods in Molecular Evolution, chapter Estimating substitution matrices*. Springer; 2005.

Google Scholar