Van Belkum A, Struelens M, de Visser A, Verbrugh H, Tibayrenc M. Role of genomic typing in taxonomy, evolutionary genetics, and microbial epidemiology. Clin Microbiol Rev. 2001; 14(3):547–60.

Article
CAS
PubMed
PubMed Central
Google Scholar

Struck D, Lawyer G, Ternes AM, Schmit JC, Bercoff DP. Comet: adaptive context-based modeling for ultrafast hiv-1 subtype identification. Nucleic Acids Res. 2014; 42(18):e144.

Article
PubMed
PubMed Central
Google Scholar

Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25(17):3389–402.

Article
CAS
PubMed
PubMed Central
Google Scholar

Edgar RC. Search and clustering orders of magnitude faster than blast. Bioinformatics. 2010; 26(19):2460–1.

Article
CAS
PubMed
Google Scholar

Bao Y, Chetvernin V, Tatusova T. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification. Arch Virol. 2014; 159(12):3293–304.

Article
CAS
PubMed
PubMed Central
Google Scholar

Lauber C, Gorbalenya AE. Partitioning the genetic diversity of a virus family: Approach and evaluation through a case study of picornaviruses. J Virol. 2012; 86(7):3890–904.

Article
CAS
PubMed
PubMed Central
Google Scholar

de Oliveira T, Deforche K, Cassol S, Salminen M, Paraskevis D, Seebregts C, Snoeck J, van Rensburg EJ, Wensing AMJ, van de Vijver DA, Boucher CA, Camacho R, Vandamme AM. An automated genotyping system for analysis of hiv-1 and other microbial sequences. Bioinformatics. 2005; 21(19):3797–800.

Article
CAS
PubMed
Google Scholar

Alcantara LCJ, Cassol S, Libin P, Deforche K, Pybus OG, Van Ranst M, Galvao-Castro B, Vandamme AM, de Oliveira T. A standardized framework for accurate, high-throughput genotyping of recombinant and non-recombinant viral sequences. Nucleic Acids Res. 2009; 37(Web Server issue):W634–42.

Article
CAS
PubMed
PubMed Central
Google Scholar

Matsen FA, Kodner RB, Armbrust EV. pplacer: linear time maximum-likelihood and bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics. 2010; 11:538.

Article
PubMed
PubMed Central
Google Scholar

Liu Z, Meng J, Sun X. A novel feature-based method for whole genome phylogenetic analysis without alignment: Application to HEV genotyping and subtyping. Biochem Biophys Res Commun. 2008; 368(2):223–30.

Article
CAS
PubMed
Google Scholar

Yu C, Hernandez T, Zheng H, Yau SC, Huang HH, He RL, Yang J, Yau SS-T. Real time classification of viruses in 12 dimensions. PLoS One. 2013; 8(5):e64328.

Article
PubMed
PubMed Central
Google Scholar

Vinga S, Almeida J. Alignment-free sequence comparison–a review. Bioinformatics. 2003; 19(4):513–23.

Article
CAS
PubMed
Google Scholar

Bonham-Carter O, Steele J, Bastola D. Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis. Brief Bioinform. 2014; 15(6):890–905.

Article
PubMed
Google Scholar

Mantaci S, Restivo A, Sciortino M. Distance measures for biological sequences: Some recent approaches. Int J Approx Reason. 2008; 47(1):109–24.

Article
Google Scholar

Xing Z, Pei J, Keogh E. A brief survey on sequence classification. ACM SIGKDD Explor. 2010; 12(1):40–48.

Article
Google Scholar

Williams RC. Restriction fragment length polymorphism (RFLP). Am J Phys Anthropol. 1989; 32(S10):159–84.

Article
Google Scholar

Bernard HU, Chan SY, Manos MM, Ong CK, Villa LL, Delius H, Peyton CL, Bauer HM, Wheeler CM. Identification and assessment of known and novel human papillomaviruses by polymerase chain reaction amplification, restriction fragment length polymorphisms, nucleotide sequence, and phylogenetic algorithms. J Infect Dis. 1994; 170(5):1077–85.

Article
CAS
PubMed
Google Scholar

Nobre RJ, de Almeida LP, Martins TC. Complete genotyping of mucosal human papillomavirus using a restriction fragment length polymorphism analysis and an original typing algorithm. J Clin Virol. 2008; 42(1):13–21.

Article
CAS
PubMed
Google Scholar

Janini LM, Pieniazek D, Peralta JM, Schechter M, Tanuri A, Vicente ACP, dela Torre N, Pieniazek NJ, Luo CC, Kalish ML, Schochetman G, Rayfield MA. Identification of single and dual infections with distinct subtypes of human immunodeficiency virus type 1 by using restriction fragment length polymorphism analysis. Virus Genes. 1996; 13(1):69–81.

Article
CAS
PubMed
Google Scholar

Mizokami M, Nakano T, Orito E, Tanaka Y, Sakugawa H, Mukaide M, Robertson BH. Hepatitis B virus genotype assignment using restriction fragment length polymorphism patterns. FEBS Lett. 1999; 450(1–2):66–71.

Article
CAS
PubMed
Google Scholar

Nakao T, Enomoto N, Takada N, Takada A, Date T. Typing of hepatitis C virus genomes by restriction fragment length polymorphism. J Gen Virol. 1991; 72(9):2105–12.

Article
CAS
PubMed
Google Scholar

Pevzner P. Computational Molecular Biology: An Algorithmic Approach. Cambridge: MIT press; 2000.

Google Scholar

Adams J, Rothman E. Estimation of phylogenetic relationships from dna restriction patterns and selection of endonuclease cleavage sites. Proc Natl Acad Sci USA. 1982; 79(11):3560–4.

Article
CAS
PubMed
PubMed Central
Google Scholar

Templeton AR. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of human and the apes. Evolution. 1983; 37(2):221–44.

Article
CAS
Google Scholar

Felsenstein J. Phylogenies from restriction sites: A maximum-likelihood approach. Evolution. 1992; 46(1):159–73.

Article
Google Scholar

Chang HW, Cheng YH, Chuang LY, Yang CH. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping. BMC Bioinformatics. 2010; 11:173.

Article
PubMed
PubMed Central
Google Scholar

Bajla I, Holländer I, Fluch S, Burg K, Kollár M. An alternative method for electrophoretic gel image analysis in the GelMaster software. Comput Methods Programs Biomed. 2005; 77(3):209–31.

Article
CAS
PubMed
Google Scholar

Maramis CF, Delopoulos AN, Lambropoulos AF. A computerized methodology for improved virus typing by PCR-RFLP gel electrophoresis. IEEE Trans Biomed Eng. 2011; 58(8):2339–51.

Article
Google Scholar

Roberts RJ, Vincze T, Posfai J, Macelis D. REBASE–a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res. 2015; 43(Database issue):298–9.

Article
Google Scholar

Ben-Bassat M. 35 Use of distance measures, information measures and error bounds in feature evaluation. Handbook of Statistics. 1982; 2:773–91.

Article
Google Scholar

Quinlan JR. C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann Publishers Inc; 1993.

Google Scholar

Breiman L. Random forests. Mach Learn. 2001; 45(1):5–32.

Article
Google Scholar

Langley P, Iba W, Thompson K. An analysis of bayesian classifiers. In: Proceedings of the Tenth National Conference on Artificial Intelligence. AAAI’92. Menlo Park: AAAI Press: 1992. p. 223–8.

Google Scholar

John GH, Langley P. Estimating continuous distributions in bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence. UAI’95. San Francisco: Morgan Kaufmann Publishers Inc: 1995. p. 338–45.

Google Scholar

Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995; 20(3):273–97.

Google Scholar

Cover T, Hart P. Nearest neighbor pattern classification. IEEE Trans Inf Theory. 1967; 13(1):21–7.

Article
Google Scholar

Aha DW, Kibler D, Albert MK. Instance-based learning algorithms. Mach Learn. 1991; 6(1):37–66.

Google Scholar

Freund Y, Schapire RE. A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci. 1997; 55(1):119–39.

Article
Google Scholar

Breiman L. Bagging predictors. Mach Learn. 1996; 24(2):123–40.

Google Scholar

Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software: an update. ACM SIGKDD Explor. 2009; 11(1):10–18.

Article
Google Scholar

Daigle B, Makarenkov V, Diallo AB. Effect of hundreds sequenced genomes on the classification of human papillomaviruses. In: Data Science, Learning by Latent Structures, and Knowledge Discovery. Berlin, Heidelberg: Springer: 2015. p. 309–18.

Google Scholar

Bernard HU, Burk RD, Chen Z, van Doorslaer K, zur Hausen H, de Villiers EM. Classification of papillomaviruses (PVs) based on 189 PV types and proposal of taxonomic amendments. Virology. 2010; 401(1):70–9.

Article
CAS
PubMed
PubMed Central
Google Scholar

Schaefer S. Hepatitis B virus taxonomy and hepatitis B virus genotypes. World J Gastroenterol. 2007; 13(1):14–21.

Article
CAS
PubMed
PubMed Central
Google Scholar

NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2016; 44(Database issue):D7–19.

Google Scholar

Robertson DL, Anderson JP, Bradac JA, Carr JK, Foley B, Funkhouser RK, Gao F, Hahn BH, Kalish ML, Kuiken C, Learn GH, Leitner T, McCutchan F, Osmanov S, Peeters M, Pieniazek D, Salminen M, Sharp PM, Wolinsky S, Korber B. HIV-1 nomenclature proposal. Science. 2000; 288(5463):55–6.

Article
CAS
PubMed
Google Scholar

Plantier JC, Leoz M, Dickerson JE, De Oliveira F, Cordonnier F, Lemée V, Damond F, Robertson DL, Simon F. A new human immunodeficiency virus derived from gorillas. Nat Med. 2009; 15(8):871–2.

Article
CAS
PubMed
Google Scholar

Gao F, Robertson DL, Carruthers CD, Morrison SG, Jian B, Chen Y, Barré-Sinoussi F, Girard M, Srinivasan A, Alashle G A, Abimiku AG, Shaw GM, Sharp PM, Hahn BH. A comprehensive panel of near-full-length clones and reference sequences for non-subtype B isolates of human immunodeficiency virus type 1. J Virol. 1998; 72(7):5680–98.

CAS
PubMed
PubMed Central
Google Scholar

Muñoz N, Bosch FX, de Sanjosé S, Herrero R, Castellsagué X, Shah KV, Snijders PJF, Meijer CJLM. Epidemiologic classification of human papillomavirus types associated with cervical cancer. N Engl J Med. 2003; 348(6):518–27.

Article
PubMed
Google Scholar

Perz JF, Armstrong GL, Farrington LA, Hutin YJF, Bell BP. The contributions of hepatitis B virus and hepatitis C virus infections to cirrhosis and primary liver cancer worldwide. J Hepatol. 2006; 45(4):529–38.

Article
PubMed
Google Scholar

Libbrecht MW, Noble WS. Machine learning applications in genetics and genomics. Nat Rev Genet. 2015; 16(6):321–32.

Article
CAS
PubMed
PubMed Central
Google Scholar

Lin WJ, Chen JJ. Class-imbalanced classifiers for high-dimensional data. Brief Bioinform. 2013; 14(1):13–26.

Article
PubMed
Google Scholar

Blagus R, Lusa L. Class prediction for high-dimensional class-imbalanced data. BMC Bioinformatics. 2010; 11:523.

Article
PubMed
PubMed Central
Google Scholar

Rousseeuw PJ. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987; 20:53–65.

Article
Google Scholar