Sequence analysis (methods)

Section edited by Olivier Poch

This section incorporates all aspects of sequence analysis methodology, including but not limited to: sequence alignment algorithms, discrete algorithms, phylogeny algorithms, gene prediction and sequence clustering methods.

  1. Content type: Methodology Article

    Intraspecific variation in ploidy occurs in a wide range of species including pathogenic and nonpathogenic eukaryotes such as yeasts and oomycetes. Ploidy can be inferred indirectly - without measuring DNA con...

    Authors: Clemens L. Weiß, Marina Pais, Liliana M. Cano, Sophien Kamoun and Hernán A. Burbano

    Citation: BMC Bioinformatics 2018 19:122

  2. Content type: Software

    DNA methylation is an important epigenetic modification critical in regulation and transgenerational inheritance. The methylation level can be estimated at single-nucleotide resolution by whole-genome bisulfit...

    Authors: Kevin Yu Yuan Huang, Yan-Jiun Huang and Pao-Yang Chen

    Citation: BMC Bioinformatics 2018 19:111

  3. Content type: Research Article

    DNA methylation patterns store epigenetic information in the vast majority of eukaryotic species. The relatively high costs and technical challenges associated with the detection of DNA methylation, however, hav...

    Authors: Ingo Bulla, Benoît Aliaga, Virginia Lacal, Jan Bulla, Christoph Grunau and Cristian Chaparro

    Citation: BMC Bioinformatics 2018 19:105

  4. Content type: Software

    Prioritization of sequence variants for diagnosis and discovery of Mendelian diseases is challenging, especially in large collections of whole genome sequences (WGS). Fast, scalable solutions are needed for di...

    Authors: Steven Flygare, Edgar Javier Hernandez, Lon Phan, Barry Moore, Man Li, Anthony Fejes, Hao Hu, Karen Eilbeck, Chad Huff, Lynn Jorde, Martin G. Reese and Mark Yandell

    Citation: BMC Bioinformatics 2018 19:57

  5. Content type: Software

    The function of many noncoding RNAs (ncRNAs) depends upon their secondary structures. Over the last decades, several methodologies have been developed to predict such structures or to use them to functionally a...

    Authors: Raúl Arias-Carrasco, Yessenia Vásquez-Morán, Helder I. Nakaya and Vinicius Maracaja-Coutinho

    Citation: BMC Bioinformatics 2018 19:55

  6. Content type: Methodology Article

    Long read sequencing is changing the landscape of genomic research, especially de novo assembly. Despite the high error rate inherent to long read technologies, increased read lengths dramatically improve the con...

    Authors: Jeremy R. Wang, James Holt, Leonard McMillan and Corbin D. Jones

    Citation: BMC Bioinformatics 2018 19:50

  7. Content type: Software

    Over the last few decades, computational genomics has contributed tremendously to deciphering biology from genome sequences and related data. Considerable effort has been devoted to the prediction of transcriptio...

    Authors: Marco Di Salvo, Eva Pinatel, Adelfia Talà, Marco Fondi, Clelia Peano and Pietro Alifano

    Citation: BMC Bioinformatics 2018 19:36

  8. Content type: Software

    Identification of differentially methylated regions (DMRs) is the initial step towards the study of DNA methylation-mediated gene regulation. Previous approaches to call DMRs suffer from false prediction, use ...

    Authors: David E. Condon, Phu V. Tran, Yu-Chin Lien, Jonathan Schug, Michael K. Georgieff, Rebecca A. Simmons and Kyoung-Jae Won

    Citation: BMC Bioinformatics 2018 19:31

  9. Content type: Software

    Genomic islands play an important role in microbial genome evolution, providing a mechanism for strains to adapt to new ecological conditions. A variety of computational methods, both genome-composition based ...

    Authors: Eliot C. Bush, Anne E. Clark, Carissa A. DeRanek, Alexander Eng, Juliet Forman, Kevin Heath, Alexander B. Lee, Daniel M. Stoebel, Zunyan Wang, Matthew Wilber and Helen Wu

    Citation: BMC Bioinformatics 2018 19:32

  10. Content type: Software

    GenIO is a novel web-server, designed to assist clinical genomics researchers and medical doctors in the diagnostic process of rare genetic diseases. The tool identifies the most probable variants causing a ra...

    Authors: Daniel Koile, Marta Cordoba, Maximiliano de Sousa Serro, Marcelo Andres Kauffman and Patricio Yankilevich

    Citation: BMC Bioinformatics 2018 19:25

  11. Content type: Research Article

    The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity...

    Authors: Guido Zampieri, Dinh Van Tran, Michele Donini, Nicolò Navarin, Fabio Aiolli, Alessandro Sperduti and Giorgio Valle

    Citation: BMC Bioinformatics 2018 19:23

  12. Content type: Methodology Article

    Cluster analysis is the most common unsupervised method for finding hidden groups in data. Clustering presents two main challenges: (1) finding the optimal number of clusters, and (2) removing “outliers” among...

    Authors: Min Wang, Zachary B. Abrams, Steven M. Kornblau and Kevin R. Coombes

    Citation: BMC Bioinformatics 2018 19:9

  13. Content type: Research Article

    Genomic imprinting is one of the well-known epigenetic factors causing the association between traits and genes, and has generally been examined by detecting parent-of-origin effects of alleles. A lot of metho...

    Authors: Qi-Lei Zou, Xiao-Ping You, Jian-Long Li, Wing Kam Fung and Ji-Yuan Zhou

    Citation: BMC Bioinformatics 2018 19:8

  14. Content type: Methodology Article

    ‘Next-generation’ sequencing (NGS) has wide application in medical genetics, including the detection of somatic variation in cancer. The Ion Torrent-based (IONT) platform is among NGS technologies employed in ...

    Authors: Aditya Deshpande, Wenhua Lang, Tina McDowell, Smruthy Sivakumar, Jiexin Zhang, Jing Wang, F. Anthony San Lucas, Jerry Fowler, Humam Kadara and Paul Scheet

    Citation: BMC Bioinformatics 2018 19:5

  15. Content type: Software

    De novo prediction of Transcription Factor Binding Sites (TFBS) using computational methods is a difficult task and an important problem in bioinformatics. The correct recognition of TFBS plays an important...

    Authors: Jader M. Caldonazzo Garbelini, André Y. Kashiwabara and Danilo S. Sanches

    Citation: BMC Bioinformatics 2018 19:4

  16. Content type: Methodology Article

    Genotyping-by-sequencing (GBS), a method to identify genetic variants and quickly genotype samples, reduces genome complexity by using restriction enzymes to divide the genome into fragments whose ends are seq...

    Authors: Daniel P. Wickland, Gopal Battu, Karen A. Hudson, Brian W. Diers and Matthew E. Hudson

    Citation: BMC Bioinformatics 2017 18:586

  17. Content type: Research Article

    The next generation sequencing (NGS) techniques have been around for over a decade. Many of their fundamental applications rely on the ability to compute good genome assemblies. As the technology evolves, the ...

    Authors: Nilesh Khiste and Lucian Ilie

    Citation: BMC Bioinformatics 2017 18:564

  18. Content type: Methodology Article

    The spatial Principal Component Analysis (sPCA; Jombart, Heredity 101:92-103, 2008) is designed to investigate non-random spatial distributions of genetic variation. Unfortunately, the associated tests used fo...

    Authors: V. Montano and T. Jombart

    Citation: BMC Bioinformatics 2017 18:562

  19. Content type: Methodology Article

    One of the most crucial steps in high-throughput sequence-based microbiome studies is the taxonomic assignment of sequences belonging to operational taxonomic units (OTUs). Without taxonomic classification, fu...

    Authors: Kristi Gdanetz, Gian Maria Niccolò Benucci, Natalie Vande Pol and Gregory Bonito

    Citation: BMC Bioinformatics 2017 18:538

  20. Content type: Methodology Article

    High-throughput sequencing has made it theoretically possible to obtain high-quality de novo assembled genome sequences but in practice DNA extracts are often contaminated with sequences from other organisms. Cur...

    Authors: Janna L. Fierst and Duncan A. Murdock

    Citation: BMC Bioinformatics 2017 18:533

  21. Content type: Methodology Article

    High-throughput sequencing offers higher throughput and lower cost for sequencing a genome. However, sequencing errors, including mismatches and indels, may be produced during sequencing. Because errors may r...

    Authors: Yao-Ting Huang and Yu-Wen Huang

    Citation: BMC Bioinformatics 2017 18:524

  22. Content type: Research Article

    Mantle Cell Lymphoma (MCL) is an aggressive B-cell neoplasia accounting for about 6% of all lymphomas. The most common molecular marker of clonality in MCL, as in other B lymphoproliferative disorders, is t...

    Authors: Marco Beccuti, Elisa Genuardi, Greta Romano, Luigia Monitillo, Daniela Barbero, Mario Boccadoro, Marco Ladetto, Raffaele Calogero, Simone Ferrero and Francesca Cordero

    Citation: BMC Bioinformatics 2017 18:516

  23. Content type: Methodology Article

    The sequence of nucleotides in an RNA determines the possible base pairs for an RNA fold and thus also determines the overall shape and function of an RNA. The Swellix program presented here combines a helix a...

    Authors: Nathan Sloat, Jui-Wen Liu and Susan J. Schroeder

    Citation: BMC Bioinformatics 2017 18:504

  24. Content type: Research Article

    Gene expression profiling has led to the definition of breast cancer molecular subtypes: Basal-like, HER2-enriched, LuminalA, LuminalB and Normal-like. Different subtypes exhibit diverse responses to treatment...

    Authors: Liying Yang, Yunyan Shen, Xiguo Yuan, Junying Zhang and Jianhua Wei

    Citation: BMC Bioinformatics 2017 18:481

  25. Content type: Methodology Article

    De novo transcriptome assembly is an important technique for understanding gene expression in non-model organisms. Many de novo assemblers using the de Bruijn graph of a set of the RNA sequences rely on in-mem...

    Authors: Chang Sik Kim, Martyn D. Winn, Vipin Sachdeva and Kirk E. Jordan

    Citation: BMC Bioinformatics 2017 18:467

  26. Content type: Research Article

    Biomedical named entity recognition (BNER) is a crucial initial step of information extraction in the biomedical domain. The task is typically modeled as a sequence labeling problem. Various machine learning algori...

    Authors: Chen Lyu, Bo Chen, Yafeng Ren and Donghong Ji

    Citation: BMC Bioinformatics 2017 18:462

  27. Content type: Methodology Article

    Somatic mutations accumulate in human cells throughout life. Some may have no adverse consequences, but some of them may lead to cancer. A cancer genome is typically unstable, and thus more mutations can accum...

    Authors: Yahya Bokhari and Tomasz Arodz

    Citation: BMC Bioinformatics 2017 18:458

  28. Content type: Methodology Article

    Detection of important functional and/or structural elements and identification of their positions in a large eukaryotic genomic sequence is an active research area. The gene is an important functional and struct...

    Authors: Biswanath Chowdhury, Arnav Garai and Gautam Garai

    Citation: BMC Bioinformatics 2017 18:460

  29. Content type: Software

    Pre-processing of high-throughput sequencing data for immune repertoire profiling is essential to ensure high-quality input for downstream analysis. VDJPipe is a flexible, high-performance tool that can perfor...

    Authors: Scott Christley, Mikhail K. Levin, Inimary T. Toby, John M. Fonner, Nancy L. Monson, William H. Rounds, Florian Rubelt, Walter Scarborough, Richard H. Scheuermann and Lindsay G. Cowell

    Citation: BMC Bioinformatics 2017 18:448

  30. Content type: Methodology Article

    Geminiviruses infect a broad range of cultivated and non-cultivated plants, causing significant economic losses worldwide. The studies of the diversity of species, taxonomy, mechanisms of evolution, geographic...

    Authors: José Cleydson F. Silva, Thales F. M. Carvalho, Elizabeth P. B. Fontes and Fabio R. Cerqueira

    Citation: BMC Bioinformatics 2017 18:431

  31. Content type: Methodology Article

    Metagenomics sequencing provides deep insights into microbial communities. To investigate their taxonomic structure, binning assembled contigs into discrete clusters is critical. Many binning algorithms have b...

    Authors: Ying Wang, Kun Wang, Yang Young Lu and Fengzhu Sun

    Citation: BMC Bioinformatics 2017 18:425

  32. Content type: Research Article

    Prediction of DNA-binding residue is important for understanding the protein-DNA recognition mechanism. Many computational methods have been proposed for the prediction, but most of them do not consider the re...

    Authors: Jiyun Zhou, Qin Lu, Ruifeng Xu, Yulan He and Hongpeng Wang

    Citation: BMC Bioinformatics 2017 18:379

  33. Content type: Methodology Article

    Alignment-free methods for comparing protein sequences have proved to be viable alternatives to approaches that first rely on an alignment of the sequences to be compared. Much work, however, needs to be done bef...

    Authors: Saghi Nojoomi and Patrice Koehl

    Citation: BMC Bioinformatics 2017 18:378

  34. Content type: Methodology Article

    A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The proposed method is a two-step approach. The first step mo...

    Authors: James J. Yang, L Keoki Williams and Anne Buu

    Citation: BMC Bioinformatics 2017 18:376

  35. Content type: Research Article

    Recently, many standalone applications have been proposed to correct sequencing errors in Illumina data. The key idea is that downstream analysis tools such as de novo genome assemblers benefit from a reduced err...

    Authors: Mahdi Heydari, Giles Miclotte, Piet Demeester, Yves Van de Peer and Jan Fostier

    Citation: BMC Bioinformatics 2017 18:374
