Skip to content


Sequence analysis (methods)

Section edited by Olivier Poch

This section incorporates all aspects of sequence analysis methodology, including but not limited to: sequence alignment algorithms, discrete algorithms, phylogeny algorithms, gene prediction and sequence clustering methods.

Page 1 of 10

  1. Content type: Software

    Next-generation sequencing is revolutionising diagnosis and treatment of rare diseases, however its application to understanding common disease aetiology is limited. Rare disease applications binarily attribut...

    Authors: E. Mossotto, J. J. Ashton, L. O’Gorman, R. J. Pengelly, R. M. Beattie, B. D. MacArthur and S. Ennis

    Citation: BMC Bioinformatics 2019 20:254

    Published on:

  2. Content type: Methodology article

    The development of whole genome bisulfite sequencing has made it possible to identify methylation differences at single base resolution throughout an entire genome. However, a persistent challenge in DNA methy...

    Authors: Akanksha Srivastava, Yuliya V. Karpievitch, Steven R. Eichten, Justin O. Borevitz and Ryan Lister

    Citation: BMC Bioinformatics 2019 20:253

    Published on:

  3. Content type: Software

    With the widespread use of multiple amplicon-sequencing (MAS) in genetic variation detection, an efficient tool is required to remove primer sequences from short reads to ensure the reliability of downstream a...

    Authors: Xiaolong Zhang, Yanyan Shao, Jichao Tian, Yuwei Liao, Peiying Li, Yu Zhang, Jun Chen and Zhiguang Li

    Citation: BMC Bioinformatics 2019 20:236

    Published on:

  4. Content type: Software

    The Oxford Nanopore Technologies (ONT) MinION portable sequencer makes it possible to use cutting-edge genomic technologies in the field and the academic classroom.

    Authors: Héctor Rodríguez-Pérez, Tamara Hernández-Beeftink, José M. Lorenzo-Salazar, José L. Roda-García, Carlos J. Pérez-González, Marcos Colebrook and Carlos Flores

    Citation: BMC Bioinformatics 2019 20:234

    Published on:

  5. Content type: Methodology article

    Draft quality genomes for a multitude of organisms have become common due to the advancement of genome assemblers using long-read technologies with high error rates. Although current assemblies are substantial...

    Authors: Philipp Bongartz

    Citation: BMC Bioinformatics 2019 20:232

    Published on:

  6. Content type: Software

    RNA sequencing (RNA-seq) has become the standard means of analyzing gene and transcript expression in high-throughput. While previously sequence alignment was a time demanding step, fast alignment methods and ...

    Authors: Paula Pérez-Rubio, Claudio Lottaz and Julia C. Engelmann

    Citation: BMC Bioinformatics 2019 20:226

    Published on:

  7. Content type: Methodology article

    Data from genome-wide association studies (GWASs) have been used to estimate the heritability of human complex traits in recent years. Existing methods are based on the linear mixed model, with the assumption ...

    Authors: Xin Li, Dongya Wu, Yue Cui, Bing Liu, Henrik Walter, Gunter Schumann, Chong Li and Tianzi Jiang

    Citation: BMC Bioinformatics 2019 20:219

    Published on:

  8. Content type: Methodology article

    Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant chal...

    Authors: A. Iacoangeli, A. Al Khleifat, W. Sproviero, A. Shatunov, A. R. Jones, S. L. Morgan, A. Pittman, R. J. Dobson, S. J. Newhouse and A. Al-Chalabi

    Citation: BMC Bioinformatics 2019 20:213

    Published on:

  9. Content type: Methodology article

    Establishment and maintenance of DNA methylation throughout the genome is an important epigenetic mechanism that regulates gene expression whose disruption has been implicated in human diseases like cancer. It...

    Authors: Garrett Jenkinson, Jordi Abante, Michael A. Koldobskiy, Andrew P. Feinberg and John Goutsias

    Citation: BMC Bioinformatics 2019 20:175

    Published on:

  10. Content type: Methodology article

    Identifying transcriptional enhancers and other cis-regulatory modules (CRMs) is an important goal of post-sequencing genome annotation. Computational approaches provide a useful complement to empirical methods f...

    Authors: Hasiba Asma and Marc S. Halfon

    Citation: BMC Bioinformatics 2019 20:174

    Published on:

  11. Content type: Methodology article

    Whole exome sequencing (WES) has been widely used in human genetics research. BGISEQ-500 is a recently established next-generation sequencing platform. However, the performance of BGISEQ-500 on WES is not well...

    Authors: Yu Xu, Zhe Lin, Chong Tang, Yujing Tang, Yue Cai, Hongbin Zhong, Xuebin Wang, Wenwei Zhang, Chongjun Xu, Jingjing Wang, Jian Wang, Huanming Yang, Linfeng Yang and Qiang Gao

    Citation: BMC Bioinformatics 2019 20:153

    Published on:

  12. Content type: Methodology article

    Functional antibody genes are often assembled by VDJ recombination and then diversified by somatic hypermutation. Identifying the combination of sourcing germline genes is critical to understand the process of...

    Authors: Qingchen Zhang, Lu Zhang, Chen Zhou, Yiyan Yang, Zuojing Yin, Dingfeng Wu, Kailin Tang and Zhiwei Cao

    Citation: BMC Bioinformatics 2019 20:137

    Published on:

  13. Content type: Research article

    As an important type of post-translational modification (PTM), protein glycosylation plays a crucial role in protein stability and protein function. The abundance and ubiquity of protein glycosylation across t...

    Authors: Fuyi Li, Yang Zhang, Anthony W. Purcell, Geoffrey I. Webb, Kuo-Chen Chou, Trevor Lithgow, Chen Li and Jiangning Song

    Citation: BMC Bioinformatics 2019 20:112

    Published on:

  14. Content type: Methodology article

    Group structures among genes encoded in functional relationships or biological pathways are valuable and unique features in large-scale molecular data for survival analysis. However, most of previous approache...

    Authors: Zaixiang Tang, Shufeng Lei, Xinyan Zhang, Zixuan Yi, Boyi Guo, Jake Y. Chen, Yueping Shen and Nengjun Yi

    Citation: BMC Bioinformatics 2019 20:94

    Published on:

  15. Content type: Research article

    The development of sequencing techniques and statistical methods provides great opportunities for identifying the impact of rare genetic variation on complex traits. However, there is a lack of knowledge on th...

    Authors: Xinyuan Zhang, Anna O. Basile, Sarah A. Pendergrass and Marylyn D. Ritchie

    Citation: BMC Bioinformatics 2019 20:46

    Published on:

  16. Content type: Research article

    The analysis of single-cell RNA sequencing (scRNAseq) data plays an important role in understanding the intrinsic and extrinsic cellular processes in biological and biomedical research. One significant effort ...

    Authors: Tianyu Wang, Boyang Li, Craig E. Nelson and Sheida Nabavi

    Citation: BMC Bioinformatics 2019 20:40

    Published on:

  17. Content type: Software

    Single-cell sequencing experiments use short DNA barcode ‘tags’ to identify reads that originate from the same cell. In order to recover single-cell information from such experiments, reads must be grouped bas...

    Authors: Akshay Tambe and Lior Pachter

    Citation: BMC Bioinformatics 2019 20:32

    Published on:

  18. Content type: Methodology article

    Skewed X chromosome inactivation (XCI), which is a non-random process, is frequently observed in both healthy and affected females. Furthermore, skewed XCI has been reported to be related to many X-linked dise...

    Authors: Peng Wang, Yu Zhang, Bei-Qi Wang, Jian-Long Li, Yi-Xin Wang, Dongdong Pan, Xian-Bo Wu, Wing Kam Fung and Ji-Yuan Zhou

    Citation: BMC Bioinformatics 2019 20:11

    Published on:

  19. Content type: Methodology article

    DNA methylation of CpG dinucleotides is an essential epigenetic modification that plays a key role in transcription. Widely used DNA enrichment-based methods offer high coverage for measuring methylated CpG di...

    Authors: Jingting Xu, Shimeng Liu, Ping Yin, Serdar Bulun and Yang Dai

    Citation: BMC Bioinformatics 2018 19:540

    Published on:

  20. Content type: Research article

    The deployment of Genome-wide association studies (GWASs) requires genomic information of a large population to produce reliable results. This raises significant privacy concerns, making people hesitate to con...

    Authors: Charlotte Bonte, Eleftheria Makri, Amin Ardeshirdavani, Jaak Simm, Yves Moreau and Frederik Vercauteren

    Citation: BMC Bioinformatics 2018 19:537

    Published on:

  21. Content type: Methodology article

    Researchers typically sequence a given individual multiple times, either re-sequencing the same DNA sample (technical replication) or sequencing different DNA samples collected on the same individual (biologic...

    Authors: Ariel W. Chan, Amy L. Williams and Jean-Luc Jannink

    Citation: BMC Bioinformatics 2018 19:478

    Published on:

  22. Content type: Software

    Targeted resequencing has become the most used and cost-effective approach for identifying causative mutations of Mendelian diseases both for diagnostics and research purposes. Due to very rapid technological ...

    Authors: F. Musacchia, A. Ciolfi, M. Mutarelli, A. Bruselles, R. Castello, M. Pinelli, S. Basu, S. Banfi, G. Casari, M. Tartaglia and V. Nigro

    Citation: BMC Bioinformatics 2018 19:477

    Published on:

  23. Content type: Software

    Mouse xenografts from (patient-derived) tumors (PDX) or tumor cell lines are widely used as models to study various biological and preclinical aspects of cancer. However, analyses of their RNA and DNA profiles...

    Authors: Roelof J. C. Kluin, Kristel Kemper, Thomas Kuilman, Julian R. de Ruiter, Vivek Iyer, Josep V. Forment, Paulien Cornelissen-Steijger, Iris de Rink, Petra ter Brugge, Ji-Ying Song, Sjoerd Klarenbeek, Ultan McDermott, Jos Jonkers, Arno Velds, David J. Adams, Daniel S. Peeper…

    Citation: BMC Bioinformatics 2018 19:366

    Published on:

  24. Content type: Research article

    Conventional methods of motor imagery brain computer interfaces (MI-BCIs) suffer from the limited number of samples and simplified features, so as to produce poor performances with spatial-frequency features a...

    Authors: Tian-jian Luo, Chang-le Zhou and Fei Chao

    Citation: BMC Bioinformatics 2018 19:344

    Published on:

  25. Content type: Methodology article

    Targeted amplicon sequencing of the 16S ribosomal RNA gene is one of the key tools for studying microbial diversity. The accuracy of this approach strongly depends on the choice of primer pairs and, in particu...

    Authors: Francesco Sambo, Francesca Finotello, Enrico Lavezzo, Giacomo Baruzzo, Giulia Masi, Elektra Peta, Marco Falda, Stefano Toppo, Luisa Barzon and Barbara Di Camillo

    Citation: BMC Bioinformatics 2018 19:343

    Published on:

  26. Content type: Methodology article

    Identification of homologous genes is fundamental to comparative genomics, functional genomics and phylogenomics. Extensive public homology databases are of great value for investigating homology but need to b...

    Authors: Siavash Sheikhizadeh Anari, Dick de Ridder, M. Eric Schranz and Sandra Smit

    Citation: BMC Bioinformatics 2018 19:340

    Published on:

  27. Content type: Research article

    Detection of highly divergent or yet unknown viruses from metagenomics sequencing datasets is a major bioinformatics challenge. When human samples are sequenced, a large proportion of assembled contigs are cla...

    Authors: Zurab Bzhalava, Ardi Tampuu, Piotr Bała, Raul Vicente and Joakim Dillner

    Citation: BMC Bioinformatics 2018 19:336

    Published on:

  28. Content type: Methodology article

    The development of a disease is a complex process that may result from joint effects of multiple genes. In this article, we propose the overlapping group screening (OGS) approach to determining active genes an...

    Authors: Jie-Huei Wang and Yi-Hau Chen

    Citation: BMC Bioinformatics 2018 19:335

    Published on:

  29. Content type: Software

    The automated prediction of the enzymatic functions of uncharacterized proteins is a crucial topic in bioinformatics. Although several methods and tools have been proposed to classify enzymes, most of these st...

    Authors: Alperen Dalkiran, Ahmet Sureyya Rifaioglu, Maria Jesus Martin, Rengul Cetin-Atalay, Volkan Atalay and Tunca Doğan

    Citation: BMC Bioinformatics 2018 19:334

    Published on:

  30. Content type: Methodology article

    Sequence alignment is crucial in genomics studies. However, optimal multiple sequence alignment (MSA) is NP-hard. Thus, modern MSA methods employ progressive heuristics, breaking the problem into a series of p...

    Authors: Massimo Maiolo, Xiaolei Zhang, Manuel Gil and Maria Anisimova

    Citation: BMC Bioinformatics 2018 19:331

    Published on:

  31. Content type: Methodology article

    Conventional phylogenetic clustering approaches rely on arbitrary cutpoints applied a posteriori to phylogenetic estimates. Although in practice, Bayesian and bootstrap-based clustering tend to lead to similar...

    Authors: Luc Villandré, Aurélie Labbe, Bluma Brenner, Michel Roger and David A Stephens

    Citation: BMC Bioinformatics 2018 19:324

    Published on:

  32. Content type: Methodology article

    Normalization is essential to ensure accurate analysis and proper interpretation of sequencing data, and chromosome conformation capture data such as Hi-C have particular challenges. Although several methods h...

    Authors: Nicolas Servant, Nelle Varoquaux, Edith Heard, Emmanuel Barillot and Jean-Philippe Vert

    Citation: BMC Bioinformatics 2018 19:313

    Published on:

  33. Content type: Research article

    Viral infection by dengue virus is a major public health problem in tropical countries. Early diagnosis and detection are increasingly based on quantitative reverse transcriptase real-time polymerase chain rea...

    Authors: Kevin Vanneste, Linda Garlant, Sylvia Broeders, Steven Van Gucht and Nancy H. Roosens

    Citation: BMC Bioinformatics 2018 19:312

    Published on:

  34. Content type: Methodology article

    Targeted resequencing with high-throughput sequencing (HTS) platforms can be used to efficiently interrogate the genomes of large numbers of individuals. A critical issue for research and applications using HT...

    Authors: Felix Francis, Michael D. Dumas, Scott B. Davis and Randall J. Wisser

    Citation: BMC Bioinformatics 2018 19:302

    Published on:

  35. Content type: Software

    Here, we present an R package for entropy/variability analysis that facilitates prompt and convenient data extraction, manipulation and visualization of protein features from multiple sequence alignments. BALC...

    Authors: Alicja Płuciennik, Michał Stolarczyk, Maria Bzówka, Agata Raczyńska, Tomasz Magdziarz and Artur Góra

    Citation: BMC Bioinformatics 2018 19:300

    Published on:

  36. Content type: Software

    Taxonomic identification of plants and insects is a hard process that demands expert taxonomists and time, and it’s often difficult to distinguish on morphology only. DNA barcodes allow a rapid species discove...

    Authors: Renato Renison Moreira Oliveira, Gisele Lopes Nunes, Talvâne Glauber Lopes de Lima, Guilherme Oliveira and Ronnie Alves

    Citation: BMC Bioinformatics 2018 19:297

    Published on:

  37. Content type: Software

    Since circular RNAs (circRNAs) post-transcriptionally regulate gene expression, they have attracted increasing attention. However, there is no existing tool to annotate and extract spliced sequences for circRN...

    Authors: Shanliang Zhong, Jinyan Wang, Qian Zhang, Hanzi Xu and Jifeng Feng

    Citation: BMC Bioinformatics 2018 19:292

    Published on:

2017 Journal Metrics

  • Citation Impact
    2.213 - 2-year Impact Factor
    3.114 - 5-year Impact Factor
    0.878 - Source Normalized Impact per Paper (SNIP)
    1.479 - SCImago Journal Rank (SJR)


    Social Media Impact
    4446 mentions