Sequence analysis (methods)

Section edited by Olivier Poch

This section incorporates all aspects of sequence analysis methodology, including but not limited to: sequence alignment algorithms, discrete algorithms, phylogeny algorithms, gene prediction and sequence clustering methods.

  1. Content type: Methodology article

    Functional antibody genes are often assembled by VDJ recombination and then diversified by somatic hypermutation. Identifying the combination of sourcing germline genes is critical to understand the process of...

    Authors: Qingchen Zhang, Lu Zhang, Chen Zhou, Yiyan Yang, Zuojing Yin, Dingfeng Wu, Kailin Tang and Zhiwei Cao

    Citation: BMC Bioinformatics 2019 20:137

  2. Content type: Research article

    As an important type of post-translational modification (PTM), protein glycosylation plays a crucial role in protein stability and protein function. The abundance and ubiquity of protein glycosylation across t...

    Authors: Fuyi Li, Yang Zhang, Anthony W. Purcell, Geoffrey I. Webb, Kuo-Chen Chou, Trevor Lithgow, Chen Li and Jiangning Song

    Citation: BMC Bioinformatics 2019 20:112

  3. Content type: Methodology article

    Group structures among genes encoded in functional relationships or biological pathways are valuable and unique features in large-scale molecular data for survival analysis. However, most of previous approache...

    Authors: Zaixiang Tang, Shufeng Lei, Xinyan Zhang, Zixuan Yi, Boyi Guo, Jake Y. Chen, Yueping Shen and Nengjun Yi

    Citation: BMC Bioinformatics 2019 20:94

  4. Content type: Research article

    The development of sequencing techniques and statistical methods provides great opportunities for identifying the impact of rare genetic variation on complex traits. However, there is a lack of knowledge on th...

    Authors: Xinyuan Zhang, Anna O. Basile, Sarah A. Pendergrass and Marylyn D. Ritchie

    Citation: BMC Bioinformatics 2019 20:46

  5. Content type: Research article

    The analysis of single-cell RNA sequencing (scRNAseq) data plays an important role in understanding the intrinsic and extrinsic cellular processes in biological and biomedical research. One significant effort ...

    Authors: Tianyu Wang, Boyang Li, Craig E. Nelson and Sheida Nabavi

    Citation: BMC Bioinformatics 2019 20:40

  6. Content type: Software

    Single-cell sequencing experiments use short DNA barcode ‘tags’ to identify reads that originate from the same cell. In order to recover single-cell information from such experiments, reads must be grouped bas...

    Authors: Akshay Tambe and Lior Pachter

    Citation: BMC Bioinformatics 2019 20:32

  7. Content type: Methodology article

    Skewed X chromosome inactivation (XCI), which is a non-random process, is frequently observed in both healthy and affected females. Furthermore, skewed XCI has been reported to be related to many X-linked dise...

    Authors: Peng Wang, Yu Zhang, Bei-Qi Wang, Jian-Long Li, Yi-Xin Wang, Dongdong Pan, Xian-Bo Wu, Wing Kam Fung and Ji-Yuan Zhou

    Citation: BMC Bioinformatics 2019 20:11

  8. Content type: Methodology article

    DNA methylation of CpG dinucleotides is an essential epigenetic modification that plays a key role in transcription. Widely used DNA enrichment-based methods offer high coverage for measuring methylated CpG di...

    Authors: Jingting Xu, Shimeng Liu, Ping Yin, Serdar Bulun and Yang Dai

    Citation: BMC Bioinformatics 2018 19:540

  9. Content type: Research article

    The deployment of Genome-wide association studies (GWASs) requires genomic information of a large population to produce reliable results. This raises significant privacy concerns, making people hesitate to con...

    Authors: Charlotte Bonte, Eleftheria Makri, Amin Ardeshirdavani, Jaak Simm, Yves Moreau and Frederik Vercauteren

    Citation: BMC Bioinformatics 2018 19:537

  10. Content type: Methodology article

    Researchers typically sequence a given individual multiple times, either re-sequencing the same DNA sample (technical replication) or sequencing different DNA samples collected on the same individual (biologic...

    Authors: Ariel W. Chan, Amy L. Williams and Jean-Luc Jannink

    Citation: BMC Bioinformatics 2018 19:478

  11. Content type: Software

    Targeted resequencing has become the most used and cost-effective approach for identifying causative mutations of Mendelian diseases both for diagnostics and research purposes. Due to very rapid technological ...

    Authors: F. Musacchia, A. Ciolfi, M. Mutarelli, A. Bruselles, R. Castello, M. Pinelli, S. Basu, S. Banfi, G. Casari, M. Tartaglia and V. Nigro

    Citation: BMC Bioinformatics 2018 19:477

  12. Content type: Software

    Mouse xenografts from (patient-derived) tumors (PDX) or tumor cell lines are widely used as models to study various biological and preclinical aspects of cancer. However, analyses of their RNA and DNA profiles...

    Authors: Roelof J. C. Kluin, Kristel Kemper, Thomas Kuilman, Julian R. de Ruiter, Vivek Iyer, Josep V. Forment, Paulien Cornelissen-Steijger, Iris de Rink, Petra ter Brugge, Ji-Ying Song, Sjoerd Klarenbeek, Ultan McDermott, Jos Jonkers, Arno Velds, David J. Adams, Daniel S. Peeper…

    Citation: BMC Bioinformatics 2018 19:366

  13. Content type: Research article

    Conventional methods of motor imagery brain computer interfaces (MI-BCIs) suffer from the limited number of samples and simplified features, so as to produce poor performances with spatial-frequency features a...

    Authors: Tian-jian Luo, Chang-le Zhou and Fei Chao

    Citation: BMC Bioinformatics 2018 19:344

  14. Content type: Methodology article

    Targeted amplicon sequencing of the 16S ribosomal RNA gene is one of the key tools for studying microbial diversity. The accuracy of this approach strongly depends on the choice of primer pairs and, in particu...

    Authors: Francesco Sambo, Francesca Finotello, Enrico Lavezzo, Giacomo Baruzzo, Giulia Masi, Elektra Peta, Marco Falda, Stefano Toppo, Luisa Barzon and Barbara Di Camillo

    Citation: BMC Bioinformatics 2018 19:343

  15. Content type: Methodology article

    Identification of homologous genes is fundamental to comparative genomics, functional genomics and phylogenomics. Extensive public homology databases are of great value for investigating homology but need to b...

    Authors: Siavash Sheikhizadeh Anari, Dick de Ridder, M. Eric Schranz and Sandra Smit

    Citation: BMC Bioinformatics 2018 19:340

  16. Content type: Research article

    Detection of highly divergent or yet unknown viruses from metagenomics sequencing datasets is a major bioinformatics challenge. When human samples are sequenced, a large proportion of assembled contigs are cla...

    Authors: Zurab Bzhalava, Ardi Tampuu, Piotr Bała, Raul Vicente and Joakim Dillner

    Citation: BMC Bioinformatics 2018 19:336

  17. Content type: Methodology article

    The development of a disease is a complex process that may result from joint effects of multiple genes. In this article, we propose the overlapping group screening (OGS) approach to determining active genes an...

    Authors: Jie-Huei Wang and Yi-Hau Chen

    Citation: BMC Bioinformatics 2018 19:335

  18. Content type: Software

    The automated prediction of the enzymatic functions of uncharacterized proteins is a crucial topic in bioinformatics. Although several methods and tools have been proposed to classify enzymes, most of these st...

    Authors: Alperen Dalkiran, Ahmet Sureyya Rifaioglu, Maria Jesus Martin, Rengul Cetin-Atalay, Volkan Atalay and Tunca Doğan

    Citation: BMC Bioinformatics 2018 19:334

  19. Content type: Methodology article

    Sequence alignment is crucial in genomics studies. However, optimal multiple sequence alignment (MSA) is NP-hard. Thus, modern MSA methods employ progressive heuristics, breaking the problem into a series of p...

    Authors: Massimo Maiolo, Xiaolei Zhang, Manuel Gil and Maria Anisimova

    Citation: BMC Bioinformatics 2018 19:331

  20. Content type: Methodology article

    Conventional phylogenetic clustering approaches rely on arbitrary cutpoints applied a posteriori to phylogenetic estimates. Although in practice, Bayesian and bootstrap-based clustering tend to lead to similar...

    Authors: Luc Villandré, Aurélie Labbe, Bluma Brenner, Michel Roger and David A Stephens

    Citation: BMC Bioinformatics 2018 19:324

  21. Content type: Methodology article

    Normalization is essential to ensure accurate analysis and proper interpretation of sequencing data, and chromosome conformation capture data such as Hi-C have particular challenges. Although several methods h...

    Authors: Nicolas Servant, Nelle Varoquaux, Edith Heard, Emmanuel Barillot and Jean-Philippe Vert

    Citation: BMC Bioinformatics 2018 19:313

  22. Content type: Research article

    Viral infection by dengue virus is a major public health problem in tropical countries. Early diagnosis and detection are increasingly based on quantitative reverse transcriptase real-time polymerase chain rea...

    Authors: Kevin Vanneste, Linda Garlant, Sylvia Broeders, Steven Van Gucht and Nancy H. Roosens

    Citation: BMC Bioinformatics 2018 19:312

  23. Content type: Methodology article

    Targeted resequencing with high-throughput sequencing (HTS) platforms can be used to efficiently interrogate the genomes of large numbers of individuals. A critical issue for research and applications using HT...

    Authors: Felix Francis, Michael D. Dumas, Scott B. Davis and Randall J. Wisser

    Citation: BMC Bioinformatics 2018 19:302

  24. Content type: Software

    Here, we present an R package for entropy/variability analysis that facilitates prompt and convenient data extraction, manipulation and visualization of protein features from multiple sequence alignments. BALC...

    Authors: Alicja Płuciennik, Michał Stolarczyk, Maria Bzówka, Agata Raczyńska, Tomasz Magdziarz and Artur Góra

    Citation: BMC Bioinformatics 2018 19:300

  25. Content type: Software

    Taxonomic identification of plants and insects is a hard process that demands expert taxonomists and time, and it’s often difficult to distinguish on morphology only. DNA barcodes allow a rapid species discove...

    Authors: Renato Renison Moreira Oliveira, Gisele Lopes Nunes, Talvâne Glauber Lopes de Lima, Guilherme Oliveira and Ronnie Alves

    Citation: BMC Bioinformatics 2018 19:297

  26. Content type: Software

    Since circular RNAs (circRNAs) post-transcriptionally regulate gene expression, they have attracted increasing attention. However, there is no existing tool to annotate and extract spliced sequences for circRN...

    Authors: Shanliang Zhong, Jinyan Wang, Qian Zhang, Hanzi Xu and Jifeng Feng

    Citation: BMC Bioinformatics 2018 19:292

  27. Content type: Software

    miRNAs play important roles in the regulation of gene expression. The rapidly developing field of microRNA sequencing (miRNA-seq; small RNA-seq) needs comprehensive, robust, user-friendly and standardized bioi...

    Authors: Yin Lu, Alexander S. Baras and Marc K. Halushka

    Citation: BMC Bioinformatics 2018 19:275

  28. Content type: Methodology article

    Tandem repeats comprise a significant proportion of the human genome including coding and regulatory regions. They are highly prone to repeat number variation and nucleotide mutation due to their repetitive and ...

    Authors: Devika Ganesamoorthy, Minh Duc Cao, Tania Duarte, Wenhan Chen and Lachlan Coin

    Citation: BMC Bioinformatics 2018 19:267

  29. Content type: Methodology article

    Detection of genomic inversions remains challenging. Many existing methods primarily target inversions with a non-repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombinat...

    Authors: Haojing Shao, Devika Ganesamoorthy, Tania Duarte, Minh Duc Cao, Clive J. Hoggart and Lachlan J. M. Coin

    Citation: BMC Bioinformatics 2018 19:261

  30. Content type: Methodology article

    Current high-throughput sequencing platforms provide capacity to sequence multiple samples in parallel. Different samples are labeled by attaching a short sample specific nucleotide sequence, barcode, to each ...

    Authors: Panu Somervuo, Patrik Koskinen, Peng Mei, Liisa Holm, Petri Auvinen and Lars Paulin

    Citation: BMC Bioinformatics 2018 19:257

  31. Content type: Methodology article

    Haplotype assembly is the process of assigning the different alleles of the variants covered by mapped sequencing reads to the two haplotypes of the genome of a human individual. Long reads, which are nowadays...

    Authors: Stefano Beretta, Murray D. Patterson, Simone Zaccaria, Gianluca Della Vedova and Paola Bonizzoni

    Citation: BMC Bioinformatics 2018 19:252

  32. Content type: Software

    The advent of next-generation sequencing (NGS) has made whole-genome sequencing of cohorts of individuals a reality. Primary datasets of raw or aligned reads of this sort can get very large. For scientific que...

    Authors: Kristiina Ausmees, Aji John, Salman Z. Toor, Andreas Hellander and Carl Nettelblad

    Citation: BMC Bioinformatics 2018 19:240

  33. Content type: Research article

    The adaptive immune response intrinsically depends on hypervariable human leukocyte antigen (HLA) genes. Concomitantly, correct HLA phenotyping is crucial for successful donor-patient matching in organ transpl...

    Authors: Maria Luisa Matey-Hernandez, Søren Brunak and Jose M. G. Izarzugaza

    Citation: BMC Bioinformatics 2018 19:239

  34. Content type: Methodology article

    Current normalization methods for RNA-sequencing data allow either for intersample comparison to identify differentially expressed (DE) genes or for intrasample comparison for the discovery and validation of g...

    Authors: Marcel Smid, Robert R. J. Coebergh van den Braak, Harmen J. G. van de Werken, Job van Riet, Anne van Galen, Vanja de Weerd, Michelle van der Vlugt-Daane, Sandra I. Bril, Zarina S. Lalmahomed, Wigard P. Kloosterman, Saskia M. Wilting, John A. Foekens, Jan N. M. IJzermans, John W. M. Martens and Anieta M. Sieuwerts

    Citation: BMC Bioinformatics 2018 19:236

  35. Content type: Methodology article

    Spastic cerebral palsy (CP) is a leading cause of physical disability. Most people with spastic CP are born with it, but early diagnosis is challenging, and no current biomarker platform readily identifies aff...

    Authors: Erin L. Crowgey, Adam G. Marsh, Karyn G. Robinson, Stephanie K. Yeager and Robert E. Akins

    Citation: BMC Bioinformatics 2018 19:225
