Articles

1643 result(s) for 'natural language processing' within BMC Bioinformatics

  1. As in many different areas of science and technology, most important problems in bioinformatics rely on the proper development and assessment of binary classifiers. A generalized assessment of the performance ...

    Authors: Ismael A Vergara, Tomás Norambuena, Evandro Ferrada, Alex W Slater and Francisco Melo
    Citation: BMC Bioinformatics 2008 9:265
  2. Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Ther...

    Authors: Zhang Zhang, Jun Li, Peng Cui, Feng Ding, Ang Li, Jeffrey P Townsend and Jun Yu
    Citation: BMC Bioinformatics 2012 13:43
  3. Mass spectrometry (MS) has become a promising analytical technique to acquire proteomics information for the characterization of biological samples. Nevertheless, most studies focus on the final proteins ident...

    Authors: Shisheng Wang, Hongwen Zhu, Hu Zhou, Jingqiu Cheng and Hao Yang
    Citation: BMC Bioinformatics 2020 21:439
  4. Biological networks play an increasingly important role in the exploration of functional modularity and cellular organization at a systemic level. Quite often the first tools used to analyze these networks are cl...

    Authors: Marco Pellegrini, Miriam Baglioni and Filippo Geraci
    Citation: BMC Bioinformatics 2016 17(Suppl 12):372

    This article is part of a Supplement: Volume 17 Supplement 12

  5. Gallbladder carcinoma (GBC), an aggressive malignant tumor of the biliary system, is characterized by high cellular heterogeneity and poor prognosis. Fewer data have been reported in GBC than other common canc...

    Authors: Li Guo, Yangyang Xiang, Yuyang Dou, Zibo Yin, Xinru Xu, Lihua Tang, Jiafeng Yu, Jun Wang and Tingming Liang
    Citation: BMC Bioinformatics 2023 24:12
  6. We present a new iterative algorithm for the molecular distance geometry problem with inaccurate and sparse data, which is based on the solution of linear systems, maximum cliques, and a minimization of nonlin...

    Authors: Michael Souza, Carlile Lavor, Albert Muritiba and Nelson Maculan
    Citation: BMC Bioinformatics 2013 14(Suppl 9):S7

    This article is part of a Supplement: Volume 14 Supplement 9

  7. Long reads have gained popularity in the analysis of metagenomics data. Therefore, we comprehensively assessed metagenomics classification tools on the species taxonomic level. We analysed kmer-based tools, ma...

    Authors: Josip Marić, Krešimir Križanović, Sylvain Riondet, Niranjan Nagarajan and Mile Šikić
    Citation: BMC Bioinformatics 2024 25:15
  8. Cancer stem cell theory suggests that cancers are derived by a population of cells named Cancer Stem Cells (CSCs) that are involved in the growth and in the progression of tumors, and lead to a hierarchical st...

    Authors: Francesca Cordero, Marco Beccuti, Chiara Fornari, Stefania Lanzardo, Laura Conti, Federica Cavallo, Gianfranco Balbo and Raffaele Calogero
    Citation: BMC Bioinformatics 2013 14(Suppl 6):S11

    This article is part of a Supplement: Volume 14 Supplement 6

  9. Despite their involvement in the regulation of gene expression and their importance as genomic markers for promoter prediction, no objective standard exists for defining CpG islands (CGIs), since all current a...

    Authors: Michael Hackenberg, Christopher Previti, Pedro Luis Luque-Escamilla, Pedro Carpena, José Martínez-Aroza and José L Oliver
    Citation: BMC Bioinformatics 2006 7:446
  10. Drug pharmacokinetics parameters, drug interaction parameters, and pharmacogenetics data have been unevenly collected in different databases and published extensively in the literature. Without appropriate pha...

    Authors: Heng-Yi Wu, Shreyas Karnik, Abhinita Subhadarshini, Zhiping Wang, Santosh Philips, Xu Han, Chienwei Chiang, Lei Liu, Malaz Boustani, Luis M Rocha, Sara K Quinney, David Flockhart and Lang Li
    Citation: BMC Bioinformatics 2013 14:35
  11. Gene set analysis (GSA) is a widely used strategy for gene expression data analysis based on pathway knowledge. GSA focuses on sets of related genes and has established major advantages over individual gene an...

    Authors: Weijun Luo, Michael S Friedman, Kerby Shedden, Kurt D Hankenson and Peter J Woolf
    Citation: BMC Bioinformatics 2009 10:161
  12. Population structure and cryptic relatedness between individuals (samples) are two major factors affecting false positives in genome-wide association studies (GWAS). In addition, population stratification and ...

    Authors: Elena Solovieva and Hiroaki Sakai
    Citation: BMC Bioinformatics 2023 24:135
  13. Integral membrane proteins constitute about 20–30% of all proteins in the fully sequenced genomes. They come in two structural classes, the α-helical and the β-barrel membrane proteins, demonstrating different...

    Authors: Pantelis G Bagos, Theodore D Liakopoulos, Ioannis C Spyropoulos and Stavros J Hamodrakas
    Citation: BMC Bioinformatics 2004 5:29
  14. The amount of scientific information about MicroRNAs (miRNAs) is growing exponentially, making it difficult for researchers to interpret experimental results. In this study, we present an automated text mining...

    Authors: Sujoy Roy, Brandon C. Curry, Behrouz Madahian and Ramin Homayouni
    Citation: BMC Bioinformatics 2016 17(Suppl 13):350

    This article is part of a Supplement: Volume 17 Supplement 13

  15. N6-methyladenosine (m6A) is a common and abundant RNA methylation modification found in various species. As a type of post-transcriptional methylation, m6A plays an important role in diverse RNA activities such...

    Authors: Yiqian Zhang and Michiaki Hamada
    Citation: BMC Bioinformatics 2018 19(Suppl 19):524

    This article is part of a Supplement: Volume 19 Supplement 19

  16. In the biomedical domain, the desired information of a question (query) asked by biologists usually is a list of a certain type of entities covering different aspects that are related to the question, such as ...

    Authors: Xiaoshi Yin, Zhoujun Li, Jimmy Xiangji Huang and Xiaohua Hu
    Citation: BMC Bioinformatics 2011 12(Suppl 5):S8

    This article is part of a Supplement: Volume 12 Supplement 5

  17. Amyloid fibrillar aggregates of proteins or polypeptides are known to be associated with many human diseases. Recent studies suggest that short protein regions trigger this aggregation. Thus, identifying these...

    Authors: Jian Tian, Ningfeng Wu, Jun Guo and Yunliu Fan
    Citation: BMC Bioinformatics 2009 10(Suppl 1):S45

    This article is part of a Supplement: Volume 10 Supplement 1

  18. Medication recommendation based on electronic medical record (EMR) is a research hot spot in smart healthcare. For developing computational medication recommendation methods based on EMR, an important challeng...

    Authors: Shaofu Lin, Mengzhen Wang, Chengyu Shi, Zhe Xu, Lihong Chen, Qingcai Gao and Jianhui Chen
    Citation: BMC Bioinformatics 2022 23:552
  19. Upland cotton provides the most natural fiber in the world. During fiber development, the quality and yield of fiber were influenced by gene transcription. Revealing sequence features related to transcription ...

    Authors: Shang Liu, Hailiang Cheng, Javaria Ashraf, Youping Zhang, Qiaolian Wang, Limin Lv, Man He, Guoli Song and Dongyun Zuo
    Citation: BMC Bioinformatics 2022 23:91
  20. The malaria risk prediction is currently limited to using advanced statistical methods, such as time series and cluster analysis on epidemiological data. Nevertheless, machine learning models have been explore...

    Authors: Kah Yee Tai, Jasbir Dhaliwal and KokSheik Wong
    Citation: BMC Bioinformatics 2022 23:325
  21. This paper is devoted to distance measures for leaf-labelled trees on free leafset. A leaf-labelled tree is a data structure which is a special type of a tree where only leaves (terminal) nodes are labelled. T...

    Authors: Jakub Koperwas and Krzysztof Walczak
    Citation: BMC Bioinformatics 2011 12:204
  22. Large sequence datasets are difficult to visualize and handle. Additionally, they often do not represent a random subset of the natural diversity, but the result of uncoordinated and convenience sampling. Cons...

    Authors: Fabrizio Menardo, Chloé Loiseau, Daniela Brites, Mireia Coscolla, Sebastian M. Gygli, Liliana K. Rutaihwa, Andrej Trauner, Christian Beisel, Sonia Borrell and Sebastien Gagneux
    Citation: BMC Bioinformatics 2018 19:164
  23. Strongly multicollinear covariates, such as those typically represented in metabolomics applications, represent a challenge for multivariate regression analysis. These challenges are commonly circumvented by r...

    Authors: Tim U. H. Baumeister, Eivind Aadland, Roger G. Linington and Olav M. Kvalheim
    Citation: BMC Bioinformatics 2024 25:51
  24. The Stochastic Process Model (SPM) represents a general framework for modeling the joint evolution of repeatedly measured variables and time-to-event outcomes observed in longitudinal studies, i.e., SPM relate...

    Authors: Ilya Y. Zhbannikov, Konstantin Arbeev, Igor Akushevich, Eric Stallard and Anatoliy I. Yashin
    Citation: BMC Bioinformatics 2017 18:125
  25. It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of vers...

    Authors: Xia Li, Shaoqi Rao, Wei Jiang, Chuanxing Li, Yun Xiao, Zheng Guo, Qingpu Zhang, Lihong Wang, Lei Du, Jing Li, Li Li, Tianwen Zhang and Qing K Wang
    Citation: BMC Bioinformatics 2006 7:26
  26. Data from discovery proteomic and phosphoproteomic experiments typically include missing values that correspond to proteins that have not been identified in the analyzed sample. Replacing the missing values wi...

    Authors: Matúš Medo, Daniel M. Aebersold and Michaela Medová
    Citation: BMC Bioinformatics 2019 20:563
  27. Since traditional drug research and development is often time-consuming and high-risk, there is an increasing interest in establishing new medical indications for approved drugs, referred to as drug reposition...

    Authors: Hui Liu, Yinglong Song, Jihong Guan, Libo Luo and Ziheng Zhuang
    Citation: BMC Bioinformatics 2016 17(Suppl 17):539

    This article is part of a Supplement: Volume 17 Supplement 17

  28. High-throughput real-time quantitative reverse transcriptase polymerase chain reaction (qPCR) is a widely used technique in experiments where expression patterns of genes are to be profiled. Current stage tech...

    Authors: Jessica C Mar, Yasumasa Kimura, Kate Schroder, Katharine M Irvine, Yoshihide Hayashizaki, Harukazu Suzuki, David Hume and John Quackenbush
    Citation: BMC Bioinformatics 2009 10:110
  29. Flow cytometry (FCM) is a powerful single-cell based measurement method to ascertain multidimensional optical properties of millions of cells. FCM is widely used in medical diagnostics and health research. The...

    Authors: Joachim Ludwig, Christian Höner zu Siederdissen, Zishu Liu, Peter F. Stadler and Susann Müller
    Citation: BMC Bioinformatics 2019 20:643
  30. Boolean networks (BNs) provide an effective modelling formalism for various complex biochemical phenomena. Their long term behaviour is represented by attractors–subsets of the state space towards which the BN...

    Authors: Nikola Beneš, Luboš Brim, Jakub Kadlecaj, Samuel Pastva and David Šafránek
    Citation: BMC Bioinformatics 2022 23:173
  31. Stomach adenocarcinoma (STAD) is a common malignant tumor in the world and its prognosis is poor. miRNA plays a role mainly by influencing the expression of mRNAs, and participates in the occurrence and develo...

    Authors: Hao Qian, Nanxue Cui, Qiao Zhou and Shihai Zhang
    Citation: BMC Bioinformatics 2022 23:181
  32. Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been les...

    Authors: Antonio Jose Jimeno Yepes, Laura Plaza, Jorge Carrillo-de-Albornoz, James G Mork and Alan R Aronson
    Citation: BMC Bioinformatics 2015 16:113
  33. Drug–drug interactions (DDIs) occur when two or more drugs are taken simultaneously or successively. Early detection of adverse drug interactions can be essential in preventing medical errors and reducing heal...

    Authors: Dingkai Huang, Hongjian He, Jiaming Ouyang, Chang Zhao, Xin Dong and Jiang Xie
    Citation: BMC Bioinformatics 2022 23:561
  34. Phylogenetic methods are well-established bioinformatic tools for sequence analysis, allowing to describe the non-independencies of sequences because of their common ancestor. However, the evolutionary profile...

    Authors: Matteo Brilli, Alessio Mengoni, Marco Fondi, Marco Bazzicalupo, Pietro Liò and Renato Fani
    Citation: BMC Bioinformatics 2008 9:551
  35. Simulation of genetic variants data is frequently required for the evaluation of statistical methods in the fields of human and animal genetics. Although a number of high-quality genetic simulators have been d...

    Authors: Apostolos Dimitromanolakis, Jingxiong Xu, Agnieszka Krol and Laurent Briollais
    Citation: BMC Bioinformatics 2019 20:26
  36. Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, ex...

    Authors: Fei He, Jingyi Li, Rui Wang, Xiaowei Zhao and Ye Han
    Citation: BMC Bioinformatics 2021 22:519
