Articles

1645 result(s) for 'natural language processing' within BMC Bioinformatics

Page 22 of 33

Social tagging in the life sciences: characterizing a new metadata resource for bioinformatics

Academic social tagging systems, such as Connotea and CiteULike, provide researchers with a means to organize personal collections of online references with keywords (tags) and to share these collections with ...

Authors: Benjamin M Good, Joseph T Tennis and Mark D Wilkinson

Citation: BMC Bioinformatics 2009 10:313

Content type: Research article Published on: 25 September 2009
- View Full Text
- View PDF
INFLECT: an R-package for cytometry cluster evaluation using marker modality

Current methods of high-dimensional unsupervised clustering of mass cytometry data lack means to monitor and evaluate clustering results. Whether unsupervised clustering is correct is typically evaluated by ag...

Authors: Jan Verhoeff, Sanne Abeln and Juan J. Garcia-Vallejo

Citation: BMC Bioinformatics 2022 23:487

Content type: Software Published on: 16 November 2022
- View Full Text
- View PDF
A parallel method for enumerating amino acid compositions and masses of all theoretical peptides

Enumeration of all theoretically possible amino acid compositions is an important problem in several proteomics workflows, including peptide mass fingerprinting, mass defect labeling, mass defect filtering, an...

Authors: Alexey V Nefedov and Rovshan G Sadygov

Citation: BMC Bioinformatics 2011 12:432

Content type: Software Published on: 7 November 2011
- View Full Text
- View PDF
Roast: a tool for reference-free optimization of supertranscriptome assemblies

Transcriptomic studies involving organisms for which reference genomes are not available typically start by generating de novo transcriptome or supertranscriptome assembly from the raw RNA-seq reads. Assemblin...

Authors: Madiha Shabbir and Aziz Mithani

Citation: BMC Bioinformatics 2024 25:2

Content type: Software Published on: 2 January 2024
- View Full Text
- View PDF
GLEANER: a web server for GermLine cycle Expression ANalysis and Epigenetic Roadmap visualization

Germline cells are important carriers of genetic and epigenetic information transmitted across generations in mammals. During the mammalian germline cell development cycle (i.e., the germline cycle), cell pote...

Authors: Shiyang Zeng, Yuwei Hua, Yong Zhang, Guifen Liu and Chengchen Zhao

Citation: BMC Bioinformatics 2021 22:289

Content type: Database Published on: 31 May 2021
- View Full Text
- View PDF
An eScience-Bayes strategy for analyzing omics data

The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is...

Authors: Martin Eklund, Ola Spjuth and Jarl ES Wikberg

Citation: BMC Bioinformatics 2010 11:282

Content type: Methodology article Published on: 26 May 2010
- View Full Text
- View PDF
SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases

Toward improved interoperability of distributed biological databases, an increasing number of datasets have been published in the standardized Resource Description Framework (RDF). Although the powerful SPARQL...

Authors: Hirokazu Chiba and Ikuo Uchiyama

Citation: BMC Bioinformatics 2017 18:93

Content type: Software Published on: 8 February 2017
- View Full Text
- View PDF
Dissecting trait heterogeneity: a comparison of three clustering methods applied to genotypic data

Trait heterogeneity, which exists when a trait has been defined with insufficient specificity such that it is actually two or more distinct traits, has been implicated as a confounding factor in traditional st...

Authors: Tricia A Thornton-Wells, Jason H Moore and Jonathan L Haines

Citation: BMC Bioinformatics 2006 7:204

Content type: Research article Published on: 12 April 2006
- View Full Text
- View PDF
CIPR: a web-based R/shiny app and R package to annotate cell clusters in single cell RNA sequencing experiments

Single cell RNA sequencing (scRNAseq) has provided invaluable insights into cellular heterogeneity and functional states in health and disease. During the analysis of scRNAseq data, annotating the biological i...

Authors: H. Atakan Ekiz, Christopher J. Conley, W. Zac Stephens and Ryan M. O’Connell

Citation: BMC Bioinformatics 2020 21:191

Content type: Software Published on: 15 May 2020
- View Full Text
- View PDF
ATGC transcriptomics: a web-based application to integrate, explore and analyze de novo transcriptomic data

In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a use...

Authors: Sergio Gonzalez, Bernardo Clavijo, Máximo Rivarola, Patricio Moreno, Paula Fernandez, Joaquín Dopazo and Norma Paniego

Citation: BMC Bioinformatics 2017 18:121

Content type: Software Published on: 22 February 2017
- View Full Text
- View PDF
Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb

A protein annotation database, such as the Universal Protein Resource knowledge base (UniProtKb), is a valuable resource for the validation and interpretation of predicted 3D structure patterns in proteins. Ex...

Authors: Kevin Nagel, Antonio Jimeno-Yepes and Dietrich Rebholz-Schuhmann

Citation: BMC Bioinformatics 2009 10(Suppl 8):S4

Content type: Research Published on: 27 August 2009

This article is part of a Supplement: Volume 10 Supplement 8
- View Full Text
- View PDF
The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

The Genia task, when it was introduced in 2009, was the first community-wide effort to address a fine-grained, structural information extraction from biomedical literature. Arranged for the second time as one ...

Authors: Jin-Dong Kim, Ngan Nguyen, Yue Wang, Jun'ichi Tsujii, Toshihisa Takagi and Akinori Yonezawa

Citation: BMC Bioinformatics 2012 13(Suppl 11):S1

Content type: Proceedings Published on: 26 June 2012

This article is part of a Supplement: Volume 13 Supplement 11
- View Full Text
- View PDF
PyConvU-Net: a lightweight and multiscale network for biomedical image segmentation

With the development of deep learning (DL), more and more methods based on deep learning are proposed and achieve state-of-the-art performance in biomedical image segmentation. However, these methods are usual...

Authors: Changyong Li, Yongxian Fan and Xiaodong Cai

Citation: BMC Bioinformatics 2021 22:14

Content type: Methodology article Published on: 7 January 2021
- View Full Text
- View PDF
Improving classification in protein structure databases using text mining

The classification of protein domains in the CATH resource is primarily based on structural comparisons, sequence similarity and manual analysis. One of the main bottlenecks in the processing of new entries is...

Authors: Antonis Koussounadis, Oliver C Redfern and David T Jones

Citation: BMC Bioinformatics 2009 10:129

Content type: Methodology article Published on: 5 May 2009
- View Full Text
- View PDF
CamurWeb: a classification software and a large knowledge base for gene expression data of cancer

The high growth of Next Generation Sequencing data currently demands new knowledge extraction methods. In particular, the RNA sequencing gene expression experimental technique stands out for case-control studi...

Authors: Emanuel Weitschek, Silvia Di Lauro, Eleonora Cappelli, Paola Bertolazzi and Giovanni Felici

Citation: BMC Bioinformatics 2018 19(Suppl 10):354

Content type: Research Published on: 15 October 2018

This article is part of a Supplement: Volume 19 Supplement 10
- View Full Text
- View PDF
Constraint Logic Programming approach to protein structure prediction

The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The prot...

Authors: Alessandro Dal Palù, Agostino Dovier and Federico Fogolari

Citation: BMC Bioinformatics 2004 5:186

Content type: Research article Published on: 30 November 2004
- View Full Text
- View PDF
PPAI: a web server for predicting protein-aptamer interactions

The interactions between proteins and aptamers are prevalent in organisms and play an important role in various life activities. Thanks to the rapid accumulation of protein-aptamer interaction data, it is nece...

Authors: Jianwei Li, Xiaoyu Ma, Xichuan Li and Junhua Gu

Citation: BMC Bioinformatics 2020 21:236

Content type: Methodology article Published on: 9 June 2020
- View Full Text
- View PDF
Vestige: Maximum likelihood phylogenetic footprinting

Phylogenetic footprinting is the identification of functional regions of DNA by their evolutionary conservation. This is achieved by comparing orthologous regions from multiple species and identifying the DNA ...

Authors: Matthew J Wakefield, Peter Maxwell and Gavin A Huttley

Citation: BMC Bioinformatics 2005 6:130

Content type: Software Published on: 29 May 2005
- View Full Text
- View PDF
iIL13Pred: improved prediction of IL-13 inducing peptides using popular machine learning classifiers

Inflammatory mediators play havoc in several diseases including the novel Coronavirus disease 2019 (COVID-19) and generally correlate with the severity of the disease. Interleukin-13 (IL-13), is a pleiotropic ...

Authors: Pooja Arora, Neha Periwal, Yash Goyal, Vikas Sood and Baljeet Kaur

Citation: BMC Bioinformatics 2023 24:141

Content type: Research Published on: 11 April 2023
- View Full Text
- View PDF
Attention mechanism enhanced LSTM with residual architecture and its application for protein-protein interaction residue pairs prediction

Recurrent neural network(RNN) is a good way to process sequential data, but the capability of RNN to compute long sequence data is inefficient. As a variant of RNN, long short term memory(LSTM) solved the prob...

Authors: Jiale Liu and Xinqi Gong

Citation: BMC Bioinformatics 2019 20:609

Content type: Methodology Article Published on: 27 November 2019
- View Full Text
- View PDF
AttCRISPR: a spacetime interpretable model for prediction of sgRNA on-target activity

More and more Cas9 variants with higher specificity are developed to avoid the off-target effect, which brings a significant volume of experimental data. Conventional machine learning performs poorly on these ...

Authors: Li-Ming Xiao, Yun-Qi Wan and Zhen-Ran Jiang

Citation: BMC Bioinformatics 2021 22:589

Content type: Research Published on: 13 December 2021
- View Full Text
- View PDF
Retrieval with gene queries

Accuracy of document retrieval from MEDLINE for gene queries is crucially important for many applications in bioinformatics. We explore five information retrieval-based methods to rank documents retrieved by P...

Authors: Aditya K Sehgal and Padmini Srinivasan

Citation: BMC Bioinformatics 2006 7:220

Content type: Research article Published on: 21 April 2006
- View Full Text
- View PDF
lifex-ep: a robust and efficient software for cardiac electrophysiology simulations

Simulating the cardiac function requires the numerical solution of multi-physics and multi-scale mathematical models. This underscores the need for streamlined, accurate, and high-performance computational too...

Authors: Pasquale Claudio Africa, Roberto Piersanti, Francesco Regazzoni, Michele Bucelli, Matteo Salvador, Marco Fedele, Stefano Pagani, Luca Dede’ and Alfio Quarteroni

Citation: BMC Bioinformatics 2023 24:389

Content type: Software Published on: 13 October 2023
- View Full Text
- View PDF
TCR-L: an analysis tool for evaluating the association between the T-cell receptor repertoire and clinical phenotypes

T cell receptors (TCRs) play critical roles in adaptive immune responses, and recent advances in genome technology have made it possible to examine the T cell receptor (TCR) repertoire at the individual sequen...

Authors: Meiling Liu, Juna Goo, Yang Liu, Wei Sun, Michael C. Wu, Li Hsu and Qianchuan He

Citation: BMC Bioinformatics 2022 23:152

Content type: Research Published on: 28 April 2022
- View Full Text
- View PDF
ISVASE: identification of sequence variant associated with splicing event using RNA-seq data

Exon recognition and splicing precisely and efficiently by spliceosome is the key to generate mature mRNAs. About one third or a half of disease-related mutations affect RNA splicing. Software PVAAS has been d...

Authors: Hasan Awad Aljohi, Wanfei Liu, Qiang Lin, Jun Yu and Songnian Hu

Citation: BMC Bioinformatics 2017 18:320

Content type: Software Published on: 28 June 2017
- View Full Text
- View PDF
PhyloNet: a software package for analyzing and reconstructing reticulate evolutionary relationships

Phylogenies, i.e., the evolutionary histories of groups of taxa, play a major role in representing the interrelationships among biological entities. Many software tools for reconstructing and evaluating such p...

Authors: Cuong Than, Derek Ruths and Luay Nakhleh

Citation: BMC Bioinformatics 2008 9:322

Content type: Software Published on: 28 July 2008
- View Full Text
- View PDF
Effective identification of varieties by nucleotide polymorphisms and its application for essentially derived variety identification in rice

Plant variety identification is the one most important of agricultural systems. Development of DNA marker profiles of released varieties to compare with candidate variety or future variety is required. However...

Authors: Xiong Yuan, Zirong Li, Liwen Xiong, Sufeng Song, Xingfei Zheng, Zhonghai Tang, Zheming Yuan and Lanzhi Li

Citation: BMC Bioinformatics 2022 23:30

Content type: Research Published on: 10 January 2022
- View Full Text
- View PDF
An improved clear cell renal cell carcinoma stage prediction model based on gene sets

Clear cell renal cell carcinoma (ccRCC) is the most common subtype of renal cell carcinoma and accounts for cancer-related deaths. Survival rates are very low when the tumor is discovered in the late-stage. Th...

Authors: Fangjun Li, Mu Yang, Yunhe Li, Mingqiang Zhang, Wenjuan Wang, Dongfeng Yuan and Dongqi Tang

Citation: BMC Bioinformatics 2020 21:232

Content type: Research article Published on: 8 June 2020
- View Full Text
- View PDF
PCirc: random forest-based plant circRNA identification software

Circular RNA (circRNA) is a novel type of RNA with a closed-loop structure. Increasing numbers of circRNAs are being identified in plants and animals, and recent studies have shown that circRNAs play an import...

Authors: Shuwei Yin, Xiao Tian, Jingjing Zhang, Peisen Sun and Guanglin Li

Citation: BMC Bioinformatics 2021 22:10

Content type: Software Published on: 6 January 2021
- View Full Text
- View PDF
Uncovering extensive post-translation regulation during human cell cycle progression by integrative multi-’omics analysis

Analysis of high-throughput multi-’omics interactions across the hierarchy of expression has wide interest in making inferences with regard to biological function and biomarker discovery. Expression levels acr...

Authors: Gregory M. Parkes and Mahesan Niranjan

Citation: BMC Bioinformatics 2019 20:536

Content type: Research Article Published on: 29 October 2019
- View Full Text
- View PDF
Gene function classification using Bayesian models with hierarchy-based priors

We investigate whether annotation of gene function can be improved using a classification scheme that is aware that functional classes are organized in a hierarchy. The classifiers look at phylogenic descripto...

Authors: Babak Shahbaba and Radford M Neal

Citation: BMC Bioinformatics 2006 7:448

Content type: Research article Published on: 12 October 2006
- View Full Text
- View PDF
MapGL: inferring evolutionary gain and loss of short genomic sequence features by phylogenetic maximum parsimony

Comparative genomics studies are growing in number partly because of their unique ability to provide insight into shared and divergent biology between species. Of particular interest is the use of phylogenetic...

Authors: Adam G. Diehl and Alan P. Boyle

Citation: BMC Bioinformatics 2020 21:416

Content type: Software Published on: 22 September 2020
- View Full Text
- View PDF
A comparison of the functional modules identified from time course and static PPI network data

Cellular systems are highly dynamic and responsive to cues from the environment. Cellular function and response patterns to external stimuli are regulated by biological networks. A protein-protein interaction ...

Authors: Xiwei Tang, Jianxin Wang, Binbin Liu, Min Li, Gang Chen and Yi Pan

Citation: BMC Bioinformatics 2011 12:339

Content type: Methodology article Published on: 15 August 2011
- View Full Text
- View PDF
Annotation and query of tissue microarray data using the NCI Thesaurus

The Stanford Tissue Microarray Database (TMAD) is a repository of data serving a consortium of pathologists and biomedical researchers. The tissue samples in TMAD are annotated with multiple free-text fields, ...

Authors: Nigam H Shah, Daniel L Rubin, Inigo Espinosa, Kelli Montgomery and Mark A Musen

Citation: BMC Bioinformatics 2007 8:296

Content type: Software Published on: 8 August 2007
- View Full Text
- View PDF
3D deep convolutional neural networks for amino acid environment similarity analysis

Central to protein biology is the understanding of how structural elements give rise to observed function. The surfeit of protein structural data enables development of computational methods to systematically ...

Authors: Wen Torng and Russ B. Altman

Citation: BMC Bioinformatics 2017 18:302

Content type: Methodology article Published on: 14 June 2017
- View Full Text
- View PDF
FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data

Data clustering analysis has been extensively applied to extract information from gene expression profiles obtained with DNA microarrays. To this aim, existing clustering approaches, mainly developed in comput...

Authors: Limin Fu and Enzo Medico

Citation: BMC Bioinformatics 2007 8:3

Content type: Methodology article Published on: 4 January 2007
- View Full Text
- View PDF
Bayesian statistical modelling of human protein interaction network incorporating protein disorder information

We present a statistical method of analysis of biological networks based on the exponential random graph model, namely p2-model, as opposed to previous descriptive approaches. The model is capable to capture g...

Authors: Svetlana Bulashevska, Alla Bulashevska and Roland Eils

Citation: BMC Bioinformatics 2010 11:46

Content type: Research article Published on: 25 January 2010
- View Full Text
- View PDF
Knowledge-enhanced biomedical named entity recognition and normalization: application to proteins and genes

Automated biomedical named entity recognition and normalization serves as the basis for many downstream applications in information management. However, this task is challenging due to name variations and enti...

Authors: Huiwei Zhou, Shixian Ning, Zhe Liu, Chengkun Lang, Zhuang Liu and Bizun Lei

Citation: BMC Bioinformatics 2020 21:35

Content type: Research article Published on: 30 January 2020
- View Full Text
- View PDF
An inferential framework for biological network hypothesis tests

Networks are ubiquitous in modern cell biology and physiology. A large literature exists for inferring/proposing biological pathways/networks using statistical or machine learning algorithms. Despite these adv...

Authors: Phillip D Yates and Nitai D Mukhopadhyay

Citation: BMC Bioinformatics 2013 14:94

Content type: Methodology article Published on: 14 March 2013
- View Full Text
- View PDF
Prediction of 8-state protein secondary structures by a novel deep learning architecture

Protein secondary structure can be regarded as an information bridge that links the primary sequence and tertiary structure. Accurate 8-state secondary structure prediction can significantly give more precise ...

Authors: Buzhong Zhang, Jinyan Li and Qiang Lü

Citation: BMC Bioinformatics 2018 19:293

Content type: Research article Published on: 3 August 2018
- View Full Text
- View PDF
Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and util...

Authors: JR Semeiks, A Rizki, MJ Bissell and IS Mian

Citation: BMC Bioinformatics 2006 7:147

Content type: Methodology article Published on: 16 March 2006
- View Full Text
- View PDF
Optimization algorithms for functional deimmunization of therapeutic proteins

To develop protein therapeutics from exogenous sources, it is necessary to mitigate the risks of eliciting an anti-biotherapeutic immune response. A key aspect of the response is the recognition and surface di...

Authors: Andrew S Parker, Wei Zheng, Karl E Griswold and Chris Bailey-Kellogg

Citation: BMC Bioinformatics 2010 11:180

Content type: Methodology article Published on: 9 April 2010
- View Full Text
- View PDF
Comparison of co-expression measures: mutual information, correlation, and model based indices

Co-expression measures are often used to define networks among genes. Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) corre...

Authors: Lin Song, Peter Langfelder and Steve Horvath

Citation: BMC Bioinformatics 2012 13:328

Content type: Research article Published on: 9 December 2012
- View Full Text
- View PDF
CITEViz: interactively classify cell populations in CITE-Seq via a flow cytometry-like gating workflow using R-Shiny

The rapid advancement of new genomic sequencing technology has enabled the development of multi-omic single-cell sequencing assays. These assays profile multiple modalities in the same cell and can often yield...

Authors: Garth L. Kong, Thai T. Nguyen, Wesley K. Rosales, Anjali D. Panikar, John H. W. Cheney, Theresa A. Lusardi, William M. Yashar, Brittany M. Curtiss, Sarah A. Carratt, Theodore P. Braun and Julia E. Maxson

Citation: BMC Bioinformatics 2024 25:142

Content type: Software Published on: 2 April 2024
- View Full Text
- View PDF
Metrics for GO based protein semantic similarity: a systematic evaluation

Several semantic similarity measures have been applied to gene products annotated with Gene Ontology terms, providing a basis for their functional comparison. However, it is still unclear which is the best app...

Authors: Catia Pesquita, Daniel Faria, Hugo Bastos, António EN Ferreira, André O Falcão and Francisco M Couto

Citation: BMC Bioinformatics 2008 9(Suppl 5):S4

Content type: Proceedings Published on: 29 April 2008

This article is part of a Supplement: Volume 9 Supplement 5
- View Full Text
- View PDF
SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data

Structural variations (SVs) refer to variations in an organism’s chromosome structure that exceed a length of 50 base pairs. They play a significant role in genetic diseases and evolutionary mechanisms. While ...

Authors: Yan Zheng and Xuequn Shang

Citation: BMC Bioinformatics 2023 24:213

Content type: Research Published on: 23 May 2023
- View Full Text
- View PDF
Structural alignment of protein descriptors – a combinatorial model

Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures...

Authors: Maciej Antczak, Marta Kasprzak, Piotr Lukasiak and Jacek Blazewicz

Citation: BMC Bioinformatics 2016 17:383

Content type: Methodology Article Published on: 17 September 2016
- View Full Text
- View PDF
Proceedings of the 16th Annual UT-KBRIN Bioinformatics Summit 2016: bioinformatics

Authors: Eric C. Rouchka, Julia H. Chariker, David A. Tieri, Juw Won Park, Shreedharkumar Rajurkar, Vikas Singh, Nishchal K. Verma, Yan Cui, Mark Farman, Bradford Condon, Neil Moore, Jerzy Jaromczyk, Jolanta Jaromczyk, Daniel Harris, Patrick Calie, Eun Kyong Shin…

Citation: BMC Bioinformatics 2017 18(Suppl 9):377

Content type: Meeting abstracts Published on: 13 October 2017

This article is part of a Supplement: Volume 18 Supplement 9

The Correction to this article has been published in BMC Bioinformatics 2017 18:490
- View Full Text
- View PDF
Moving forward through the in silico modeling of tuberculosis: a further step with UISS-TB

In 2018, about 10 million people were found infected by tuberculosis, with approximately 1.2 million deaths worldwide. Despite these numbers have been relatively stable in recent years, tuberculosis is still c...

Authors: Giulia Russo, Giuseppe Sgroi, Giuseppe Alessandro Parasiliti Palumbo, Marzio Pennisi, Miguel A. Juarez, Pere-Joan Cardona, Santo Motta, Kenneth B. Walker, Epifanio Fichera, Marco Viceconti and Francesco Pappalardo

Citation: BMC Bioinformatics 2020 21(Suppl 17):458

Content type: Research Published on: 14 December 2020

This article is part of a Supplement: Volume 21 Supplement 17
- View Full Text
- View PDF
EasyCGTree: a pipeline for prokaryotic phylogenomic analysis based on core gene sets

Genome-scale phylogenetic analysis based on core gene sets is routinely used in microbiological research. However, the techniques are still not approachable for individuals with little bioinformatics experienc...

Authors: Dao-Feng Zhang, Wei He, Zongze Shao, Iftikhar Ahmed, Yuqin Zhang, Wen-Jun Li and Zhe Zhao

Citation: BMC Bioinformatics 2023 24:390

Content type: Software Published on: 14 October 2023
- View Full Text
- View PDF

How was your experience today?

Rating Please select one rating

Awful

Bad

Good

Great

Thank you for your feedback.

Tell us why (opens in a new tab)

Featured videos

View featured videos from across the BMC-series journals

Articles

1645 result(s) for 'natural language processing' within BMC Bioinformatics

Featured videos

Important information

Annual Journal Metrics

Follow

BMC Bioinformatics

Contact us