Featured videos
View featured videos from across the BMC-series journals
Page 1 of 34
Here, we introduce an algorithm (new method) based on ideas from the field of natural language processing (NLP) to solve this problem. We...
During drug development, it is essential to gather information about the change of clinical exposure of a drug (object) due to the pharmacokinetic (PK) drug-drug interactions (DDIs) with another drug (precipitant...
Manual curation of biological databases, an expensive and labor-intensive process, is essential for high quality integrated data ... the implementation of a state-of-the-art Natural Language Processing system tha...
Despite increasing interest in applying Natural Language Processing (NLP) to biomedical text, whether this...
Natural Language Processing (NLP) has been shown effective to...
The Enteropathogen Resource Integration Center (ERIC; http://www.ericbrc.org) has a goal of providing bioinformatics support for the scientific communi...
Extracting information from free texts using natural language processing (NLP) can save time and reduce...
In this work, we propose a novel pipeline for KEGG orthology annotation of bacterial protein sequences that uses natural language processing and deep learning. To assess the effectiveness...
Thousands of genes have been associated with different Mendelian conditions. One of the valuable sources to track these gene-disease associations (GDAs) is the Online Mendelian Inheritance in Man (OMIM) database....
The abundance of biomedical text data coupled with advances in natural language processing (NLP) is resulting in novel biomedical ... are reliant on the availability of domain-specific language models (LMs) that ...
Many biomedical natural language processing systems demonstrated large differences between their previously...
We present an extension of the TM tool, which utilizes natural language processing (NLP) for analyzing the context of ... the global low-resolution docking scan was post-processed, separately, by constraints from...
Identifying local recurrences in breast cancer from patient data sets is important for clinical research and practice. Developing a model using natural language processing and machine learning to identify local r...
We report the development and evaluation of Microbial Phenomics Information Extractor (MicroPIE, version 0.1.0). MicroPIE is a natural language processing application that uses a robust supervised classification....
Natural language processing (NLP) applications are increasingly important in...
Interest is growing in the application of syntactic parsers to natural language processing problems in biology, but assessing their performance...
Applications of Natural Language Processing (NLP) technology to biomedical texts have...subdomain variation within the biomedical domain, i.e., the extent to which different subject areas of biomedicine are chara...
Pre-trained natural language processing models on a large natural language corpus can naturally transfer learned knowledge to ... , few studies focused on enriching such protein language models by jointly learnin...
The growing recognition of the microbiome’s impact on human health and well-being has prompted extensive research into discovering the links between microbiome dysbiosis and disease (healthy) states. However, thi...
Longitudinal data on key cancer outcomes for clinical research, such as response to treatment and disease progression, are not captured in standard cancer registry reporting. Manual extraction of such outcomes fr...
Metabolic flux analysis has become an established method in systems biology and functional genomics. The most common approach for determining intracellular metabolic fluxes is to utilize mass spectrometry in c...
Protein S-nitrosylation (SNO) plays a key role in transferring nitric oxide-mediated signals in both animals and plants and has emerged as an important mechanism for regulating protein functions and cell signa...
Natural proteins occupy a small portion of the protein sequence space, whereas artificial proteins can explore a wider range of possibilities within the sequence space. However, specific requirements may not b...
Large Language Models (LLMs) like Generative Pre-trained Transformer (GPT) from OpenAI and LLaMA (Large Language Model Meta AI) from Meta AI are increasingly recognized for their potential in the field of chem...
It has been suggested previously that genome and proteome sequences show characteristics typical of natural-language texts such as "signature-style" word ... , and that the algorithms originally developed for natural
Many efforts exist to design and implement approaches and tools for data capture, integration and analysis in the life sciences. Challenges are not only the heterogeneity, size and distribution of information ...
Gas chromatography–mass spectrometry (GC-MS) is a technique frequently used in targeted and non-targeted measurements of metabolites. Most existing software tools for processing of raw instrument GC-MS data ti...
Monoclonal antibodies (mAbs) have been used as therapeutic agents, which must overcome many developability issues after the discovery from in vitro display libraries. Especially, polyreactive mAbs can strongly...
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis...
The recognition of pharmacological substances, compounds and proteins is essential for biomedical relation extraction, knowledge graph construction, drug discovery, as well as medical question answering. Although...
We introduced a novel way to represent protein sequences as continuous vectors (embeddings) by using the language model ELMo taken from natural language processing. By modeling protein sequences, ELMo effectively...
Determining a protein’s quaternary state, i.e. the number of monomers in a functional unit, is a critical step in protein characterization. Many proteins form multimers for their activity, and over 50% are estima...
The annotation of protein sequences in public databases has long posed a challenge in molecular biology. This issue is particularly acute for viral proteins, which demonstrate limited homology to known protein...
Transformer-based large language models (LLMs) are very suited for biological sequence data, because of analogies to natural language. Complex relationships can be learned, because a concept of "words" can be ...
Decisions in healthcare usually rely on the goodness and completeness of data that could be coupled with heuristics to improve the decision process itself. However, this is often an incomplete process. Structured...
Liquid chromatography coupled to mass spectrometry (LC/MS) has been widely used in proteomics and metabolomics research. In this context, the technology has been increasingly used for differential profiling, i...
Information extraction (IE) efforts are widely acknowledged to be important in harnessing the rapid advance of biomedical knowledge, particularly in areas where important factual information is published in a ...
The Biology System Description Language (BiSDL) is an accessible, easy-to-use computational language for multicellular synthetic biology. It allows synthetic biologists to represent spatiality and multi-level ...
Biomedical question answering (QA) is a sub-task of natural language processing in a specific domain, which aims to ... the neural network and large scale pre-trained language model have largely improved its perf...
Due to the nature of scientific methodology, research articles are rich in speculative and tentative statements, also known as hedges. We explore a linguistically motivated approach to the problem of recognizi...
This article presents Neji, an open source framework optimized for biomedical concept recognition built around four key characteristics: modularity, scalability, speed, and usability. It integrates modules for bi...
CoQUAD is a question-answering system that mines COVID-19 literature using natural language processing techniques to help the research community find...
Glaucoma can cause irreversible blindness to people’s eyesight. Since there are no symptoms in its early stage, it is particularly important to accurately segment the optic disc (OD) and optic cup (OC) from fu...
Techniques from computational linguistics can provide new insights into the annotation process. GO annotations show similar statistical behaviours to those seen in natural language with measured exponents that pr...
We present a literature-driven workflow involving document delivery and natural language processing steps generating tagged sentences containing lipid, protein...bibliosphere. We illustrate the extent of the desc...
Medical information has rapidly increased on the internet and has become one of the main targets of search engine use. However, medical information on the internet is subject to the problems of quality and acc...
Calcium (Ca2+) propagates within tissues serving as an important information carrier. In particular, cilia beat frequency in oviduct cells is partially regulated by Ca2+ changes. Thus, measuring the calcium de...
During library construction polymerase chain reaction is used to enrich the DNA before sequencing. Typically, this process generates duplicate read sequences. Removal of these artifacts is mandatory, as they c...
We verified that typographical errors in unstructured text negatively affect the performance of natural language processing tasks. The proposed method of a typo...
View featured videos from across the BMC-series journals
Citation Impact 2023
Journal Impact Factor: 2.9
5-year Journal Impact Factor: 3.6
Source Normalized Impact per Paper (SNIP): 0.821
SCImago Journal Rank (SJR): 1.005
Speed 2023
Submission to first editorial decision (median days): 12
Submission to acceptance (median days): 146
Usage 2023
Downloads: 5,987,678
Altmetric mentions: 4,858