Articles

Page 240 of 249

AutoFACT: An Auto matic F unctional A nnotation and C lassification T ool

Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous pro...

Authors: Liisa B Koski, Michael W Gray, B Franz Lang and Gertraud Burger

Citation: BMC Bioinformatics 2005 6:151

Content type: Software Published on: 16 June 2005
- View Full Text
- View PDF
Thesaurus-based disambiguation of gene symbols

Massive text mining of the biological literature holds great promise of relating disparate information and discovering new knowledge. However, disambiguation of gene symbols is a major bottleneck.

Authors: Bob JA Schijvenaars, Barend Mons, Marc Weeber, Martijn J Schuemie, Erik M van Mulligen, Hester M Wain and Jan A Kors

Citation: BMC Bioinformatics 2005 6:149

Content type: Research article Published on: 16 June 2005
- View Full Text
- View PDF
Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes

In the clinical context, samples assayed by microarray are often classified by cell line or tumour type and it is of interest to discover a set of genes that can be used as class predictors. The leukemia datas...

Authors: Thanyaluk Jirapech-Umpai and Stuart Aitken

Citation: BMC Bioinformatics 2005 6:148

Content type: Research article Published on: 15 June 2005
- View Full Text
- View PDF
Alkahest NuclearBLAST : a user-friendly BLAST management and analysis system

Sequencing of EST and BAC end datasets is no longer limited to large research groups. Drops in per-base pricing have made high throughput sequencing accessible to individual investigators. However, there are f...

Authors: Stephen E Diener, Thomas D Houfek, Sam E Kalat, DE Windham, Mark Burke, Charles Opperman and Ralph A Dean

Citation: BMC Bioinformatics 2005 6:147

Content type: Software Published on: 15 June 2005
- View Full Text
- View PDF
Visualization-based discovery and analysis of genomic aberrations in microarray data

Chromosomal copy number changes (aneuploidies) play a key role in cancer progression and molecular evolution. These copy number changes can be studied using microarray-based comparative genomic hybridization (...

Authors: Chad L Myers, Xing Chen and Olga G Troyanskaya

Citation: BMC Bioinformatics 2005 6:146

Content type: Software Published on: 13 June 2005
- View Full Text
- View PDF
Satellog: A database for the identification and prioritization of satellite repeats in disease association studies

To date, 35 human diseases, some of which also exhibit anticipation, have been associated with unstable repeats. Anticipation has been reported in a number of diseases in which repeat expansion may have a role...

Authors: Perseus I Missirlis, Carri-Lyn R Mead, Stefanie L Butland, BF Francis Ouellette, Rebecca S Devon, Blair R Leavitt and Robert A Holt

Citation: BMC Bioinformatics 2005 6:145

Content type: Database Published on: 10 June 2005
- View Full Text
- View PDF
PAGE: Parametric Analysis of Gene Set Enrichment

Gene set enrichment analysis (GSEA) is a microarray data analysis method that uses predefined gene sets and ranks of genes to identify significant biological changes in microarray data sets. GSEA is especially...

Authors: Seon-Young Kim and David J Volsky

Citation: BMC Bioinformatics 2005 6:144

Content type: Methodology article Published on: 8 June 2005
- View Full Text
- View PDF
Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information

The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest.

Authors: James W Cooper and Aaron Kershenbaum

Citation: BMC Bioinformatics 2005 6:143

Content type: Methodology article Published on: 7 June 2005
- View Full Text
- View PDF
Which gene did you mean?

Computational Biology needs computer-readable information records. Increasingly, meta-analysed and pre-digested information is being used in the follow up of high throughput experiments and other investigation...

Authors: Barend Mons

Citation: BMC Bioinformatics 2005 6:142

Content type: Commentary Published on: 7 June 2005
- View Full Text
- View PDF
Chemistry in Bioinformatics

Chemical information is now seen as critical for most areas of life sciences. But unlike Bioinformatics, where data is openly available and freely re-usable, most chemical information is closed and cannot be r...

Authors: Peter Murray-Rust, John BO Mitchell and Henry S Rzepa

Citation: BMC Bioinformatics 2005 6:141

Content type: Commentary Published on: 7 June 2005
- View Full Text
- View PDF
BMC Bioinformatics comes of age

Authors: Matthew J Cockerill

Citation: BMC Bioinformatics 2005 6:140

Content type: Editorial Published on: 7 June 2005
- View Full Text
- View PDF
PentaPlot: A software tool for the illustration of genome mosaicism

Dekapentagonal maps depict the phylogenetic relationships of five genomes in a visually appealing diagram and can be viewed as an alternative to a single evolutionary consensus tree. In particular, the generat...

Authors: Lutz Hamel, Olga Zhaxybayeva and J Peter Gogarten

Citation: BMC Bioinformatics 2005 6:139

Content type: Software Published on: 6 June 2005
- View Full Text
- View PDF
libcov: A C++ bioinformatic library to manipulate protein structures, sequence alignments and phylogeny

An increasing number of bioinformatics methods are considering the phylogenetic relationships between biological sequences. Implementing new methodologies using the maximum likelihood phylogenetic framework ca...

Authors: Davin Butt, Andrew J Roger and Christian Blouin

Citation: BMC Bioinformatics 2005 6:138

Content type: Software Published on: 6 June 2005
- View Full Text
- View PDF
SplitTester : software to identify domains responsible for functional divergence in protein family

Many protein families have undergone functional divergence after gene duplications such that current subgroups of the family carry out overlapping but distinct biological roles. For the protein families with k...

Authors: Xiang Gao, Kent A Vander Velden, Daniel F Voytas and Xun Gu

Citation: BMC Bioinformatics 2005 6:137

Content type: Software Published on: 1 June 2005
- View Full Text
- View PDF
AVID: An integrative framework for discovering functional relationships among proteins

Determining the functions of uncharacterized proteins is one of the most pressing problems in the post-genomic era. Large scale protein-protein interaction assays, global mRNA expression analyses and systemati...

Authors: Taijiao Jiang and Amy E Keating

Citation: BMC Bioinformatics 2005 6:136

Content type: Methodology article Published on: 1 June 2005
- View Full Text
- View PDF
YANA – a software tool for analyzing flux modes, gene-expression and enzyme activities

A number of algorithms for steady state analysis of metabolic networks have been developed over the years. Of these, Elementary Mode Analysis (EMA) has proven especially useful. Despite its low user-friendline...

Authors: Roland Schwarz, Patrick Musch, Axel von Kamp, Bernd Engels, Heiner Schirmer, Stefan Schuster and Thomas Dandekar

Citation: BMC Bioinformatics 2005 6:135

Content type: Software Published on: 1 June 2005
- View Full Text
- View PDF
Empirical codon substitution matrix

Codon substitution probabilities are used in many types of molecular evolution studies such as determining Ka/Ks ratios, creating ancestral DNA sequences or aligning coding DNA. Until the recent dramatic incre...

Authors: Adrian Schneider, Gina M Cannarozzi and Gaston H Gonnet

Citation: BMC Bioinformatics 2005 6:134

Content type: Research article Published on: 1 June 2005
- View Full Text
- View PDF
SeqDoC: rapid SNP and mutation detection by direct comparison of DNA sequence chromatograms

This paper describes SeqDoC, a simple, web-based tool to carry out direct comparison of ABI sequence chromatograms. This allows the rapid identification of single nucleotide polymorphisms (SNPs) and point muta...

Authors: Mark L Crowe

Citation: BMC Bioinformatics 2005 6:133

Content type: Software Published on: 31 May 2005
- View Full Text
- View PDF
Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method

Many processes in molecular biology involve the recognition of short sequences of nucleic-or amino acids, such as the binding of immunogenic peptides to major histocompatibility complex (MHC) molecules. From e...

Authors: Bjoern Peters and Alessandro Sette

Citation: BMC Bioinformatics 2005 6:132

Content type: Software Published on: 31 May 2005
- View Full Text
- View PDF
Gene finding in the chicken genome

Despite the continuous production of genome sequence for a number of organisms, reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularly true for genomes for which ...

Authors: Eduardo Eyras, Alexandre Reymond, Robert Castelo, Jacqueline M Bye, Francisco Camara, Paul Flicek, Elizabeth J Huckle, Genis Parra, David D Shteynberg, Carine Wyss, Jane Rogers, Stylianos E Antonarakis, Ewan Birney, Roderic Guigo and Michael R Brent

Citation: BMC Bioinformatics 2005 6:131

Content type: Research article Published on: 30 May 2005
- View Full Text
- View PDF
Vestige: Maximum likelihood phylogenetic footprinting

Phylogenetic footprinting is the identification of functional regions of DNA by their evolutionary conservation. This is achieved by comparing orthologous regions from multiple species and identifying the DNA ...

Authors: Matthew J Wakefield, Peter Maxwell and Gavin A Huttley

Citation: BMC Bioinformatics 2005 6:130

Content type: Software Published on: 29 May 2005
- View Full Text
- View PDF
Considerations when using the significance analysis of microarrays (SAM) algorithm

Users of microarray technology typically strive to use universally acceptable data analysis strategies to determine significant expression changes in their experiments. One of the most frequently utilised meth...

Authors: Ola Larsson, Claes Wahlestedt and James A Timmons

Citation: BMC Bioinformatics 2005 6:129

Content type: Correspondence Published on: 29 May 2005
- View Full Text
- View PDF
Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models

With the explosion of microarray studies, an enormous amount of data is being produced. Systematic integration of gene expression data from different sources increases statistical power of detecting differenti...

Authors: Pingzhao Hu, Celia MT Greenwood and Joseph Beyene

Citation: BMC Bioinformatics 2005 6:128

Content type: Research article Published on: 27 May 2005
- View Full Text
- View PDF
Phylogenetic reconstruction of ancestral character states for gene expression and mRNA splicing data

As genomes evolve after speciation, gene content, coding sequence, gene expression, and splicing all diverge with time from ancestors with close relatives. A minimum evolution general method for continuous cha...

Authors: Roald Rossnes, Ingvar Eidhammer and David A Liberles

Citation: BMC Bioinformatics 2005 6:127

Content type: Software Published on: 27 May 2005
- View Full Text
- View PDF
Comparative analysis of chromatin landscape in regulatory regions of human housekeeping and tissue specific genes

Global regulatory mechanisms involving chromatin assembly and remodelling in the promoter regions of genes is implicated in eukaryotic transcription control especially for genes subjected to spatial and tempor...

Authors: Mythily Ganapathi, Pragya Srivastava, Sushanta Kumar Das Sutar, Kaushal Kumar, Dipayan Dasgupta, Gajinder Pal Singh, Vani Brahmachari and Samir K Brahmachari

Citation: BMC Bioinformatics 2005 6:126

Content type: Research article Published on: 26 May 2005
- View Full Text
- View PDF
Detection of nuclei in 4D Nomarski DIC microscope images of early Caenorhabditis elegans embryos using local image entropy and object tracking

The ability to detect nuclei in embryos is essential for studying the development of multicellular organisms. A system of automated nuclear detection has already been tested on a set of four-dimensional (4D) N...

Authors: Shugo Hamahashi, Shuichi Onami and Hiroaki Kitano

Citation: BMC Bioinformatics 2005 6:125

Content type: Methodology article Published on: 24 May 2005
- View Full Text
- View PDF
Data-poor categorization and passage retrieval for Gene Ontology Annotation in Swiss-Prot

In the context of the BioCreative competition, where training data were very sparse, we investigated two complementary tasks: 1) given a Swiss-Prot triplet, containing a protein, a GO (Gene Ontology) term and ...

Authors: Frédéric Ehrler, Antoine Geissbühler, Antonio Jimeno and Patrick Ruch

Citation: BMC Bioinformatics 2005 6(Suppl 1):S23

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Mining protein function from text using term-based support vector machines

Text mining has spurred huge interest in the domain of biology. The goal of the BioCreAtIvE exercise was to evaluate the performance of current text mining systems. We participated in Task 2, which addressed a...

Authors: Simon B Rice, Goran Nenadic and Benjamin J Stapley

Citation: BMC Bioinformatics 2005 6(Suppl 1):S22

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Finding genomic ontology terms in text using evidence content

The development of text mining systems that annotate biological entities with their properties using scientific literature is an important recent research topic. These systems need first to recognize the biolo...

Authors: Francisco M Couto, Mário J Silva and Pedro M Coutinho

Citation: BMC Bioinformatics 2005 6(Suppl 1):S21

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Protein annotation as term categorization in the gene ontology using word proximity networks

We participated in the BioCreAtIvE Task 2, which addressed the annotation of proteins into the Gene Ontology (GO) based on the text of a given document and the selection of evidence text from the document just...

Authors: Karin Verspoor, Judith Cohn, Cliff Joslyn, Sue Mniszewski, Andreas Rechtsteiner, Luis M Rocha and Tiago Simas

Citation: BMC Bioinformatics 2005 6(Suppl 1):S20

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
A sentence sliding window approach to extract protein annotations from biomedical articles

Within the emerging field of text mining and statistical natural language processing (NLP) applied to biomedical articles, a broad variety of techniques have been developed during the past years. Nevertheless,...

Authors: Martin Krallinger, Maria Padron and Alfonso Valencia

Citation: BMC Bioinformatics 2005 6(Suppl 1):S19

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Learning Statistical Models for Annotating Proteins with Function Information using Biomedical Text

The BioCreative text mining evaluation investigated the application of text mining methods to the task of automatically extracting information from text in biomedical research articles. We participated in Task...

Authors: Soumya Ray and Mark Craven

Citation: BMC Bioinformatics 2005 6(Suppl 1):S18

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
An evaluation of GO annotation retrieval for BioCreAtIvE and GOA

The Gene Ontology Annotation (GOA) database http://www.ebi.ac.uk/GOA aims to provide high-quality supplementary GO annotation to proteins in the UniProt Know...

Authors: Evelyn B Camon, Daniel G Barrell, Emily C Dimmer, Vivian Lee, Michele Magrane, John Maslen, David Binns and Rolf Apweiler

Citation: BMC Bioinformatics 2005 6(Suppl 1):S17

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Evaluation of BioCreAtIvE assessment of task 2

Molecular Biology accumulated substantial amounts of data concerning functions of genes and proteins. Information relating to functional descriptions is generally extracted manually from textual data and store...

Authors: Christian Blaschke, Eduardo Andres Leon, Martin Krallinger and Alfonso Valencia

Citation: BMC Bioinformatics 2005 6(Suppl 1):S16

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
A simple approach for protein name identification: prospects and limits

Significant parts of biological knowledge are available only as unstructured text in articles of biomedical journals. By automatically identifying gene and gene product (protein) names and mapping these to uni...

Authors: Katrin Fundel, Daniel Güttler, Ralf Zimmer and Joannis Apostolakis

Citation: BMC Bioinformatics 2005 6(Suppl 1):S15

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
ProMiner: rule-based protein and gene entity recognition

Identification of gene and protein names in biomedical text is a challenging task as the corresponding nomenclature has evolved over time. This has led to multiple synonyms for individual genes and proteins, a...

Authors: Daniel Hanisch, Katrin Fundel, Heinz-Theodor Mevissen, Ralf Zimmer and Juliane Fluck

Citation: BMC Bioinformatics 2005 6(Suppl 1):S14

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Automatically annotating documents with normalized gene lists

Document gene normalization is the problem of creating a list of unique identifiers for genes that are mentioned within a document. Automating this process has many potential applications in both information e...

Authors: Jeremiah Crim, Ryan McDonald and Fernando Pereira

Citation: BMC Bioinformatics 2005 6(Suppl 1):S13

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Data preparation and interannotator agreement: BioCreAtIvE Task 1B

We prepared and evaluated training and test materials for an assessment of text mining methods in molecular biology. The goal of the assessment was to evaluate the ability of automated systems to generate a li...

Authors: Marc E Colosimo, Alexander A Morgan, Alexander S Yeh, Jeffrey B Colombe and Lynette Hirschman

Citation: BMC Bioinformatics 2005 6(Suppl 1):S12

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Overview of BioCreAtIvE task 1B: normalized gene lists

Our goal in BioCreAtIve has been to assess the state of the art in text mining, with emphasis on applications that reflect real biological applications, e.g., the curation process for model organism databases....

Authors: Lynette Hirschman, Marc Colosimo, Alexander Morgan and Alexander Yeh

Citation: BMC Bioinformatics 2005 6(Suppl 1):S11

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Text Detective: a rule-based system for gene annotation in biomedical texts

The identification of mentions of gene or gene products in biomedical texts is a critical step in the development of text mining applications in biosciences. The complexity and ambiguity of gene nomenclature m...

Authors: Javier Tamames

Citation: BMC Bioinformatics 2005 6(Suppl 1):S10

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Systematic feature evaluation for gene name recognition

In task 1A of the BioCreAtIvE evaluation, systems had to be devised that recognize words and phrases forming gene or protein names in natural language sentences. We approach this problem by building a word cla...

Authors: Jörg Hakenberg, Steffen Bickel, Conrad Plake, Ulf Brefeld, Hagen Zahn, Lukas Faulstich, Ulf Leser and Tobias Scheffer

Citation: BMC Bioinformatics 2005 6(Suppl 1):S9

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Gene/protein name recognition based on support vector machine using dictionary as features

Automated information extraction from biomedical literature is important because a vast amount of biomedical literature has been published. Recognition of the biomedical named entities is the first step in inf...

Authors: Tomohiro Mitsumori, Sevrani Fation, Masaki Murata, Kouichi Doi and Hirohumi Doi

Citation: BMC Bioinformatics 2005 6(Suppl 1):S8

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Recognition of protein/gene names from text using an ensemble of classifiers

This paper proposes an ensemble of classifiers for biomedical name recognition in which three classifiers, one Support Vector Machine and two discriminative Hidden Markov Models, are combined effectively using...

Authors: GuoDong Zhou, Dan Shen, Jie Zhang, Jian Su and SoonHeng Tan

Citation: BMC Bioinformatics 2005 6(Suppl 1):S7

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Identifying gene and protein mentions in text using conditional random fields

We present a model for tagging gene and protein mentions from text using the probabilistic sequence tagging framework of conditional random fields (CRFs). Conditional random fields model the probability P(t|o) of...

Authors: Ryan McDonald and Fernando Pereira

Citation: BMC Bioinformatics 2005 6(Suppl 1):S6

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Exploring the boundaries: gene and protein identification in biomedical text

Good automatic information extraction tools offer hope for automatic processing of the exploding biomedical literature, and successful named entity recognition is a key component for such tools.

Authors: Jenny Finkel, Shipra Dingare, Christopher D Manning, Malvina Nissim, Beatrice Alex and Claire Grover

Citation: BMC Bioinformatics 2005 6(Suppl 1):S5

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
BioCreAtIvE Task1A: entity identification with a stochastic tagger

Our approach to Task 1A was inspired by Tanabe and Wilbur's ABGene system [1, 2]. Like Tanabe and Wilbur, we approached the problem as one of part-of-speech tagging, adding a GENE tag to the standard tag set. Whe...

Authors: Shuhei Kinoshita, K Bretonnel Cohen, Philip V Ogren and Lawrence Hunter

Citation: BMC Bioinformatics 2005 6(Suppl 1):S4

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
GENETAG: a tagged corpus for gene/protein named entity recognition

Named entity recognition (NER) is an important first step for text mining the biomedical literature. Evaluating the performance of biomedical NER systems is impossible without a standardized test corpus. The a...

Authors: Lorraine Tanabe, Natalie Xie, Lynne H Thom, Wayne Matten and W John Wilbur

Citation: BMC Bioinformatics 2005 6(Suppl 1):S3

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
BioCreAtIvE Task 1A: gene mention finding evaluation

The biological research literature is a major repository of knowledge. As the amount of literature increases, it will get harder to find the information of interest on a particular topic. There has been an inc...

Authors: Alexander Yeh, Alexander Morgan, Marc Colosimo and Lynette Hirschman

Citation: BMC Bioinformatics 2005 6(Suppl 1):S2

Content type: Report Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
Overview of BioCreAtIvE: critical assessment of information extraction for biology

The goal of the first BioCreAtIvE challenge (Critical Assessment of Information Extraction in Biology) was to provide a set of common evaluation tasks to assess the state of the art for text mining applied to ...

Authors: Lynette Hirschman, Alexander Yeh, Christian Blaschke and Alfonso Valencia

Citation: BMC Bioinformatics 2005 6(Suppl 1):S1

Content type: Introduction Published on: 24 May 2005

This article is part of a Supplement: Volume 6 Supplement 1
- View Full Text
- View PDF
arrayCGHbase: an analysis platform for comparative genomic hybridization microarrays

The availability of the human genome sequence as well as the large number of physically accessible oligonucleotides, cDNA, and BAC clones across the entire genome has triggered and accelerated the use of sever...

Authors: Björn Menten, Filip Pattyn, Katleen De Preter, Piet Robbrecht, Evi Michels, Karen Buysse, Geert Mortier, Anne De Paepe, Steven van Vooren, Joris Vermeesch, Yves Moreau, Bart De Moor, Stefan Vermeulen, Frank Speleman and Jo Vandesompele

Citation: BMC Bioinformatics 2005 6:124

Content type: Software Published on: 23 May 2005
- View Full Text
- View PDF

How was your experience today?

Rating Please select one rating

Awful

Bad

Good

Great

Thank you for your feedback.

Tell us why (opens in a new tab)

Featured videos

View featured videos from across the BMC-series journals

Articles

Featured videos

Important information

Annual Journal Metrics

Follow

BMC Bioinformatics

Contact us