Articles

Page 1 of 248

Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions

Genome sequencing and bioinformatics are producing detailed lists of the molecular components contained in many prokaryotic organisms. From this 'parts catalogue' of a microbial cell, in silico representations of...

Authors: Jeremy S Edwards and Bernhard O Palsson

Citation: BMC Bioinformatics 2000 1:1

Content type: Research article Published on: 27 July 2000
- View Full Text
- View PDF
An algorithm for mapping positively selected members of quasispecies-type viruses

Many RNA viruses do not have a single, representative genome but instead form a set of related variants that has been called a quasispecies. The sequence variability of such viruses presents a significant bioi...

Authors: Jeffrey J Stewart, Perry Watts and Samuel Litwin

Citation: BMC Bioinformatics 2001 2:1

Content type: Research article Published on: 6 March 2001
- View Full Text
- View PDF
mRNA:guanine-N 7 cap methyltransferases: identification of novel members of the family, evolutionary analysis, homology modeling, and analysis of sequence-structure-function relationships

The 5'-terminal cap structure plays an important role in many aspects of mRNA metabolism. Capping enzymes encoded by viruses and pathogenic fungi are attractive targets for specific inhibitors. There is a larg...

Authors: Janusz M Bujnicki, Marcin Feder, Monika Radlinska and Leszek Rychlewski

Citation: BMC Bioinformatics 2001 2:2

Content type: Research article Published on: 22 June 2001
- View Full Text
- View PDF
Support Vector Machines for predicting protein structural class

We apply a new machine learning method, the so-called Support Vector Machine method, to predict the protein structural class. Support Vector Machine method is performed based on the database derived from SCOP,...

Authors: Yu-Dong Cai, Xiao-Jun Liu, Xue-biao Xu and Guo-Ping Zhou

Citation: BMC Bioinformatics 2001 2:3

Content type: Research article Published on: 29 June 2001
- View Full Text
- View PDF
The relationship between the L1 and L2 domains of the insulin and epidermal growth factor receptors and leucine-rich repeat modules

Leucine-rich repeats are one of the more common modules found in proteins. The leucine-rich repeat consensus motif is LxxLxLxxNxLxxLxxLxxLxx- where the first 11–12 residues are highly conserved and the remaind...

Authors: Colin W Ward and Thomas PJ Garrett

Citation: BMC Bioinformatics 2001 2:4

Content type: Research article Published on: 27 July 2001
- View Full Text
- View PDF
A study of quality measures for protein threading models

Prediction of protein structures is one of the fundamental challenges in biology today. To fully understand how well different prediction methods perform, it is necessary to use measures that evaluate their pe...

Authors: Susana Cristobal, Adam Zemla, Daniel Fischer, Leszek Rychlewski and Arne Elofsson

Citation: BMC Bioinformatics 2001 2:5

Content type: Research article Published on: 1 August 2001
- View Full Text
- View PDF
FOUNTAIN: A JAVA open-source package to assist large sequencing projects

Better automation, lower cost per reaction and a heightened interest in comparative genomics has led to a dramatic increase in DNA sequencing activities. Although the large sequencing projects of specialized c...

Authors: Jean-Marie Buerstedde and Florian Prill

Citation: BMC Bioinformatics 2001 2:6

Content type: Methodology article Published on: 21 September 2001
- View Full Text
- View PDF
The Distributed Annotation System

Currently, most genome annotation is curated by centralized groups with limited resources. Efforts to share annotations transparently among multiple groups have not yet been satisfactory.

Authors: Robin D Dowell, Rodney M Jokerst, Allen Day, Sean R Eddy and Lincoln Stein

Citation: BMC Bioinformatics 2001 2:7

Content type: Research article Published on: 10 October 2001
- View Full Text
- View PDF
Noncoding RNA gene detection using comparative sequence analysis

Noncoding RNA genes produce transcripts that exert their function without ever producing proteins. Noncoding RNA gene sequences do not have strong statistical signals, unlike protein coding genes. A reliable g...

Authors: Elena Rivas and Sean R Eddy

Citation: BMC Bioinformatics 2001 2:8

Content type: Methodology article Published on: 10 October 2001
- View Full Text
- View PDF
FastGroup: A program to dereplicate libraries of 16S rDNA sequences

Ribosomal 16S DNA sequences are an essential tool for identifying and classifying microbes. High-throughput DNA sequencing now makes it economically possible to produce very large datasets of 16S rDNA sequence...

Authors: Victor Seguritan and Forest Rohwer

Citation: BMC Bioinformatics 2001 2:9

Content type: Methodology article Published on: 16 October 2001
- View Full Text
- View PDF
Identification and characterization of subfamily-specific signatures in a large protein superfamily by a hidden Markov model approach

Most profile and motif databases strive to classify protein sequences into a broad spectrum of protein families. The next step of such database studies should include the development of classification systems ...

Authors: Kevin Truong and Mitsuhiko Ikura

Citation: BMC Bioinformatics 2002 3:1

Content type: Methodology article Published on: 10 January 2002
- View Full Text
- View PDF
The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs

Comparative analysis of RNA sequences is the basis for the detailed and accurate predictions of RNA structure and the determination of phylogenetic relationships for organisms that span the entire phylogenetic...

Authors: Jamie J Cannone, Sankar Subramanian, Murray N Schnare, James R Collett, Lisa M D'Souza, Yushi Du, Brian Feng, Nan Lin, Lakshmi V Madabusi, Kirsten M Müller, Nupur Pande, Zhidi Shang, Nan Yu and Robin R Gutell

Citation: BMC Bioinformatics 2002 3:2

Content type: Research article Published on: 17 January 2002
- View Full Text
- View PDF
Match-Only Integral Distribution (MOID) Algorithm for high-density oligonucleotide array analysis

High-density oligonucleotide arrays have become a valuable tool for high-throughput gene expression profiling. Increasing the array information density and improving the analysis algorithms are two important c...

Authors: Yingyao Zhou and Ruben Abagyan

Citation: BMC Bioinformatics 2002 3:3

Content type: Research article Published on: 22 January 2002
- View Full Text
- View PDF
Sources of variability and effect of experimental approach on expression profiling data interpretation

We provide a systematic study of the sources of variability in expression profiling data using 56 RNAs isolated from human muscle biopsies (34 Affymetrix MuscleChip arrays), and 36 murine cell culture and tiss...

Authors: Marina Bakay, Yi-Wen Chen, Rehannah Borup, Po Zhao, Kanneboyina Nagaraju and Eric P Hoffman

Citation: BMC Bioinformatics 2002 3:4

Content type: Research article Published on: 31 January 2002
- View Full Text
- View PDF
Re-annotation of genome microbial CoDing-Sequences: finding new genes and inaccurately annotated genes

Analysis of any newly sequenced bacterial genome starts with the identification of protein-coding genes. Despite the accumulation of multiple complete genome sequences, which provide useful comparisons with cl...

Authors: Stéphanie Bocs, Antoine Danchin and Claudine Médigue

Citation: BMC Bioinformatics 2002 3:5

Content type: Research article Published on: 5 February 2002
- View Full Text
- View PDF
Universal sequence map (USM) of arbitrary discrete sequences

For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define...

Authors: Jonas S Almeida and Susana Vinga

Citation: BMC Bioinformatics 2002 3:6

Content type: Methodology article Published on: 5 February 2002
- View Full Text
- View PDF
Correlating overrepresented upstream motifs to gene expression: a computational approach to regulatory element discovery in eukaryotes

Gene regulation in eukaryotes is mainly effected through transcription factors binding to rather short recognition motifs generally located upstream of the coding region. We present a novel computational metho...

Authors: Michele Caselle, Ferdinando Di Cunto and Paolo Provero

Citation: BMC Bioinformatics 2002 3:7

Content type: Methodology article Published on: 14 February 2002
- View Full Text
- View PDF
tacg – a grep for DNA

Pattern matching is the core of bioinformatics; it is used in database searching, restriction enzyme mapping, and finding open reading frames. It is done repeatedly over increasingly long sequences, thus codes...

Authors: Harry J Mangalam

Citation: BMC Bioinformatics 2002 3:8

Content type: Methodology article Published on: 6 March 2002
- View Full Text
- View PDF
An algorithm and program for finding sequence specific oligo-nucleotide probes for species identification

The identification of species or species groups with specific oligo-nucleotides as molecular signatures is becoming increasingly popular for bacterial samples. However, it shows also great promise for other sm...

Authors: Alexander E Pozhitkov and Diethard Tautz

Citation: BMC Bioinformatics 2002 3:9

Content type: Research article Published on: 6 March 2002
- View Full Text
- View PDF
RNA:(guanine-N2) methyltransferases RsmC/RsmD and their homologs revisited – bioinformatic analysis and prediction of the active site based on the uncharacterized Mj0882 protein structure

Escherichia coli guanine-N2 (m²G) methyltransferases (MTases) RsmC and RsmD modify nucleosides G1207 and G966 of 16S rRNA. They possess a common MTase domain in the C-terminus and a variable region in the N-termi...

Authors: Janusz M Bujnicki and Leszek Rychlewski

Citation: BMC Bioinformatics 2002 3:10

Content type: Research article Published on: 3 April 2002
- View Full Text
- View PDF
Homology Induction: the use of machine learning to improve sequence similarity searches

The inference of homology between proteins is a key problem in molecular biology The current best approaches only identify ~50% of homologies (with a false positive rate set at 1/1000).

Authors: Andreas Karwath and Ross D King

Citation: BMC Bioinformatics 2002 3:11

Content type: Methodology article Published on: 23 April 2002
- View Full Text
- View PDF
CoreGenes: A computational tool for identifying and cataloging "core" genes in a set of small genomes

Improvements in DNA sequencing technology and methodology have led to the rapid expansion of databases comprising DNA sequence, gene and genome data. Lower operational costs and heightened interest resulting f...

Authors: Nikhat Zafar, Raja Mazumder and Donald Seto

Citation: BMC Bioinformatics 2002 3:12

Content type: Methodology article Published on: 24 April 2002
- View Full Text
- View PDF
NBLAST: a cluster variant of BLAST for NxN comparisons

The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequen...

Authors: Michel Dumontier and Christopher WV Hogue

Citation: BMC Bioinformatics 2002 3:13

Content type: Methodology article Published on: 8 May 2002
- View Full Text
- View PDF
RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs

When analyzing protein sequences using sequence similarity searches, orthologous sequences (that diverged by speciation) are more reliable predictors of a new protein's function than paralogous sequences (that...

Authors: Christian M Zmasek and Sean R Eddy

Citation: BMC Bioinformatics 2002 3:14

Content type: Methodology article Published on: 16 May 2002
- View Full Text
- View PDF
Erratum to: The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs: Correction

Authors: Jamie J Cannone, Sankar Subramanian, Murray N Schnare, James R Collett, Lisa M D'Souza, Yushi Du, Brian Feng, Nan Lin, Lakshmi V Madabusi, Kirsten M Müller, Nupur Pande, Zhidi Shang, Nan Yu and Robin R Gutell

Citation: BMC Bioinformatics 2002 3:15

Content type: Erratum Published on: 31 May 2002

The original article was published in BMC Bioinformatics 2002 3:2
- View Full Text
- View PDF
Identifying functional relationships among human genes by systematic analysis of biological literature

The availability of biomedical literature in electronic format has made it possible to implement automatic text processing methods to expose implicit relationships among different documents, and more important...

Authors: Yong-Chuan Tao and Rudolph L Leibel

Citation: BMC Bioinformatics 2002 3:16

Content type: Research article Published on: 7 June 2002
- View Full Text
- View PDF
The limit fold change model: A practical approach for selecting differentially expressed genes from microarray data

The biomedical community is developing new methods of data analysis to more efficiently process the massive data sets produced by microarray experiments. Systematic and global mathematical approaches that can ...

Authors: David M Mutch, Alvin Berger, Robert Mansourian, Andreas Rytz and Matthew-Alan Roberts

Citation: BMC Bioinformatics 2002 3:17

Content type: Methodology article Published on: 21 June 2002
- View Full Text
- View PDF
A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure

Covariance models (CMs) are probabilistic models of RNA secondary structure, analogous to profile hidden Markov models of linear sequence. The dynamic programming algorithm for aligning a CM to an RNA sequence...

Authors: Sean R Eddy

Citation: BMC Bioinformatics 2002 3:18

Content type: Research article Published on: 2 July 2002
- View Full Text
- View PDF
Visualizing the genome: techniques for presenting human genome data and annotations

In order to take full advantage of the newly available public human genome sequence data and associated annotations, biologists require visualization tools ("genome browsers") that can accommodate the high fre...

Authors: Ann E Loraine and Gregg A Helt

Citation: BMC Bioinformatics 2002 3:19

Content type: Methodology article Published on: 30 July 2002
- View Full Text
- View PDF
Kangaroo – A pattern-matching program for biological sequences

Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily avail...

Authors: Doron Betel and Christopher WV Hogue

Citation: BMC Bioinformatics 2002 3:20

Content type: Methodology article Published on: 31 July 2002
- View Full Text
- View PDF
The SGS3 protein involved in PTGS finds a family

Post transcriptional gene silencing (PTGS) is a recently discovered phenomenon that is an area of intense research interest. Components of the PTGS machinery are being discovered by genetic and bioinformatics ...

Authors: Alex Bateman

Citation: BMC Bioinformatics 2002 3:21

Content type: Research article Published on: 5 August 2002
- View Full Text
- View PDF
Microarray results: how accurate are they?

DNA microarray technology is a powerful technique that was recently developed in order to analyze thousands of genes in a short time. Presently, microarrays, or chips, of the cDNA type and oligonucleotide type...

Authors: Ravi Kothapalli, Sean J Yoder, Shrikant Mane and Thomas P Loughran Jr

Citation: BMC Bioinformatics 2002 3:22

Content type: Research article Published on: 23 August 2002
- View Full Text
- View PDF
Computational method for reducing variance with Affymetrix microarrays

Affymetrix microarrays are used by many laboratories to generate gene expression profiles. Generally, only large differences (> 1.7-fold) between conditions have been reported. Computational methods to reduce ...

Authors: Stephen Welle, Andrew I Brooks and Charles A Thornton

Citation: BMC Bioinformatics 2002 3:23

Content type: Research article Published on: 30 August 2002
- View Full Text
- View PDF
The functional importance of disease-associated mutation

For many years, scientists believed that point mutations in genes are the genetic switches for somatic and inherited diseases such as cystic fibrosis, phenylketonuria and cancer. Some of these mutations likely...

Authors: Sean D Mooney and Teri E Klein

Citation: BMC Bioinformatics 2002 3:24

Content type: Research article Published on: 9 September 2002
- View Full Text
- View PDF
Prediction of MHC class I binding peptides, using SVMHC

T-cells are key players in regulating a specific immune response. Activation of cytotoxic T-cells requires recognition of specific peptides bound to Major Histocompatibility Complex (MHC) class I molecules. MH...

Authors: Pierre Dönnes and Arne Elofsson

Citation: BMC Bioinformatics 2002 3:25

Content type: Research article Published on: 11 September 2002
- View Full Text
- View PDF
Expression profiling of human renal carcinomas with functional taxonomic analysis

Molecular characterization has contributed to the understanding of the inception, progression, treatment and prognosis of cancer. Nucleic acid array-based technologies extend molecular characterization of tumo...

Authors: Michael A Gieseg, Theresa Cody, Michael Z Man, Steven J Madore, Mark A Rubin and Eric P Kaldjian

Citation: BMC Bioinformatics 2002 3:26

Content type: Research article Published on: 30 September 2002
- View Full Text
- View PDF
Oliz, a suite of Perl scripts that assist in the design of microarrays using 50mer oligonucleotides from the 3' untranslated region

Identifying reliable oligonucleotide sequences for use in microarray experiments is a complex process. Two key issues are the accuracy of the input sequences and the specificity of the oligonucleotide sequences.

Authors: Hao Chen and Burt M Sharp

Citation: BMC Bioinformatics 2002 3:27

Content type: Methodology article Published on: 6 October 2002
- View Full Text
- View PDF
Efficient Boolean implementation of universal sequence maps (bUSM)

Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence anal...

Authors: John Schwacke and Jonas S Almeida

Citation: BMC Bioinformatics 2002 3:28

Content type: Methodology article Published on: 21 October 2002
- View Full Text
- View PDF
Simulation of DNA array hybridization experiments and evaluation of critical parameters during subsequent image and data analysis

Gene expression analyses based on complex hybridization measurements have increased rapidly in recent years and have given rise to a huge amount of bioinformatic tools such as image analyses and cluster analys...

Authors: Christoph K Wierling, Matthias Steinfath, Thorsten Elge, Steffen Schulze-Kremer, Pia Aanstad, Matthew Clark, Hans Lehrach and Ralf Herwig

Citation: BMC Bioinformatics 2002 3:29

Content type: Research article Published on: 22 October 2002
- View Full Text
- View PDF
Computational detection of genomic cis- regulatory modules applied to body patterning in the early Drosophila embryo

Regulation of gene transcription is crucial for the function and development of all organisms. While gene prediction programs that identify protein coding sequence are used with remarkable success in the annot...

Authors: Nikolaus Rajewsky, Massimo Vergassola, Ulrike Gaul and Eric D Siggia

Citation: BMC Bioinformatics 2002 3:30

Content type: Research article Published on: 24 October 2002
- View Full Text
- View PDF
Making sense of EST sequences by CLOBBing them

Expressed sequence tags (ESTs) are single pass reads from randomly selected cDNA clones. They provide a highly cost-effective method to access and identify expressed genes. However, they are often prone to seq...

Authors: John Parkinson, David B Guiliano and Mark Blaxter

Citation: BMC Bioinformatics 2002 3:31

Content type: Research article Published on: 25 October 2002
- View Full Text
- View PDF
SeqHound: biological sequence and structure database as a platform for bioinformatics research

SeqHound has been developed as an integrated biological sequence, taxonomy, annotation and 3-D structure database system. It provides a high-performance server platform for bioinformatics research in a locally...

Authors: Katerina Michalickova, Gary D Bader, Michel Dumontier, Hao Lieu, Doron Betel, Ruth Isserlin and Christopher WV Hogue

Citation: BMC Bioinformatics 2002 3:32

Content type: Methodology article Published on: 25 October 2002
- View Full Text
- View PDF
Development and production of an oligonucleotide MuscleChip: use for validation of ambiguous ESTs

We describe the development, validation, and use of a highly redundant 120,000 oligonucleotide microarray (MuscleChip) containing 4,601 probe sets representing 1,150 known genes expressed in muscle and 2,075 E...

Authors: Rehannah HA Borup, Stefano Toppo, Yi-Wen Chen, Tanya M Teslovich, Gerolamo Lanfranchi, Giorgio Valle and Eric P Hoffman

Citation: BMC Bioinformatics 2002 3:33

Content type: Research article Published on: 29 October 2002
- View Full Text
- View PDF
Automated modelling of signal transduction networks

Intracellular signal transduction is achieved by networks of proteins and small molecules that transmit information from the cell surface to the nucleus, where they ultimately effect transcriptional changes. U...

Authors: Martin Steffen, Allegra Petti, John Aach, Patrik D'haeseleer and George Church

Citation: BMC Bioinformatics 2002 3:34

Content type: Research article Published on: 1 November 2002
- View Full Text
- View PDF
FunSpec: a web-based cluster interpreter for yeast

For effective exposition of biological information, especially with regard to analysis of large-scale data types, researchers need immediate access to multiple categorical knowledge bases and need summary info...

Authors: Mark D Robinson, Jörg Grigull, Naveed Mohammad and Timothy R Hughes

Citation: BMC Bioinformatics 2002 3:35

Content type: Methodology article Published on: 13 November 2002
- View Full Text
- View PDF
Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study

A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high ...

Authors: Junbai Wang, Jan Delabie, Hans Christian Aasheim, Erlend Smeland and Ola Myklebost

Citation: BMC Bioinformatics 2002 3:36

Content type: Research article Published on: 24 November 2002
- View Full Text
- View PDF
Total sequence decomposition distinguishes functional modules, "molegos" in apurinic/apyrimidinic endonucleases

Total sequence decomposition, using the web-based MASIA tool, identifies areas of conservation in aligned protein sequences. By structurally annotating these motifs, the sequence can be parsed into individual ...

Authors: Catherine H Schein, Numan Özgün, Tadahide Izumi and Werner Braun

Citation: BMC Bioinformatics 2002 3:37

Content type: Research article Published on: 25 November 2002
- View Full Text
- View PDF
Quantifying robustness of biochemical network models

Robustness of mathematical models of biochemical networks is important for validation purposes and can be used as a means of selecting between different competing models. Tools for quantifying parametric robus...

Authors: Lan Ma and Pablo A Iglesias

Citation: BMC Bioinformatics 2002 3:38

Content type: Research article Published on: 13 December 2002
- View Full Text
- View PDF
Species-specific protein sequence and fold optimizations

An organism's ability to adapt to its particular environmental niche is of fundamental importance to its survival and proliferation. In the largest study of its kind, we sought to identify and exploit the amin...

Authors: Michel Dumontier, Katerina Michalickova and Christopher WV Hogue

Citation: BMC Bioinformatics 2002 3:39

Content type: Research article Published on: 17 December 2002
- View Full Text
- View PDF
ORFer – retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files

Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite of the cloning and recombinant expression ...

Authors: Konrad Büssow, Steve Hoffmann and Volker Sievert

Citation: BMC Bioinformatics 2002 3:40

Content type: Methodology article Published on: 19 December 2002
- View Full Text
- View PDF

How was your experience today?

Rating Please select one rating

Awful

Bad

Good

Great

Thank you for your feedback.

Tell us why (opens in a new tab)

Featured videos

View featured videos from across the BMC-series journals

Articles

Featured videos

Important information

Annual Journal Metrics

Follow

BMC Bioinformatics

Contact us