Articles

Page 231 of 249

Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: Single-sequence prediction algorith...

Authors: Zafer Aydin, Yucel Altunbasak and Mark Borodovsky

Citation: BMC Bioinformatics 2006 7:178

Content type: Research article Published on: 30 March 2006
- View Full Text
- View PDF
Identifying metabolic enzymes with multiple types of association evidence

Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying suc...

Authors: Peter Kharchenko, Lifeng Chen, Yoav Freund, Dennis Vitkup and George M Church

Citation: BMC Bioinformatics 2006 7:177

Content type: Methodology article Published on: 29 March 2006
- View Full Text
- View PDF
The Gaggle: An open-source software system for integrating bioinformatics software and data sources

Systems biologists work with many kinds of data, from many different sources, using a variety of software tools. Each of these tools typically excels at one type of analysis, such as of microarrays, of metabol...

Authors: Paul T Shannon, David J Reiss, Richard Bonneau and Nitin S Baliga

Citation: BMC Bioinformatics 2006 7:176

Content type: Software Published on: 28 March 2006
- View Full Text
- View PDF
LS-NMF: A modified non-negative matrix factorization algorithm utilizing uncertainty estimates

Non-negative matrix factorisation (NMF), a machine learning algorithm, has been applied to the analysis of microarray data. A key feature of NMF is the ability to identify patterns that together explain the da...

Authors: Guoli Wang, Andrew V Kossenkov and Michael F Ochs

Citation: BMC Bioinformatics 2006 7:175

Content type: Methodology article Published on: 28 March 2006
- View Full Text
- View PDF
UVPAR: fast detection of functional shifts in duplicate genes

The imprint of natural selection on gene sequences is often difficult to detect. A plethora of methods have been devised to detect genetic changes due to selective processes. However, many of those methods dep...

Authors: Vicente Arnau, Miguel Gallach, J Ignasi Lucas and Ignacio Marín

Citation: BMC Bioinformatics 2006 7:174

Content type: Software Published on: 28 March 2006
- View Full Text
- View PDF
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change

Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, c...

Authors: Andrew V Uzilov, Joshua M Keegan and David H Mathews

Citation: BMC Bioinformatics 2006 7:173

Content type: Research article Published on: 27 March 2006
- View Full Text
- View PDF
GENOMEMASKER package for designing unique genomic PCR primers

The design of oligonucleotides and PCR primers for studying large genomes is complicated by the redundancy of sequences. The eukaryotic genomes are particularly difficult to study due to abundant repeats. The ...

Authors: Reidar Andreson, Eric Reppo, Lauris Kaplinski and Maido Remm

Citation: BMC Bioinformatics 2006 7:172

Content type: Software Published on: 27 March 2006
- View Full Text
- View PDF
Automatic pathway building in biological association networks

Scientific literature is a source of the most reliable and comprehensive knowledge about molecular interaction networks. Formalization of this knowledge is necessary for computational analysis and is achieved ...

Authors: Anton Yuryev, Zufar Mulyukov, Ekaterina Kotelnikova, Sergei Maslov, Sergei Egorov, Alexander Nikitin, Nikolai Daraselia and Ilya Mazo

Citation: BMC Bioinformatics 2006 7:171

Content type: Methodology article Published on: 24 March 2006
- View Full Text
- View PDF
BioWarehouse: a bioinformatics database warehouse toolkit

This article addresses the problem of interoperation of heterogeneous bioinformatics databases.

Authors: Thomas J Lee, Yannick Pouliot, Valerie Wagner, Priyanka Gupta, David WJ Stringer-Calvert, Jessica D Tenenbaum and Peter D Karp

Citation: BMC Bioinformatics 2006 7:170

Content type: Software Published on: 23 March 2006
- View Full Text
- View PDF
AltTrans: Transcript pattern variants annotated for both alternative splicing and alternative polyadenylation

The three major mechanisms that regulate transcript formation involve the selection of alternative sites for transcription start (TS), splicing, and polyadenylation. Currently there are efforts that collect da...

Authors: Vincent Le Texier, Jean-Jack Riethoven, Vasudev Kumanduri, Chellappa Gopalakrishnan, Fabrice Lopez, Daniel Gautheret and Thangavel Alphonse Thanaraj

Citation: BMC Bioinformatics 2006 7:169

Content type: Database Published on: 23 March 2006
- View Full Text
- View PDF
GEM System: automatic prototyping of cell-wide metabolic pathway models from genomes

Successful realization of a "systems biology" approach to analyzing cells is a grand challenge for our understanding of life. However, current modeling approaches to cell simulation are labor-intensive, manual...

Authors: Kazuharu Arakawa, Yohei Yamada, Kosaku Shinoda, Yoichi Nakayama and Masaru Tomita

Citation: BMC Bioinformatics 2006 7:168

Content type: Research article Published on: 23 March 2006
- View Full Text
- View PDF
Prediction of indirect interactions in proteins

Both direct and indirect interactions determine molecular recognition of ligands by proteins. Indirect interactions can be defined as effects on recognition controlled from distant sites in the proteins, e.g. ...

Authors: Peteris Prusis, Staffan Uhlén, Ramona Petrovska, Maris Lapinsh and Jarl ES Wikberg

Citation: BMC Bioinformatics 2006 7:167

Content type: Research article Published on: 22 March 2006
- View Full Text
- View PDF
SNPs3D: Candidate gene and SNP selection for association studies

The relationship between disease susceptibility and genetic variation is complex, and many different types of data are relevant. We describe a web resource and database that provides and integrates as much inf...

Authors: Peng Yue, Eugene Melamud and John Moult

Citation: BMC Bioinformatics 2006 7:166

Content type: Database Published on: 22 March 2006
- View Full Text
- View PDF
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae

Gene expression and transcription factor (TF) binding data have been used to reveal gene transcriptional regulatory networks. Existing knowledge of gene regulation can be presented using gene connectivity netw...

Authors: Hyunsoo Kim, William Hu and Yuval Kluger

Citation: BMC Bioinformatics 2006 7:165

Content type: Methodology article Published on: 21 March 2006
- View Full Text
- View PDF
Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases

Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to ...

Authors: Georg Schneider, Georg Neuberger, Michael Wildpaner, Sun Tian, Igor Berezovsky and Frank Eisenhaber

Citation: BMC Bioinformatics 2006 7:164

Content type: Methodology article Published on: 21 March 2006
- View Full Text
- View PDF
PPSP: prediction of PK-specific phosphorylation site with Bayesian decision theory

As a reversible and dynamic post-translational modification (PTM) of proteins, phosphorylation plays essential regulatory roles in a broad spectrum of the biological processes. Although many studies have been ...

Authors: Yu Xue, Ao Li, Lirong Wang, Huanqing Feng and Xuebiao Yao

Citation: BMC Bioinformatics 2006 7:163

Content type: Software Published on: 20 March 2006
- View Full Text
- View PDF
An application of statistics to comparative metagenomics

Metagenomics, sequence analyses of genomic DNA isolated directly from the environments, can be used to identify organisms and model community dynamics of a particular ecosystem. Metagenomics also has the poten...

Authors: Beltran Rodriguez-Brito, Forest Rohwer and Robert A Edwards

Citation: BMC Bioinformatics 2006 7:162

Content type: Methodology article Published on: 20 March 2006
- View Full Text
- View PDF
GOPET: A tool for automated predictions of Gene Ontology terms

Vast progress in sequencing projects has called for annotation on a large scale. A Number of methods have been developed to address this challenging task. These methods, however, either apply to specific subse...

Authors: Arunachalam Vinayagam, Coral del Val, Falk Schubert, Roland Eils, Karl-Heinz Glatting, Sándor Suhai and Rainer König

Citation: BMC Bioinformatics 2006 7:161

Content type: Database Published on: 20 March 2006
- View Full Text
- View PDF
More robust detection of motifs in coexpressed genes by using phylogenetic information

Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number...

Authors: Pieter Monsieurs, Gert Thijs, Abeer A Fadda, Sigrid CJ De Keersmaecker, Jozef Vanderleyden, Bart De Moor and Kathleen Marchal

Citation: BMC Bioinformatics 2006 7:160

Content type: Methodology article Published on: 20 March 2006
- View Full Text
- View PDF
Amplification of the Gene Ontology annotation of Affymetrix probe sets

The annotations of Affymetrix DNA microarray probe sets with Gene Ontology terms are carefully selected for correctness. This results in very accurate but incomplete annotations which is not always desirable f...

Authors: Enrique M Muro, Carolina Perez-Iratxeta and Miguel A Andrade-Navarro

Citation: BMC Bioinformatics 2006 7:159

Content type: Methodology article Published on: 20 March 2006
- View Full Text
- View PDF
2DDB – a bioinformatics solution for analysis of quantitative proteomics data

We present 2DDB, a bioinformatics solution for storage, integration and analysis of quantitative proteomics data. As the data complexity and the rate with which it is produced increases in the proteomics field...

Authors: Lars Malmström, György Marko-Varga, Gunilla Westergren-Thorsson, Thomas Laurell and Johan Malmström

Citation: BMC Bioinformatics 2006 7:158

Content type: Software Published on: 20 March 2006
- View Full Text
- View PDF
Modeling Sage data with a truncated gamma-Poisson model

Serial Analysis of Gene Expressions (SAGE) produces gene expression measurements on a discrete scale, due to the finite number of molecules in the sample. This means that part of the variance in SAGE data shou...

Authors: Helene H Thygesen and Aeilko H Zwinderman

Citation: BMC Bioinformatics 2006 7:157

Content type: Research article Published on: 20 March 2006
- View Full Text
- View PDF
Predicting survival outcomes using subsets of significant genes in prognostic marker studies with microarrays

Genetic markers hold great promise for refining our ability to establish precise prognostic prediction for diseases. The development of comprehensive gene expression microarray technology has allowed the selec...

Authors: Shigeyuki Matsui

Citation: BMC Bioinformatics 2006 7:156

Content type: Methodology article Published on: 20 March 2006
- View Full Text
- View PDF
DNPTrapper: an assembly editing tool for finishing and analysis of complex repeat regions

Many genome projects are left unfinished due to complex, repeated regions. Finishing is the most time consuming step in sequencing and current finishing tools are not designed with particular attention to the ...

Authors: Erik Arner, Martti T Tammi, Anh-Nhi Tran, Ellen Kindlund and Bjorn Andersson

Citation: BMC Bioinformatics 2006 7:155

Content type: Software Published on: 20 March 2006
- View Full Text
- View PDF
A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data

As a variety of functional genomic and proteomic techniques become available, there is an increasing need for functional analysis methodologies that integrate heterogeneous data sources.

Authors: Zizhen Yao and Walter L Ruzzo

Citation: BMC Bioinformatics 2006 7(Suppl 1):S11

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Protein Ranking by Semi-Supervised Network Propagation

Biologists regularly search DNA or protein databases for sequences that share an evolutionary or functional relationship with a given query sequence. Traditional search methods, such as BLAST and PSI-BLAST, fo...

Authors: Jason Weston, Rui Kuang, Christina Leslie and William Stafford Noble

Citation: BMC Bioinformatics 2006 7(Suppl 1):S10

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Learning Interpretable SVMs for Biological Sequence Classification

Support Vector Machines (SVMs) – using a variety of string kernels – have been successfully applied to biological sequence classification problems. While SVMs achieve high classification accuracy they lack int...

Authors: Gunnar Rätsch, Sören Sonnenburg and Christin Schäfer

Citation: BMC Bioinformatics 2006 7(Suppl 1):S9

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Discrete profile comparison using information bottleneck

Sequence homologs are an important source of information about proteins. Amino acid profiles, representing the position-specific mutation probabilities found in profiles, are a richer encoding of biological se...

Authors: Sean O'Rourke, Gal Chechik, Robin Friedman and Eleazar Eskin

Citation: BMC Bioinformatics 2006 7(Suppl 1):S8

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context

Elucidating gene regulatory networks is crucial for understanding normal cell physiology and complex pathologic phenotypes. Existing computational methods for the genome-wide "reverse engineering" of such netw...

Authors: Adam A Margolin, Ilya Nemenman, Katia Basso, Chris Wiggins, Gustavo Stolovitzky, Riccardo Dalla Favera and Andrea Califano

Citation: BMC Bioinformatics 2006 7(Suppl 1):S7

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
The Secrets of a Functional Synapse – From a Computational and Experimental Viewpoint

Neuronal communication is tightly regulated in time and in space. The neuronal transmission takes place in the nerve terminal, at a specialized structure called the synapse. Following neuronal activation, an e...

Authors: Michal Linial

Citation: BMC Bioinformatics 2006 7(Suppl 1):S6

Content type: Review Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
A classification-based framework for predicting and analyzing gene regulatory response

We have recently introduced a predictive framework for studying gene transcriptional regulation in simpler organisms using a novel supervised learning algorithm called GeneClass. GeneClass is motivated by the ...

Authors: Anshul Kundaje, Manuel Middendorf, Mihir Shah, Chris H Wiggins, Yoav Freund and Christina Leslie

Citation: BMC Bioinformatics 2006 7(Suppl 1):S5

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Network-based de-noising improves prediction from microarray data

Prediction of human cell response to anti-cancer drugs (compounds) from microarray data is a challenging problem, due to the noise properties of microarrays as well as the high variance of living cell response...

Authors: Tsuyoshi Kato, Yukio Murata, Koh Miura, Kiyoshi Asai, Paul B Horton, Koji Tsuda and Wataru Fujibuchi

Citation: BMC Bioinformatics 2006 7(Suppl 1):S4

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
PepDist: A New Framework for Protein-Peptide Binding Prediction based on Learning Peptide Distance Functions

Many different aspects of cellular signalling, trafficking and targeting mechanisms are mediated by interactions between proteins and peptides. Representative examples are MHC-peptide complexes in the immune s...

Authors: Tomer Hertz and Chen Yanover

Citation: BMC Bioinformatics 2006 7(Suppl 1):S3

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Choosing negative examples for the prediction of protein-protein interactions

The protein-protein interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in the search for no...

Authors: Asa Ben-Hur and William Stafford Noble

Citation: BMC Bioinformatics 2006 7(Suppl 1):S2

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
The Cluster Variation Method for Efficient Linkage Analysis on Extended Pedigrees

Computing exact multipoint LOD scores for extended pedigrees rapidly becomes infeasible as the number of markers and untyped individuals increase. When markers are excluded from the computation, significant po...

Authors: Cornelis A Albers, Martijn AR Leisink and Hilbert J Kappen

Citation: BMC Bioinformatics 2006 7(Suppl 1):S1

Content type: Proceedings Published on: 20 March 2006

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
Empirical validation of the S-Score algorithm in the analysis of gene expression data

Current methods of analyzing Affymetrix GeneChip^® microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially...

Authors: Richard E Kennedy, Kellie J Archer and Michael F Miles

Citation: BMC Bioinformatics 2006 7:154

Content type: Methodology article Published on: 17 March 2006
- View Full Text
- View PDF
Predicting population coverage of T-cell epitope-based diagnostics and vaccines

T cells recognize a complex between a specific major histocompatibility complex (MHC) molecule and a particular pathogen-derived epitope. A given epitope will elicit a response only in individuals that express...

Authors: Huynh-Hoa Bui, John Sidney, Kenny Dinh, Scott Southwood, Mark J Newman and Alessandro Sette

Citation: BMC Bioinformatics 2006 7:153

Content type: Software Published on: 17 March 2006
- View Full Text
- View PDF
Domain-based small molecule binding site annotation

Accurate small molecule binding site information for a protein can facilitate studies in drug docking, drug discovery and function prediction, but small molecule binding site protein sequence annotation is spa...

Authors: Kevin A Snyder, Howard J Feldman, Michel Dumontier, John J Salama and Christopher WV Hogue

Citation: BMC Bioinformatics 2006 7:152

Content type: Database Published on: 17 March 2006
- View Full Text
- View PDF
GOurmet: A tool for quantitative comparison and visualization of gene expression profiles based on gene ontology (GO) distributions

The ever-expanding population of gene expression profiles (EPs) from specified cells and tissues under a variety of experimental conditions is an important but difficult resource for investigators to utilize e...

Authors: Jason M Doherty, Lynn K Carmichael and Jason C Mills

Citation: BMC Bioinformatics 2006 7:151

Content type: Software Published on: 17 March 2006
- View Full Text
- View PDF
Development of an unbiased statistical method for the analysis of unigenic evolution

Unigenic evolution is a powerful genetic strategy involving random mutagenesis of a single gene product to delineate functionally important domains of a protein. This method involves selection of variants of t...

Authors: Colleen D Behrsin, Chris J Brandl, David W Litchfield, Brian H Shilton and Lindi M Wahl

Citation: BMC Bioinformatics 2006 7:150

Content type: Methodology article Published on: 17 March 2006
- View Full Text
- View PDF
CARMA: A platform for analyzing microarray datasets that incorporate replicate measures

The incorporation of statistical models that account for experimental variability provides a necessary framework for the interpretation of microarray data. A robust experimental design coupled with an analysis...

Authors: Kevin A Greer, Matthew R McReynolds, Heddwen L Brooks and James B Hoying

Citation: BMC Bioinformatics 2006 7:149

Content type: Software Published on: 17 March 2006
- View Full Text
- View PDF
Identification of physicochemical selective pressure on protein encoding nucleotide sequences

Statistical methods for identifying positively selected sites in protein coding regions are one of the most commonly used tools in evolutionary bioinformatics. However, they have been limited by not taking the...

Authors: Wendy SW Wong, Raazesh Sainudiin and Rasmus Nielsen

Citation: BMC Bioinformatics 2006 7:148

Content type: Methodology article Published on: 16 March 2006
- View Full Text
- View PDF
Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and util...

Authors: JR Semeiks, A Rizki, MJ Bissell and IS Mian

Citation: BMC Bioinformatics 2006 7:147

Content type: Methodology article Published on: 16 March 2006
- View Full Text
- View PDF
INTEGRATOR: interactive graphical search of large protein interactomes over the Web

The rapid growth of protein interactome data has elevated the necessity and importance of network analysis tools. However, unlike pure text data, network search spaces are of exponential complexity. This poses...

Authors: Aaron N Chang, Jason McDermott, Zachary Frazier, Michal Guerquin and Ram Samudrala

Citation: BMC Bioinformatics 2006 7:146

Content type: Software Published on: 16 March 2006
- View Full Text
- View PDF
Genetic algorithm learning as a robust approach to RNA editing site prediction

RNA editing is one of several post-transcriptional modifications that may contribute to organismal complexity in the face of limited gene complement in a genome. One form, known as C → U editing, appears to exist...

Authors: James Thompson and Shuba Gopal

Citation: BMC Bioinformatics 2006 7:145

Content type: Methodology article Published on: 16 March 2006

The Erratum to this article has been published in BMC Bioinformatics 2006 7:406
- View Full Text
- View PDF
The 3of5 web application for complex and comprehensive pattern matching in protein sequences

The identification of patterns in biological sequences is a key challenge in genome analysis and in proteomics. Frequently such patterns are complex and highly variable, especially in protein sequences. They a...

Authors: Markus Seiler, Alexander Mehrle, Annemarie Poustka and Stefan Wiemann

Citation: BMC Bioinformatics 2006 7:144

Content type: Software Published on: 16 March 2006
- View Full Text
- View PDF
Sigma: multiple alignment of weakly-conserved non-coding DNA sequence

Existing tools for multiple-sequence alignment focus on aligning protein sequence or protein-coding DNA sequence, and are often based on extensions to Needleman-Wunsch-like pairwise alignment methods. We intro...

Authors: Rahul Siddharthan

Citation: BMC Bioinformatics 2006 7:143

Content type: Software Published on: 16 March 2006
- View Full Text
- View PDF
Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models

Horizontal gene transfer (HGT) is considered a strong evolutionary force shaping the content of microbial genomes in a substantial manner. It is the difference in speed enabling the rapid adaptation to changin...

Authors: Stephan Waack, Oliver Keller, Roman Asper, Thomas Brodag, Carsten Damm, Wolfgang Florian Fricke, Katharina Surovcik, Peter Meinicke and Rainer Merkl

Citation: BMC Bioinformatics 2006 7:142

Content type: Software Published on: 16 March 2006
- View Full Text
- View PDF
A computational approach to discovering the functions of bacterial phytochromes by analysis of homolog distributions

Phytochromes are photoreceptors, discovered in plants, that control a wide variety of developmental processes. They have also been found in bacteria and fungi, but for many species their biological role remain...

Authors: Tilman Lamparter

Citation: BMC Bioinformatics 2006 7:141

Content type: Research article Published on: 16 March 2006
- View Full Text
- View PDF
Exploring supervised and unsupervised methods to detect topics in biomedical text

Topic detection is a task that automatically identifies topics (e.g., "biochemistry" and "protein structure") in scientific articles based on information content. Topic detection will benefit many other natura...

Authors: Minsuk Lee, Weiqing Wang and Hong Yu

Citation: BMC Bioinformatics 2006 7:140

Content type: Research article Published on: 16 March 2006
- View Full Text
- View PDF

How was your experience today?

Rating Please select one rating

Awful

Bad

Good

Great

Thank you for your feedback.

Tell us why (opens in a new tab)

Featured videos

View featured videos from across the BMC-series journals

Articles

Featured videos

Important information

Annual Journal Metrics

Follow

BMC Bioinformatics

Contact us