- Methodology article
- Open Access
BIRI: a new approach for automatically discovering and indexing available public bioinformatics resources from the literature
- Guillermo de la Calle†1Email author,
- Miguel García-Remesal†1,
- Stefano Chiesa†1,
- Diana de la Iglesia†1 and
- Victor Maojo†1
© de la Calle et al; licensee BioMed Central Ltd. 2009
- Received: 12 February 2009
- Accepted: 7 October 2009
- Published: 7 October 2009
The rapid evolution of Internet technologies and the collaborative approaches that dominate the field have stimulated the development of numerous bioinformatics resources. To address this new framework, several initiatives have tried to organize these services and resources. In this paper, we present the BioInformatics Resource Inventory (BIRI), a new approach for automatically discovering and indexing available public bioinformatics resources using information extracted from the scientific literature. The index generated can be automatically updated by adding additional manuscripts describing new resources. We have developed web services and applications to test and validate our approach. It has not been designed to replace current indexes but to extend their capabilities with richer functionalities.
We developed a web service to provide a set of high-level query primitives to access the index. The web service can be used by third-party web services or web-based applications. To test the web service, we created a pilot web application to access a preliminary knowledge base of resources. We tested our tool using an initial set of 400 abstracts. Almost 90% of the resources described in the abstracts were correctly classified. More than 500 descriptions of functionalities were extracted.
These experiments suggest the feasibility of our approach for automatically discovering and indexing current and future bioinformatics resources. Given the domain-independent characteristics of this tool, it is currently being applied by the authors in other areas, such as medical nanoinformatics. BIRI is available at http://edelman.dia.fi.upm.es/biri/.
- Target Domain
- Proper Noun
- Bioinformatics Resource
- Biomedical Ontology
- Resource Category
The number of public online bioinformatics resources has grown exponentially over the past few years. Bioinformatics professionals can access and use a large number of resources for their research --including databases, tools and services. Discovering and accessing the appropriate bioinformatics resource for a specific research task has become increasingly important, as suggested in earlier reports .
To address this issue, various significant projects and initiatives have been carried out, leading to several pioneering indexes of bioinformatics resources that are currently available over the Internet. For instance, BioPortal  is a web repository of biomedical ontology resources, developed at the National Center for Biomedical Ontology (NCBO) . This application enables collaborative development of biomedical ontologies. BioPortal includes a service called Open Biomedical Resources (OBR) for annotating and indexing biomedical resources . Resources are annotated using concepts from a domain ontology. The OBR service enables researchers to locate resources by specifying ontology concepts.
Other examples of such indexes include the biomedical database collection compiled by Galperin --a yearly updated web-based list of molecular biology databases sorted in alphabetical order-- or the Bioinformatics Links Directory (BLD) [6, 7] --a catalogue of links to bioinformatics resources, tools and databases classified into eleven major categories-- where resources can be searched using keyword-based queries.
The European Bioinformatics Institute (EBI) provides a searchable index of an alphabetically-sorted inventory of bioinformatics resources . Resources are classified according to the type of service they provide--databases, tools and (web) services. The index includes both internal and external resources.
The BioMoby platform provides an annotated registry of bioinformatics web services enabling other applications to integrate and use such services [9, 10]. There are various major installations of BioMoby, such as, for instance, the PlaNet Consortium , the Australian Centre for Plant Functional Genomics , the Generation Challenge Program of the Consultative Group for International Agricultural Research , Genome Canada , or the Spanish National Institute of Bioinformatics (INB) . They provide access to different bioinformatics resources depending on their own interests and needs.
A consortium composed of the seven US National Centers for Biomedical Computing  has recently developed another index of bioinformatics resources called iTools [17, 18]. Web services are used here to provide access to resources that are annotated according to their functionality. A web-based interface enables researchers to locate the resources they need using advanced search and visual navigation tools. Another initiative for organizing bioinformatics resources is the BIO2RDF system . This system considers the information contained in biological databases, providing uniform access to biological data stored in public databases. Data is converted into RDF format using a common reference ontology. More than 20 publicly available databases --e.g. Entrez Gene, OMIM, GO, OBO, PDB, GeneBank, Prosite, etc. -- have been successfully integrated, including more than 160 million RDF documents.
In this paper we present a tool to automatically create a searchable index of bioinformatics resources from the scientific literature. The resources are annotated with metadata regarding their functionality. Metadata are extracted from abstracts of research papers retrieved from PubMed® and the ISI Web of Knowledge® using pattern-matching techniques. The generated index can be incrementally updated by feeding our tool with abstracts of articles describing new resources. In the evaluation section, we present and analyze some figures that compare the results of our approach with other existing indexes.
We extract relevant information about bioinformatics resources from papers published in high-impact journals. We focus on manuscript abstracts, which tend to condense the key information. We use classic natural language processing techniques--such as, for instance, tokenizers, parsers, transition networks or part-of-speech taggers--to extract this information. These techniques provide the basic framework for extracting data from textual resources . All the techniques we have used are detailed in the following sections. The outcome of this process is an indexed knowledge base including the most relevant information about the resources.
Building the knowledge base
Abstract selection and generation of abstract surrogates
Complete resource information stored in the knowledge base
Functionality 1: analysis of DNA microarray data
dna microarray data
Paper 1: GenePublisher: automated analysis of DNA microarray data.
Knudsen, S; Workman, C; Sicheritz-Ponten, T; Friis, C
GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user.
Once the surrogates have been generated, the titles and abstracts are divided into sentences, using strong punctuation marks as sentence delimiters. Then, each sentence is pre-processed by a lexical analyzer. The analyzer extracts the words of the sentences using blanks and punctuation marks as word delimiters. This produces a sorted set of tokens tagged with lexical annotations. Next, each token is labelled with its corresponding part-of-speech (POS) tag using a probabilistic POS-tagger . Stop words--such as articles, prepositions, etc.--are also filtered during this process. Finally, a stemming activity is performed to reduce each token to its root form, thus returning a set of lexemes tagged with morphological information .
Manual generation of patterns and transition networks
To automatically extract information from the abstracts, a group of five experts with background in different bioinformatics and text mining techniques created a set of linguistic patterns and then analyzed the selected abstracts. The set of linguistic patterns characterizes the type of information to be extracted. We used an initial training set of 100 abstracts of randomly chosen papers from PubMed®, describing bioinformatics resources--e.g. database and software papers published in the BMC Bioinformatics Journal  or application notes from the Bioinformatics Journal . The selected abstracts were analyzed to discover and identify three different types of patterns: i) resource-naming patterns (RNP), ii) functionality patterns (FP), and iii) classification patterns (CP). RNPs aim to automatically extract the resource names together with their corresponding URL. Conversely, FPs aim to extract short textual descriptions of resource functionalities. By contrast, CPs focus on either i) the category of the resource or ii) its target domain.
Full list of BIRI categories and domains
Extracted patterns are applied to the abstracts as follows. First, the abstract is analyzed by the first TN to locate the substrings that match any of the patterns. Then, the relevant information is extracted from the substring. For instance, the input sentence "The FSSP database of structurally aligned protein fold families" matches the RNP ["The" + Proper Noun + "database" + "of"]. In this case, the analyzer identifies "FSSP" as a proper noun. Another example of RNP could be [Proper Noun + ":" + "a"] that would match the sentence "EMBL-Align: a public nucleotide and amino acid multiple sequence database. In this case, "EMBL-Align" would be recognized as a proper noun.
FPs are often more complex to match than RNPs and CPs since their associated patterns require further lexical components for instantiation--e.g. verb tenses, part-of-speech tags, etc. For instance, the input sentence "A novel method for fast and accurate multiple sequence alignment" matches the following pattern ["A" + JJ++ "method" + "for" + (JJ + (CC + JJ)?)? + JJ* + NN++ "."], where JJ represents an adjective, CC identifies a conjunction and NN is a common name. We use special characters of regular expressions to specify the patterns, where '?' represents optionality, '+' one or more occurrences and '*' zero or more occurrences of the component. In the last example, the functionality extracted would be "sequence alignment". During the pattern-matching process, the instantiation of one of the patterns triggers a procedure that extracts and stores the relevant information in the knowledge base.
Once the name and the description of the functionalities for a given resource have been extracted, we run the second TN to perform the classification. The TN is fed with the extracted description(s) of the functionality of the current resource, and it tries to match this description with a resource category and domain. If successfully matched, the resource is labelled with the preferred term associated with the respective category or domain. Note that a resource could be assigned to several categories or domains. For instance, one possible pattern for detecting and extracting a resource category and domain is: [("program" | "tool" | "server") + ("for"|"to") + (JJ* + (NN | NNS))? + "^annot" + ("of" + JJ* + (NN | NNS)+)? + "."]. Applying this pattern to the sentence "AMIGene: a tool for annotation of microbial genes", the category we get is "annotation" and the resource domain is "genes".
Additionally, any extra information--such as resource inputs and outputs--can also be extracted by the second TN when available. We refer to the type of data received by the resources when they are invoked as 'inputs' and to the type of data they return as a result as 'outputs'. For instance, applying the pattern ["discovery" + "of" + (JJ + CC?)* + (NN | NNS)++ ("within" | "in") + JJ* + (NN | NNS)++ "."] to the sentence "We present here a new tool for discovery of unstructured or disordered regions within proteins.", "proteins" would be the resource input data.
Once all relevant information contained in the abstracts has been extracted, a team of experts reviews the contents of the knowledge base to assess its correctness. The curation process mainly focuses on the detection of categories and domains incorrectly assigned to specific resources. Then, the experts compare the previously extracted functionality with the assigned categories and domains. If any errors are detected, the inaccurate entries are removed from the BIRI knowledge base.
Description of the knowledge base
Once the information has been filtered, it is stored in a knowledge base. For each discovered resource, we record the following data: i) the name of the resource, ii) its corresponding URL, iii) a set of natural language descriptions of its functionalities, iv) the resource's assigned categories, v) its target domain(s) and vi) the resource's inputs and outputs when available. Using the same example as above, this information could be extended and used to automatically orchestrate workflows involving multiple resources. The knowledge base also stores data about the original papers, including the paper's title, author(s), abstract and PubMed® and ISI Web of Knowledge® identifiers.
We implemented the data extraction tool and the web services layer (WSL) providing the query primitives to access the knowledge base using the Java programming language and associated technologies . This includes the Java Web Services Developer Pack (JWSDP) .
The system architecture includes a web services query layer developed to cover the contents of the knowledge base. To test the functionality of the developed WSL, we created a pilot web application on top of the WSL to provide users with a searchable web index of bioinformatics resources, by using popular web browsers including Internet Explorer®, Mozilla Firefox® and Safari®.
A use case
Let us suppose that a researcher wants to identify resources for analyzing information from microarrays. The web application will show the existing categories and domains in two combo-boxes. These categories and domains are dynamically loaded using different web services. When the user selects a category, the domain combo-box is automatically updated to show the domains associated with the selected category only. Similarly, when the user selects a domain; the category combo-box is updated with the appropriate categories. For instance, the researcher may select "analysis" from the combo-box category. Then, the combo-box domain will be automatically updated with all the available domains for the chosen category. It includes an extra option for selecting "All" domains. The researcher will select the "microarray" domain from that list and then click on the Search button. The category and domain selection order is not mandatory. The user could select the domain first and the category afterwards.
After the user clicks the Search button, the web application displays a table with information about the resources contained in the knowledge base that meet the user-imposed restrictions. In the above example, the table contains eleven entries, as shown in Figure 4. Each entry contains the name of and a link to the resource, its functionality, category and domain. Further details on a given resource can be retrieved by clicking on the resource name link.
The web application has been tested with multiple queries selecting different combinations of categories/domains. Users could also search for a resource by its name. Searches by name are case-insensitive. The results would be similar but restricted to the specified resource.
Another interesting feature of the application is to incrementally update the index with new resources. This is achieved by entering the respective research papers in the update module. This module verifies the new papers added to prevent double entries. Papers that have been previously processed by the system are passed up, and the others are applied as described above. Users can also contribute by suggesting the inclusion of additional tools/abstracts by sending an email to the contact address provided on the BIRI web page.
Regarding the functionality extraction process, the first TN discovered 505 descriptions of functionalities. They were assigned to the 333 identified resources. Note that a single resource may be assigned to one or more functionalities. As Figure 5c shows, a high percentage of the extracted functionalities (88%) provided complete and coherent descriptions. Conversely, 10% were incomplete descriptions that still provided valuable information for automatically extracting their associated category and domain. The remaining 2% matched up with incorrect or incoherent descriptions. The coherence of the extracted functionalities was manually assessed to verify the correctness of the extraction method.
Samples of short-form knowledge base entries
Exploration of relationships between RNA sequences
Server for structure comparison and structure similarity searching
Server for structure comparison and structure similarity searching
Protein 3D structure comparison
Accumulation structural employs progressive alignment algorithm 3D alignments useful tools for insights into protein 3D structures
Analysis of DNA microarray data
Analysis of DNA microarray data
These entries present the extracted information for four different bioinformatics resources. The first column shows the name of the extracted tools--i.e. "CrossLink", "FATCAT", "MATRAS" and "GenePublisher"--whereas the second column provides a short description of their functionality. The third column shows the category in which the corresponding resource has been classified. This classification is further refined in the fourth column, which lists the specific application domain for this resource. For instance, as shown in the Table 3, the "CrossLink" resource has been classified under the category "Exploration" and targets the "RNA" domain. Therefore, "CrossLink" can be regarded as a tool to explore RNA sequences. Also, additional information can be retrieved for each resource. Table 1 shows an example of all the information BIRI provides about any resource. This information has been automatically extracted from the scientific papers using the proposed method. It contains the resource name, functionalities, categories, domains, inputs, outputs and paper from which the information has been extracted. Links to the official web site of the resource and to the whole paper --in PubMed®-- are provided if they are available in the abstract.
Comparison of existing resource indexes
Advanced Search Capabilities
Annotation of Resources
Online Bioinformatics Resource Collection
ExPASy Life Science Directory
Molecular Biology Database
Database of Databases
Resources at the EBI
Evaluation of the BIRI contents against other indexes
Total Resources Indexed
New in BIRI
Online Bioinformatics Resource Collection
ExPASy Life Science Directory
Molecular Biology Database
Database of Databases
Resources at the EBI
Based on pattern matching methods, our method can automatically create a knowledge base of bioinformatics resources by i) detecting resource names, ii) extracting short descriptions of functionalities and iii) classifying the extracted artefacts according to a list of categories and domains of bioinformatics resources, which extends the BLD classification  on which our list is based. We believe that creating a standardized taxonomy or ontology of bioinformatics resources is a crucial task to facilitate collaborative and integrative approaches .
For the methodology proposed in this paper to work, the bioinformatics resources have to have been previously published and indexed in PubMed® or the ISI Web of Knowledge®. Otherwise, the resources would never be found using this method. In contrast, our approach guarantees that only high-quality resources are indexed. Once these resources have passed a peer-review process, confidence in their quality can be actually higher.
Table 4 compares our approach with the most relevant indexes available at the time of writing this paper --and considering that BIRI is a prototype, currently being expanded. As shown in Table 5, from an original set of over 400 papers, the system automatically discovered more than 230 resources that also appear in BLD or the Online Bioinformatics Resource Collection. Another interesting fact pointed out by Table 5 is that BIRI contains several resources not included in other indexes. The number of new resources in BIRI ranges from 81 (when compared to BLD) to 306 (when compared to Pathguide). This happens since existing indexes are often updated only considering manuscripts published in a reduced set of journals. This limited vision hinders the creation and maintenance of an exhaustive list of resources. Conversely, our methodology is not centered in any particular journal. It follows a more general approach using PubMed® or the ISI Web of Knowledge® as information sources. This provides us with a broader vision of recent developments. Once new manuscripts are available in PubMed® or the ISI Web of Knowledge®, our system can be updated with the new resources.
Our approach extends most of the search capabilities provided by other existing tools. In addition, the index is automatically generated and updated in our approach. The update process is a very time-consuming and tedious task that is usually performed manually by groups of experts. Applying the same methodology detailed above, the contents of our index and knowledge base can be automatically updated by just entering new manuscripts and abstracts into the tool.
Our index provides users with advanced search capabilities. It can, for instance, perform complex searches, such as searching for resources matching a definite category and/or target domain. As stated previously, our knowledge base is built automatically using pattern-matching techniques, whereas other indexes are created, maintained and updated manually. Besides, our knowledge base can be automatically updated with new resources simply by feeding the developed information extraction tool with manuscripts describing recently developed tools, databases and services.
Considering other existing indexes, our knowledge base provides additional information, such as, for instance, the target domains where the different types of resources can be applied or the resource inputs and outputs. Using the resources' names and extending the information about inputs and outputs, BIRI could be useful for automatically orchestrating workflows like applications combining several resources. An example of resource combination through workflow definition is described in a previous work carried out by the authors, where multiple databases are queried to retrieve information regarding the proteins involved in a genetic disease . The results provided by a database are used to build the query for the next database.
Additionally, text-mining based methods for information extraction reported that they could benefit from manuscripts with more structured abstracts .
Our tool automatically extracts and organizes relevant information about bioinformatics resources from research papers describing the resources. Our method, based on a domain-independent approach, can be used to create inventories targeting different scientific fields [32, 33]. For instance, this approach is currently being applied in the European Commission-funded Action-GRID project, coordinated by the authors . In one of the workpackages of this project, we are creating an inventory of bioinformatics and nanomedical resources that is intended to help researchers in these areas .
Several initiatives have been carried out aimed at cataloguing the existing bioinformatics resources. Although our tool could work as a standalone application, it has not been designed for this purpose. Our tool is intended not as a replacement for but an add-on to existing indexes and applications. We are working on integrating our tool as a plug-in for other consolidated applications, such as BioPortal or BioMoby. Additionally, tools for defining workflows, such as TAVERNA , could also benefit from the information provided by an extended version of our index.
The inventory of resources could also be collaboratively updated by other external contributing researchers. This collaborative approach is being successfully applied in other fields, such as, for instance, developing biomedical ontologies at the NCBO . At the NCBO, different collaborative tools--developed using Web 2.0 techniques--are available for developing biomedical ontologies. Researchers can contribute by entering new information or comments about the existing information.
The present work has been funded by the Spanish Ministry of Science and Innovation (OntoMineBase project, reference TSI2006-13021-C02-01, and COMBIOMED- RETICS), the European Commission through the ACGT integrated project (FP6-2005-IST-026996), the ACTION-Grid support action (FP7-ICT-2007-2-224176) and the Comunidad de Madrid, Spain.
- Cannata N, Merelli E, Altman RB: Time to Organize the Bioinformatics Resourceome. PLoS Comput Biol 2005, 1(7):e76. 10.1371/journal.pcbi.0010076PubMed CentralView ArticlePubMedGoogle Scholar
- Musen M, Shah N, Noy N, Dai B, Dorf M, Griffith N, Buntrock JD, Jonquet C, Montegut MJ, Rubin DL: BioPortal: Ontologies and Data Resources with the Click of a Mouse. AMIA Annual Symposium Proceedings 2008, 1223–1224.Google Scholar
- The National Center for Biomedical Ontology[http://www.bioontology.org/]
- Jonquet C, Musen MA, Shah N: A System for Ontology-Based Annotation of Biomedical Data. Proceedings of the International Workshop on Data Integration in The Life Sciences 2008, DILS'08: 144–152. full_textView ArticleGoogle Scholar
- Galperin MY: The Molecular Biology Database Collection: 2008 Update. Nucleic Acids Research 2007, (36 Database):D2-D4. 10.1093/nar/gkm1037Google Scholar
- Bioinformatics Links Directory[http://bioinformatics.ca/links_directory/]
- Brazas MD, Fox JA, Brown T, McMillan S, Ouellette BF: Keeping Pace with the Data: 2008 Update on the Bioinformatics Links Directory. Nucleic Acids Research 2008, (36 Web Server):W2-W4. 10.1093/nar/gkn399Google Scholar
- European Bioinformatics Institute Services Index[http://www.ebi.ac.uk/services/]
- Wilkinson MD, Links M: BioMOBY: an Open Source Biological Web Services Proposal. Brief Bioinform 2002, 3(4):331–341. 10.1093/bib/3.4.331View ArticlePubMedGoogle Scholar
- PlaNet. A Network of European Plant Database[http://mips.gsf.de/projects/plants/PlaNetPortal]
- Australian Centre for Plant Functional Genomics[http://www.acpfg.com.au]
- Generation Challenge Programme[http://www.generationcp.org]
- Genome Canada[http://genomecanada.ca]
- Instituto Nacional de Bioinformática[http://www.inab.org]
- National Centers for Biomedical Computing[http://www.ncbcs.org]
- iTools Home Page[http://cms.loni.ucla.edu/iTools/]
- Dinov ID, Rubin D, Lorensen W, Dugan J, Ma J, Murphy S, Kirschner B, Bug W, Sherman M, Floratos A, Kennedy D, Jagadish HV, Schmidt J, Athey B, Califano A, Musen M, Altman R, Kikinis R, Kohane I, Delp S, Parker DS, Toga AW: iTools: a Framework for Classification, Categorization and Integration of Computational Biology Resources. PLoS ONE 2008, 3(5):e2265. 10.1371/journal.pone.0002265PubMed CentralView ArticlePubMedGoogle Scholar
- Belleau F, Nolin MA, Tourigny N, Rigault P, Morissette J: Bio2RDF: Towards a Mashup to Build Bioinformatics Knowledge Systems. Journal of Biomedical Informatics 2008, 41(5):706–716. 10.1016/j.jbi.2008.03.004View ArticlePubMedGoogle Scholar
- PubMed Home[http://www.ncbi.nlm.nih.gov/pubmed/]
- ISI Web of Knowledge[http://www.isiwebofknowledge.com/]
- Krallinger M, Valencia A: Text-mining and Information-retrieval Services for Molecular Biology. Genome Biol 2005, 6(7):224. 10.1186/gb-2005-6-7-224PubMed CentralView ArticlePubMedGoogle Scholar
- Tufi sD, Mason O: Tagging Romanian Texts: a Case Study for QTAG, a Language Independent Probabilistic Tagger. Proceedings of the First International Conference on Language Resource & Evaluation (LREC98): 28–30 May 1998; Granada (Spain) 1998, 1: 589–596.Google Scholar
- Porter MF: An algorithm for suffix stripping. Program 1997, 14(3):313–316.Google Scholar
- BMC Bioinformatics[http://www.biomedcentral.com/bmcbioinformatics/]
- Oxford Journals, Life Sciences, Bioinformatics[http://bioinformatics.oxfordjournals.org/]
- Woods WA: Transition Network Grammars for Natural Language Analysis. Commun ACM 1970, 13(10):591–606.View ArticleGoogle Scholar
- Developer Resource for Java Technology[http://java.sun.com/]
- Java Web Services at a Glance[http://java.sun.com/webservices/]
- García-Remesal M: Using Hierarchical Task Network Planning Techniques to Create Custom Web Search Services over Multiple Biomedical Databases. Proceedings of the 12th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES): 3–5 September 2008; Zagreb (Croatia) 2008, (2):42–49.Google Scholar
- Gerstein M, Seringhaus M, Fields S: Structured Digital Abstract Makes Text Mining Easy. Nature 2007, 447(7141):142.View ArticlePubMedGoogle Scholar
- De la Calle G, Garcia-Remesal M, Maojo V: A Method for Indexing Biomedical Resources over the Internet. Stud Health Technol Inform 2007, 136: 163–168.Google Scholar
- García-Remesal M, Maojo V, Crespo J, Billhardt H: Logical Schema Acquisition from Text-Based Sources for Structured and Non-Structured Biomedical Sources Integration. AMIA Annual Symposium Proceedings: 10 - 14 November 2007; Chicago (USA) 2007, 259–263.Google Scholar
- ACTION-Grid Project[http://www.action-grid.eu/]
- Chiesa S, Garcia-Remesal M, de la Calle G, de la Iglesia D, Bankauskaite V, Maojo V: Building an Index of Nanomedical Resources: an Automatic Approach Based on Text Mining. Proceedings of the 12th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES): 3–5 September 2008; Zagreb (Croatia) 2008, (2):50–57.Google Scholar
- Hull D, Wolstencroft K, Stevens R, Goble C, Pocock MR, Li P, Oinn T: Taverna: a tool for building and running workflows of services. Nucleic Acids Res 2006, (34 Web Server):W729–732.Google Scholar
- Bader GD, Cary MP, Sander C: Pathguide: a Pathway Resource List. Nucleic Acids Research 2006, (34 Database):D504-D506.Google Scholar
- Chen YB, Chattopadhyay A, Bergen P, Gadd C, Tannery N: The Online Bioinformatics Resources Collection at the University of Pittsburgh Health Sciences Library System-a one-stop gateway to online bioinformatics databases and software tools. Nucleic Acids Research 2007, (35 Database):D780-D785.Google Scholar
- ExPASy Life Science Directory[http://expasy.org/links.html]
- Babu PA, Udyama J, Kumar RK, Boddepalli R, Mangala DS, Rao GN: DoD2007: 1082 Molecular Biology Databases. Bioinformation 2007, 2(2):64–67.PubMed CentralView ArticlePubMedGoogle Scholar
- Stevens RD, Robinson AJ, Goble CA: myGrid: Personalised Bioinformatics on the Information Grid. Bioinformatics 2003, 19(Suppl 1):i302-i304.View ArticlePubMedGoogle Scholar
- Lord P, Alper P, Wroe C, Goble C: Feta: a Light-Weight Architecture for User Oriented Semantic Service Discovery. In Proceedings of the Second European Semantic Web Conference (ESWC): 29 May - 1 June 2005; Heraklion (Greece). Volume 3532. Springer Berlin/Heidelberg; 2005:17–31.Google Scholar
- Wolstencroft K, Oinn T, Goble C, Ferris J, Wroe C, Lord P, Glover K, Stevens R: Panoply of Utilities in Taverna. In Proceedings of the First International Conference on e-Science and Grid Computing (E-SCIENCE): 5 - 8 December 2005; Washington (USA). IEEE Computer Society; 2005:156–162.View ArticleGoogle Scholar
- Saltz J, Oster S, Hastings S, Langella S, Kurc T, Sanchez W, Kher M, Manisundaram A, Shanbhag K, Covitz P: caGrid: Design and Implementation of the Core Architecture of the Cancer Biomedical Informatics Grid. Bioinformatics 2006, 22(15):1910–1916.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.