BBP: Brucella genome annotation with literature mining and curation
© Xiang et al; licensee BioMed Central Ltd. 2006
Received: 18 May 2006
Accepted: 16 July 2006
Published: 16 July 2006
Brucella species are Gram-negative, facultative intracellular bacteria that cause brucellosis in humans and animals. Sequences of four Brucella genomes have been published, and various Brucella gene and genome data and analysis resources exist. A web gateway to integrate these resources will greatly facilitate Brucella research. Brucella genome data in current databases is largely derived from computational analysis without experimental validation typically found in peer-reviewed publications. It is partially due to the lack of a literature mining and curation system able to efficiently incorporate the large amount of literature data into genome annotation. It is further hypothesized that literature-based Brucella gene annotation would increase understanding of complicated Brucella pathogenesis mechanisms.
The Brucella Bioinformatics Portal (BBP) is developed to integrate existing Brucella genome data and analysis tools with literature mining and curation. The BBP InterBru database and Brucella Genome Browser allow users to search and analyze genes of 4 currently available Brucella genomes and link to more than 20 existing databases and analysis programs. Brucella literature publications in PubMed are extracted and can be searched by a TextPresso-powered natural language processing method, a MeSH browser, a keywords search, and an automatic literature update service. To efficiently annotate Brucella genes using the large amount of literature publications, a literature mining and curation system coined Limix is developed to integrate computational literature mining methods with a PubSearch-powered manual curation and management system. The Limix system is used to quickly find and confirm 107 Brucella gene mutations including 75 genes shown to be essential for Brucella virulence. The 75 genes are further clustered using COG. In addition, 62 Brucella genetic interactions are extracted from literature publications. These results make possible more comprehensive investigation of Brucella pathogenesis. Other BBP features include publication email alert service, Brucella researchers' contact database, and discussion forum.
BBP is a gateway for Brucella researchers to search, analyze, and curate Brucella genome data originated from public databases and literature. Brucella gene mutations and genetic interactions are annotated using Limix leading to better understanding of Brucella pathogenesis.
Brucella is a Gram-negative, facultative intracellular coccobacillus which causes brucellosis in humans and animals . Brucella are taxonomically placed in the alpha-2 subdivision of the class Proteobacteria. Traditionally there are six species of Brucella based on the preferential host specificity: B. melitensis (goats), B. abortus (cattle), B. suis (swine), B. canis (dogs), B. ovis (sheep) and B. neotomae (desert mice); two new species B. cetaceae (cetacean) and B. pinnipediae (seal) have recently been discovered . The first four species are pathogenic to humans in decreasing order of severity making brucellosis a zoonotic disease. These Brucella species have been identified as priority agents amenable for use in biological warfare and bio-terrorism and listed as CDC/NIAID category B priority pathogens.
Complete genome sequences of 4 Brucella strains are currently available [3–6]. A typical Brucella genome usually has two circular chromosomes of approximately 2.1 MB and 1.2 MB. There are approximately 3,200 – 3,400 genes in each genome. The DNA sequences of different Brucella spp. share greater than 90% identity [4, 6, 7]. Genome sequences and annotated data are publicly available from existing databases such as RefSeq , Swissprot , and the TIGR Comprehensive Microbial Resource (CMR) . These databases come from different sources and have different focuses. Different data visualization and analysis tools are also available in these database systems and other genome analysis systems. A web portal that integrates these data and analysis resources will greatly help Brucella gene research.
Brucella genome data in current databases is largely derived from computational analysis without literature support. It is partially due to the lack of a literature mining and curation system. The large amount of literature data can be used to not only validate the data obtained from computational analysis but also provide new insights not available from computational analysis. Literature mining techniques are being developed rapidly in the context of the genomic fields [11, 12]. For example, Hu et al.,  describe a rule-based system, RLIMS-P, for literature mining and database annotation of protein phosphorylation from MEDLINE abstracts. Stephens et al.,  present an association and function discovery method to extract gene-gene interactions from co-occurring genes in MEDLINE abstracts. Hoffmann et al.  list more than 20 main text mining repositories and systems that are currently available. Compared to basic keyword search, many effective literature retrieval programs connect textual evidence to ontologies as main repository of formally represented knowledge. Ontologies are conceptual models that support consistent and unambiguous knowledge sharing and provide a framework for knowledge integration. TextPresso is a natural language processing (NLP) and ontology-based literature search engine with significant efficiency in biomedical literature retrieval . Since computational literature mining techniques (e.g., TextPresso) still cannot guarantee precise retrieval, time consuming manual literature curation is required to obtain accurate results for database storage. It is possible for manual curation and computational text mining to work together for rapid retrieval and analysis of facts with standardization of the extracted information . The PubSearch literature curation software is a literature curation management system with a powerful manual curation capability . Our strategy of integrating different computational text mining tools including a TextPresso-powered program with a PubSearch-powered manual curation system has led to the development of a literature mining and curation system coined "Limix" that is currently applied to Brucella genome annotation.
The brucellae infect phagocytic macrophages and nonphagocytic epithelial cells (e.g., HeLa cells) in vivo and in vitro [18–20]. Brucella virulence relies on its ability of intracellular survival and replication. It is still unclear how many Brucella genes are essential for intracellular virulence and how virulent Brucella genes interact. It is hypothesized that mechanisms of Brucella pathogenesis can be better understood by systematically annotating Brucella gene mutations and genetic networks from all Brucella literature papers.
We have developed the Brucella Bioinformatics Portal (BBP) with focus on integrating Brucella genome data and analysis tools from existing resources and annotating Brucella genes and gene-gene interactions from literature publications. The updated information allows more comprehensive examination of Brucella pathogenesis. These genome annotation systems, together with other programs including publication email alert, Brucella researchers' contact database, and discussion forum, makes BBP an ideal bioinformatics portal for the Brucella research community. The BBP website is publicly available [see Additional file 1].
Results and discussion
Public databases and software programs linked or used in BBP. Unique database identifiers (e.g., RefSeq ID) are usually stored for linking to public database web pages. Brucella literature abstracts and full text PDF files are also extracted from PubMed. Software programs are integrated into BBP in different ways.
Databases and analysis programs
Medical Subject Headings
clusters of orthologous groups
Brucella taxonomy information
3D structure DB
3D structures (typically of related proteins)
Protein coding genes, functional and structural RNAs,
EBI & SIB
Annotated protein data
protein families, domains and functions
protein families and domains
codon usage, EC numbers, condensed genome display, role category graph, GO terms, gene attribute, Role category gene list, RNAs, terminators
protein domains and families
protein domain families
Software programs integrated
Blastn, blastp, blastx, tblastn, tblastx, PSI/PHI Blast, Mega Blast, Blast 2 sequences
genome browse and analysis
NLP text mining
literature curation management
Brucella genome data query, browsing, and analysis
The following is a typical scenario when a Brucella researcher searches for more information about B. abortus sodC gene encoding Cu/Zn superoxide dismutase (SOD). The user starts with querying "sodC" gene in InterBru (Figure 2A). Four Brucella sodC genes from 4 Brucella genomes will be found, including one from B. abortus strain 2308 and one from B. abortus strain 9–941. The detail information about the sodC gene in strain 2308 is shown in the detailed Brucella gene information page (Figure 2C). This page includes basic gene information and through unique database identifiers links to many public databases, such as RefSeq, GenBank, Swissprot, InterPro, and PubMed. This page also contains sodC-specific gene annotation and genetic interaction data curated by the BBP team from literature using the Brucella Limix system. A link to the Brucella Limix is also available for users to annotate sodC gene. A direct link to PubMed allows users to access all Brucella sodC-related publications. Both DNA and protein sequences are provided with additional links to internal BLAST search services (regular Blast, Psi/Phi Blast, and Mega Blast) where different Brucella nucleotide and protein sequence libraries have been created for convenient use. For example, a simple Blastn search indicates that the sodC DNA sequence in B. abortus strain 2308 is 100% identical to that in B. abortus strain 9–941 but 99% identical to that in B. melitensis strain 16 M and B. suis strain 1330. The protein sequences in the four genomes are 100% identical to each other. The user is also directed to the BGBrowser to inspect the genes next to sodC in the genome, annotate restriction sites, or perform other analyses (Figure 2B). To get more information, the user can submit questions in the BBP discussion Forum or email to the Brucella listserv.
Brucella literature search
Four computational literature search methods have been developed to search Brucella literature: TextPresso for Brucella, MeSH browser, keyword search, and automatic Brucella publication update.
Textpresso is an information retrieval system available from the Generic Software Components for Model Organism Databases (GMOD) . It splits papers into sentences and further to XML-tagged words or phrases, which are classified using categories of ontology. The specifically designed ontology can be used to query information on specific classes of biological concepts (e.g., gene, mutant) and their relationships (e.g., association, regulation). It has been used in WormBase  and many other projects . We have adopted and extended TextPresso for Brucella literature text mining. Currently it stores abstract information of 3930 Brucella publications. Among them 1083 papers have full-text contents. While it takes approximately 24 hours for TextPresso to preprocess these 3930 PubMed abstracts and 1083 full text PDF files in our server, the online query process is fast (~0.5 sec/query).
A user can also search the locally built Brucella literature database by keywords such as author, journal, year, issue, and abstract. Although the Brucella literature database is updated periodically, it may miss the newest Brucella literature publications. In order to capture this portion of the literature, a BBP internal program has been developed to automatically extract the newly published Brucella papers from PubMed.
Brucella literature mining and curation system (Limix)
Literature-curated Brucella gene mutations and pathogenesis
Clustering of 75 attenuated Brucella genes found from literature search using the COG classification method.
Mutated Brucella genes
C: Energy production and conversion
4: cydB, eryB, narG, pyc
D: Cell cycle control, mitosis and meiosis
E: Amino acid transport and metabolism
15: aroC, aspC, gcvP, gcvT, glnA, glyA, hisC, hisD, hisF, leuA, leuC, lysA, pepN, pheA, serB
F: Nucleotide transport and metabolism
10: dut, purD, purE, purF, purH, purl, purM, purN, pyrB, pyrD
G: Carbohydrate transport and metabolism
7: eryC, gnd, ilvD, pgi, pgm, ugpA, zwf
H: Coenzyme transport and metabolism
3: cobB, hemH, ilvC
I: Lipid transport and metabolism
2: bacA, uppS
3: pth, rplS, rpsA
3: miaA, oxyR, rpoA
L: Replication, recombination and repair
2: recA, xseA
M: Cell wall/membrane biogenesis
6: gmd, omp10, omp19, omp25, virB1, wboA
O: Posttranslational modification, protein turnover, chaperones
5: cydD, degP, dnaK, lon, tig
P: Inorganic ion transport and metabolism
2: sodC, znuA
Q: Secondary metabolites biosynthesis, transport and catabolism
R: General function prediction only
2: gltD, hfq
T: Signal transduction mechanisms
4: bvrR, bvrS, ntrC, ntrY
U: Intracellular trafficking and secretion
9: fliF, virB2, virB10, virB11, virB3, virB4, virB6, virB8, virB9
-: Not in COGs
6: cowB, glnD, gluP, gpt, norM, wbkB
Literature-curated Brucella genetic interactions and pathogenesis
TextPresso-predicted and manually curated Brucella genetic interactions. One match means one highlighted sentence containing at least 2 genes and at least one word under "association" or "regulation" category. Each match represents for one predicted genetic interaction. The results are shown by manually verified vs. TextPresso-predicted interactions. The number of verified vs. predicted interactions varies depending on the numbers (#) of matches and papers to use as the cutoffs and whether or not to use full text contents besides paper abstracts.
Full text (containing abstract)
# of papers
# of matches
# of papers
# of matches
Limix also allows curators to add Brucella genetic interactions that are not detected by the TextPresso-based text mining approach. Currently 62 genetic interactions are available in the Limix databases. There are 48 genes involved in these interactions, and 28 of them are shared with the attenuated Brucella gene mutation list as discussed above. The finding of these genetic interactions has provided more comprehensive investigation of Brucella pathogenesis. For example, it not only confirms the importance of type IV secretion system and the BvrR-BvrS two-component regulatory system in Brucella pathogenesis but also provides specific pathway details. Furthermore, our curation results indicate that the secretion of the N-terminal fragment of BvrR fused to a CAT report gene is diminished in virB1 and virB10 mutants, suggesting that BvrR is probably an effector protein secreted by the VirB type IV secretion system . Another interesting observation is the interactions among sodC, hfq, and ctrA. B. abortus host factor 1 (HF-1) protein encoded by hfq contributes to stress resistance during stationary phase and is a major determinant of virulence in mice . Bacterial sodC genes are typically regulated in a growth-phase-dependent manner, and their expression is usually maximal during stationary phase. B. abortus hfq gene mutation results in greatly reduced sodC expression . CtrA is a master response regulator that is essential for viability and is transcriptionally autoregulated. The hfq gene is likely to be negatively regulated by CtrA . These two interactions suggest that CtrA may also regulate Brucella sodC expression.
Other portal features: ContactsDB, Forum, and publication Email alert service
BBP is designed to link international Brucella scientists and researchers. BBP contains a ContactsDB database that currently provides contact information for more than 100 Brucella researchers in the world. The ContactsDB can be queried based on first name, last name, address, city, institute, state, zip code, and country. Any Brucella researcher can also enter new contact information or update existing information using an interactive web page. The BBP discussion Forum has been created to facilitate discussion between scientists. Only registered BBP members can initiate a topic, reply to a message, or edit their own messages. Unregistered users can view all discussions. Up to now more than 50 Brucella researchers from 18 different countries have registered in BBP. Another BBP feature is the Publication Email Alert Service. This service automatically notifies users of newly published papers within a user-defined time interval. Those users who have not registered for this service can view new publications by visiting our automatic new Brucella paper updating website.
Many different databases related to Brucella genomes and genes exist. A variety of computational tools are also available for functional genomic analysis. The Brucella Bioinformatics Portal is a gateway to provide or link functional Brucella gene information and analysis tools useful for the Brucella researchers. Besides summarizing Brucella genomics related databases and analysis tools in HTML formats, we have also developed the InterBru database and the Brucella Genome Browser (BGBrowser). InterBru allows users to search for specific Brucella gene information and provide links to existing databases. BGBrower provides graphic visualization and analysis tools. Since most of current Brucella genes and gene-gene interaction data are derived from computational analysis and often lack literature support, we further developed several computational Brucella literature search tools for efficient retrieval of Brucella articles. The Brucella Limix system is also developed to allow retrieved data from text mining tools to be directly copied, edited and submitted to a backend relational database. The Brucella Limix system has been used to annotate a large number of Brucella genes and to find 62 Brucella genetic interactions and 75 attenuated gene mutations from literature publications in PubMed. These annotated results provide more comprehensive understanding of Brucella pathogenesis. These programs, together with other portal features including the ContactsDB and Forum, facilitate the Brucella research community to obtain and annotate Brucella genome sequences in one website. BBP is the first integrated system for Brucella genome analysis.
BBP adopts and extends many open-source software programs for Brucella genome annotation including three GMOD open-source software programs, GBrowse, TextPresso, and PubSearch (Table 1). Many interactive graphical interfaces (e.g., MeSH browser and genetic interaction map) have also been developed for efficient literature mining and database curation. While many NLP-based text mining tools (e.g., TextPresso) significantly improve the capability of biomedical text mining, an automatic literature retrieval tool that can be as accurate as manual literature curation still does not exist . As far as we know, among existing web-based dedicated genome databases, BBP is the first to strongly integrate a literature manual curation and management system (e.g., PubSearch) with NLP-based computational literature mining techniques (e.g., TextPresso for Brucella) into an efficient literature mining and curation system (Limix). The BBP Limix system also provides a genetic solution for annotating other genomes and genes based on published literature data.
Server and programming tools
This BBP system is built on two Dell Poweredge 2580 servers, one serving as database server and another as application server. Both servers are running the Redhat Linux operating system (Redhat Enterprise Linux ES 4). The database server is powered by Oracle 10g database management system. Two open source software programs, Apache HTTP Server and Apache Tomcat, are installed as the HTTP application server and the servlet container respectively. Different programming languages including PHP, Perl, and Java are implemented for development of a variety of BBP modules. The two servers also back each other regularly to secure the data.
InterBru and BGBrowser
InterBru is a web-based relational database system that contains various Brucella data and links to public databases. The protein MW and PI are calculated from the protein sequences using the modules (Bio::Tools::pICalculator and Bio::Tools::SeqStats) from Bioperl . The InterBru data can be searched by different features and sorted for proper display (Figure 2A). The Brucella Genome Browser (BGBrowser) (Figure 2B) is developed based on the GBrowse , one of the GMOD software programs . In order to speed up the query process, all Brucella sequence and annotation information for BGBrowser are stored in the database server instead of flat files. Both InterBru and BGBrowser share the same output page of detailed gene information (Figure 2C).
The Blast module in BBP uses the latest web server version of BLAST obtained from NCBI . It includes regular BLAST services (blastn, blastp, blastx, tblastn, tblastx), PSI/PHI BlAST, Mega BLAST, and BLAST 2 sequences. These services are implemented in the BBP application server and can be used to search nucleotide or protein BLAST libraries containing sequences from individual or combined Brucella genomes. The sequence libraries are updated periodically to reflect newly curated annotations and when new genomes are added.
TextPresso-powered Brucella literature search
As a software program from the GMOD, Textpresso uses a modified GNU license and is free for academic purposes . The TextPresso package is downloaded from the TextPresso website . An automatic download program is first used to download and extract from PubMed all Brucella-related article information, including titles, authors, publish years, volumes, pages, journal names and abstracts. A BBP script is also developed to extract all Brucella-related full-text PDF files from PubMed. The PDF files are converted into plain text files using the open source XPDF . The converted full text together with abstracts and titles are tokenized into sentences and then to XML-tagged words or phrases representing different ontology categories according to a pre-defined ontology format. All the processed information, including fully annotated abstracts, titles, full texts, citation information, authors, years, keywords and categories, is indexed for efficient query. A web query interface is installed and modified for users to search against the indices and check detailed matching records.
Publication email alert service and automatic updates
The BBP publication email alert service is initiated by a subscribed user to specify the notification frequency (daily, weekly, bimonthly or monthly) and the keywords to be searched against the PubMed database. A daily Linux cron job checks the subscription database, searches for updates in PubMed, and sends the updated paper notification to the users through email. The automatic literature update program allows all Brucella-related publications from the current and previous months automatically updated in the BBP website. It is implemented by dynamically querying PubMed for updated publications during a certain month when the page is opened. This program is also integrated into the Brucella Limix for data curators to obtain the newest publications not stored in local publication database.
ContactsDB and forum
The ContactsDB database stores contact information of individual Brucella researchers in an Oracle database. A PHP program is developed for the users to query, submit, and update contact information in the BBP ContactsDB web page. The discussion forum program is also implemented with PHP and Oracle database.
Brucella Bioinformatics Portal
The Conserved Domain Database
TIGR Comprehensive Microbial Resource
The Clusters of Orthologous Groups
Generic Software Components for Model Organism Databases
Literature Mining and Curation System
Medical Subject Headings
The National Center for Biotechnology Information
The National Institute of Allergy and Infectious Diseases
National Institutes of Health
Natural Language Processing
The Institute for Genomic Research
WZ was supported by the NIH-NIAID R21 grant (#1R21AI057875-01) to Dr. He. ZX was supported by Dr. He's startup funding from the University of Michigan. We appreciate the encouragement and suggestions provided by Drs. Stephen Boyle, Nammalwar Sriranganathan, Gerhardt Schurig, and Bruno Sobral at the Virginia Polytechnic Institute and State University, and Dr. Raju Lathigra at the Walter Reed Army Institute of Research. We thank Dr. Hans-Michael Muller from the California Institute of Technology for his technical help in our development of Textpresso-powered Brucella literature search engine. We also appreciate the reviews and suggestions provided by our colleague Dr. Lesley Colby at the University of Michigan Medical School.
- Corbel MJ: Brucellosis: an overview. Emerg Infect Dis 1997, 3(2):213–221.PubMed CentralView ArticlePubMedGoogle Scholar
- Cloeckaert A, Verger JM, Grayon M, Paquet JY, Garin-Bastuji B, Foster G, Godfroid J: Classification of Brucella spp. isolated from marine mammals by DNA polymorphism at the omp2 locus. Microbes Infect 2001, 3(9):729–738. 10.1016/S1286-4579(01)01427-7View ArticlePubMedGoogle Scholar
- Paulsen IT, Seshadri R, Nelson KE, Eisen JA, Heidelberg JF, Read TD, Dodson RJ, Umayam L, Brinkac LM, Beanan MJ, Daugherty SC, Deboy RT, Durkin AS, Kolonay JF, Madupu R, Nelson WC, Ayodeji B, Kraul M, Shetty J, Malek J, Van Aken SE, Riedmuller S, Tettelin H, Gill SR, White O, Salzberg SL, Hoover DL, Lindler LE, Halling SM, Boyle SM, Fraser CM: The Brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts. Proc Natl Acad Sci U S A 2002, 99(20):13148–13153. 10.1073/pnas.192319099PubMed CentralView ArticlePubMedGoogle Scholar
- Halling SM, Peterson-Burch BD, Bricker BJ, Zuerner RL, Qing Z, Li LL, Kapur V, Alt DP, Olsen SC: Completion of the genome sequence of Brucella abortus and comparison to the highly similar genomes of Brucella melitensis and Brucella suis. J Bacteriol 2005, 187(8):2715–2726. 10.1128/JB.187.8.2715-2726.2005PubMed CentralView ArticlePubMedGoogle Scholar
- DelVecchio VG, Kapatral V, Redkar RJ, Patra G, Mujer C, Los T, Ivanova N, Anderson I, Bhattacharyya A, Lykidis A, Reznik G, Jablonski L, Larsen N, D'Souza M, Bernal A, Mazur M, Goltsman E, Selkov E, Elzer PH, Hagius S, O'Callaghan D, Letesson JJ, Haselkorn R, Kyrpides N, Overbeek R: The genome sequence of the facultative intracellular pathogen Brucella melitensis. Proc Natl Acad Sci U S A 2002, 99(1):443–448. 10.1073/pnas.221575398PubMed CentralView ArticlePubMedGoogle Scholar
- Chain PS, Comerci DJ, Tolmasky ME, Larimer FW, Malfatti SA, Vergez LM, Aguero F, Land ML, Ugalde RA, Garcia E: Whole-genome analyses of speciation events in pathogenic Brucellae. Infect Immun 2005, 73(12):8353–8361. 10.1128/IAI.73.12.8353-8361.2005PubMed CentralView ArticlePubMedGoogle Scholar
- Ratushna VG, Sturgill DM, Ramamoorthy S, Reichow SA, He Y, Lathigra R, Sriranganathan N, Halling SM, Boyle SM, Gibas CJ: Molecular targets for rapid identification of Brucella spp. BMC Microbiol 2006, 6: 13. 10.1186/1471-2180-6-13PubMed CentralView ArticlePubMedGoogle Scholar
- Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Church DM, DiCuccio M, Edgar R, Federhen S, Helmberg W, Kenton DL, Khovayko O, Lipman DJ, Madden TL, Maglott DR, Ostell J, Pontius JU, Pruitt KD, Schuler GD, Schriml LM, Sequeira E, Sherry ST, Sirotkin K, Starchenko G, Suzek TO, Tatusov R, Tatusova TA, Wagner L, Yaschenko E: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2005, 33(Database issue):D39–45. 10.1093/nar/gki062PubMed CentralView ArticlePubMedGoogle Scholar
- Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O'Donovan C, Phan I, Pilbout S, Schneider M: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res 2003, 31(1):365–370. 10.1093/nar/gkg095PubMed CentralView ArticlePubMedGoogle Scholar
- Peterson JD, Umayam LA, Dickinson T, Hickey EK, White O: The Comprehensive Microbial Resource. Nucleic Acids Res 2001, 29(1):123–125. 10.1093/nar/29.1.123PubMed CentralView ArticlePubMedGoogle Scholar
- Scherf M, Epple A, Werner T: The next generation of literature analysis: integration of genomic analysis into text mining. Brief Bioinform 2005, 6(3):287–297.View ArticlePubMedGoogle Scholar
- Hoffmann R, Krallinger M, Andres E, Tamames J, Blaschke C, Valencia A: Text mining for metabolic pathways, signaling cascades, and protein networks. Sci STKE 2005, 2005(283):pe21. 10.1126/stke.2832005pe21PubMedGoogle Scholar
- Hu ZZ, Narayanaswamy M, Ravikumar KE, Vijay-Shanker K, Wu CH: Literature mining and database annotation of protein phosphorylation using a rule-based system. Bioinformatics 2005, 21(11):2759–2765. 10.1093/bioinformatics/bti390View ArticlePubMedGoogle Scholar
- Stephens M, Palakal M, Mukhopadhyay S, Raje R, Mostafa J: Detecting gene relations from Medline abstracts. Pac Symp Biocomput 2001, 483–495.Google Scholar
- Muller HM, Kenny EE, Sternberg PW: Textpresso: an ontology-based information retrieval and extraction system for biological literature. PLoS Biol 2004, 2(11):e309. 10.1371/journal.pbio.0020309PubMed CentralView ArticlePubMedGoogle Scholar
- Rebholz-Schuhmann D, Kirsch H, Couto F: Facts from text--is text mining ready to deliver? PLoS Biol 2005, 3(2):e65. 10.1371/journal.pbio.0030065PubMed CentralView ArticlePubMedGoogle Scholar
- Berardini TZ, Mundodi S, Reiser L, Huala E, Garcia-Hernandez M, Zhang P, Mueller LA, Yoon J, Doyle A, Lander G, Moseyko N, Yoo D, Xu I, Zoeckler B, Montoya M, Miller N, Weems D, Rhee SY: Functional annotation of the Arabidopsis genome using controlled vocabularies. Plant Physiol 2004, 135(2):745–755. 10.1104/pp.104.040071PubMed CentralView ArticlePubMedGoogle Scholar
- Ko J, Splitter GA: Molecular host-pathogen interaction in brucellosis: current understanding and future approaches to vaccine development for mice and humans. Clin Microbiol Rev 2003, 16(1):65–78. 10.1128/CMR.16.1.65-78.2003PubMed CentralView ArticlePubMedGoogle Scholar
- Kohler S, Michaux-Charachon S, Porte F, Ramuz M, Liautard JP: What is the nature of the replicative niche of a stealthy bug named Brucella? Trends Microbiol 2003, 11(5):215–219.View ArticlePubMedGoogle Scholar
- Roop RM, Bellaire BH, Valderas MW, Cardelli JA: Adaptation of the Brucellae to their intracellular niche. Mol Microbiol 2004, 52(3):621–630. 10.1111/j.1365-2958.2004.04017.xView ArticlePubMedGoogle Scholar
- Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, Lewis S: The generic genome browser: a building block for a model organism system database. Genome Res 2002, 12(10):1599–1610. 10.1101/gr.403602PubMed CentralView ArticlePubMedGoogle Scholar
- GMOD: The Generic Model Organism Database Project (GMOD).[http://www.gmod.org/]
- Schwarz EM, Antoshechkin I, Bastiani C, Bieri T, Blasiar D, Canaran P, Chan J, Chen N, Chen WJ, Davis P, Fiedler TJ, Girard L, Harris TW, Kenny EE, Kishore R, Lawson D, Lee R, Muller HM, Nakamura C, Ozersky P, Petcherski A, Rogers A, Spooner W, Tuli MA, Van Auken K, Wang D, Durbin R, Spieth J, Stein LD, Sternberg PW: WormBase: better software, richer content. Nucleic Acids Res 2006, 34(Database issue):D475–8. 10.1093/nar/gkj061PubMed CentralView ArticlePubMedGoogle Scholar
- Textpresso: Textpresso.[http://www.textpresso.org/]
- Tatusov RL, Galperin MY, Natale DA, Koonin EV: The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 2000, 28(1):33–36. 10.1093/nar/28.1.33PubMed CentralView ArticlePubMedGoogle Scholar
- O'Callaghan D, Cazevieille C, Allardet-Servent A, Boschiroli ML, Bourg G, Foulongne V, Frutos P, Kulakov Y, Ramuz M: A homologue of the Agrobacterium tumefaciens VirB and Bordetella pertussis Ptl type IV secretion systems is essential for intracellular survival of Brucella suis. Mol Microbiol 1999, 33(6):1210–1220. 10.1046/j.1365-2958.1999.01569.xView ArticlePubMedGoogle Scholar
- Sola-Landa A, Pizarro-Cerda J, Grillo MJ, Moreno E, Moriyon I, Blasco JM, Gorvel JP, Lopez-Goni I: A two-component regulatory system playing a critical role in plant pathogens and endosymbionts is present in Brucella abortus and controls cell invasion and virulence. Mol Microbiol 1998, 29(1):125–138. 10.1046/j.1365-2958.1998.00913.xView ArticlePubMedGoogle Scholar
- Allen CA, Adams LG, Ficht TA: Transposon-derived Brucella abortus rough mutants are attenuated and exhibit reduced intracellular survival. Infect Immun 1998, 66(3):1008–1016.PubMed CentralPubMedGoogle Scholar
- McQuiston JR, Vemulapalli R, Inzana TJ, Schurig GG, Sriranganathan N, Fritzinger D, Hadfield TL, Warren RA, Snellings N, Hoover D, Halling SM, Boyle SM: Genetic characterization of a Tn5-disrupted glycosyltransferase gene homolog in Brucella abortus and its effect on lipopolysaccharide composition and virulence. Infect Immun 1999, 67(8):3830–3835.PubMed CentralPubMedGoogle Scholar
- Gee JM, Valderas MW, Kovach ME, Grippe VK, Robertson GT, Ng WL, Richardson JM, Winkler ME, Roop RM: The Brucella abortus Cu,Zn superoxide dismutase is required for optimal resistance to oxidative killing by murine macrophages and wild-type virulence in experimentally infected mice. Infect Immun 2005, 73(5):2873–2880. 10.1128/IAI.73.5.2873-2880.2005PubMed CentralView ArticlePubMedGoogle Scholar
- Lestrate P, Dricot A, Delrue RM, Lambert C, Martinelli V, De Bolle X, Letesson JJ, Tibor A: Attenuated signature-tagged mutagenesis mutants of Brucella melitensis identified during the acute phase of infection in mice. Infect Immun 2003, 71(12):7053–7060. 10.1128/IAI.71.12.7053-7060.2003PubMed CentralView ArticlePubMedGoogle Scholar
- Fretin D, Fauconnier A, Kohler S, Halling S, Leonard S, Nijskens C, Ferooz J, Lestrate P, Delrue RM, Danese I, Vandenhaute J, Tibor A, DeBolle X, Letesson JJ: The sheathed flagellum of Brucella melitensis is involved in persistence in a murine model of infection. Cell Microbiol 2005, 7(5):687–698. 10.1111/j.1462-5822.2005.00502.xView ArticlePubMedGoogle Scholar
- Marchesini MI, Ugalde JE, Czibener C, Comerci DJ, Ugalde RA: N-terminal-capturing screening system for the isolation of Brucella abortus genes encoding surface exposed and secreted proteins. Microb Pathog 2004, 37(2):95–105. 10.1016/j.micpath.2004.06.001View ArticlePubMedGoogle Scholar
- Robertson GT, Roop RMJ: The Brucella abortus host factor I (HF-I) protein contributes to stress resistance during stationary phase and is a major determinant of virulence in mice. Mol Microbiol 1999, 34(4):690–700. 10.1046/j.1365-2958.1999.01629.xView ArticlePubMedGoogle Scholar
- Roop RM, Gee JM, Robertson GT, Richardson JM, Ng WL, Winkler ME: Brucella stationary-phase gene expression and virulence. Annu Rev Microbiol 2003, 57: 57–76. 10.1146/annurev.micro.57.030502.090803View ArticlePubMedGoogle Scholar
- Robertson GT, Reisenauer A, Wright R, Jensen RB, Jensen A, Shapiro L, Roop RM: The Brucella abortus CcrM DNA methyltransferase is essential for viability, and its overexpression attenuates intracellular replication in murine macrophages. J Bacteriol 2000, 182(12):3482–3489. 10.1128/JB.182.12.3482-3489.2000PubMed CentralView ArticlePubMedGoogle Scholar
- Graphviz: Graphviz - Graph Visualization Software.[http://www.graphviz.org/]
- SVG: W3C Scalable Vector Graphics (SVG).[http://www.w3.org/Graphics/SVG/]
- Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res 1999, 27(1):29–34. 10.1093/nar/27.1.29PubMed CentralView ArticlePubMedGoogle Scholar
- Bader GD, Donaldson I, Wolting C, Ouellette BF, Pawson T, Hogue CW: BIND--The Biomolecular Interaction Network Database. Nucleic Acids Res 2001, 29(1):242–245. 10.1093/nar/29.1.242PubMed CentralView ArticlePubMedGoogle Scholar
- Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res 2004, 32(Database issue):D449–51. 10.1093/nar/gkh086PubMed CentralView ArticlePubMedGoogle Scholar
- BioPerl: BioPerl.[http://www.bioperl.org]
- Blast: NCBI Blast downloads.[http://www.ncbi.nih.gov/BLAST/download.shtml]
- Xpdf: Xpdf: A PDF Viewer for X.[http://www.foolabs.com/xpdf/]
- GO: GO database .[http://www.godatabase.org]
- Garrett JJ: Ajax: A New Approach to Web Applications.2005. [http://www.adaptivepath.com/publications/essays/archives/000385.php]Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.