Skip to main content

DNAVaxDB: the first web-based DNA vaccine database and its data analysis


Since the first DNA vaccine studies were done in the 1990s, thousands more studies have followed. Here we report the development and analysis of DNAVaxDB (, the first publically available web-based DNA vaccine database that curates, stores, and analyzes experimentally verified DNA vaccines, DNA vaccine plasmid vectors, and protective antigens used in DNA vaccines. All data in DNAVaxDB are annotated from reliable resources, particularly peer-reviewed articles. Among over 140 DNA vaccine plasmids, some plasmids were more frequently used in one type of pathogen than others; for example, pCMVi-UB for G- bacterial DNA vaccines, and pCAGGS for viral DNA vaccines. Presently, over 400 DNA vaccines containing over 370 protective antigens from over 90 infectious and non-infectious diseases have been curated in DNAVaxDB. While extracellular and bacterial cell surface proteins and adhesin proteins were frequently used for DNA vaccine development, the majority of protective antigens used in Chlamydophila DNA vaccines are localized to the inner portion of the cell. The DNA vaccine priming, other vaccine boosting vaccination regimen has been widely used to induce protection against infection of different pathogens such as HIV. Parasitic and cancer DNA vaccines were also systematically analyzed. User-friendly web query and visualization interfaces are available in DNAVaxDB for interactive data search. To support data exchange, the information of DNA vaccines, plasmids, and protective antigens is stored in the Vaccine Ontology (VO). DNAVaxDB is targeted to become a timely and vital source of DNA vaccines and related data and facilitate advanced DNA vaccine research and development.


A DNA vaccine is a bacterial DNA plasmid constructed to express an encoded protein or peptide antigen(s), which is administered in vivo and able to induce preventive or therapeutic antigen-specific immune response against a specific disease or infection. The DNA plasmid must encode a promoter that is active in mammalian cells [1]. The first DNA vaccine studies were conducted in the 1990s, with the first successful experiment done in 1990 when a DNA plasmid expressing a protein was injected into mice and the protein was successfully synthesized in vivo [2]. Once expressed, the protective protein antigen encoded by a gene in a DNA vaccine can be degraded into peptides by antigen presenting cells. The protective immune peptides (i.e., epitopes) can then be transferred to the cell surface and activate T cells. If the DNA vaccine is taken up by muscle cells, the muscle cells can transfer the expressed protein to the antigen presenting cells to stimulate the T cells. DNA vaccines can also stimulate antibody responses through antigen recognition by B cells [1]. Compared to other types of vaccines (e.g., live attenuated or killed whole organism vaccines, and subunit vaccine), DNA vaccines are safe, easy to prepare and store, and cost effective. In addition, DNA vaccines allow for focused immunity on the antigen of interest and have the ability to induce natural, long-lasting, and varied immune responses in vivo.

Intensive efforts have been taken to study and use DNA vaccines. There have been over 55,000 articles indexed in PubMed and Google Scholar combined. Four DNA vaccines have been licensed for veterinary uses. Among them are two DNA vaccines against infectious diseases, including West Nile-Innovator (Fort Dodge Animal Health, Fort Dodge, IA, USA) for protection against horse West Nile virus (, and Apex-IHN (Novartis Animal Health, Basel, Switzerland) against infectious haematopoietic necrosis virus in salmon ( Oncept (Merial, Lyon, France), another licensed DNA vaccine, is for protection against dog melanoma [3]. In addition, one DNA vaccine has been licensed for a gene therapy application in food pigs for increasing litter survival of breeding sows: LifeTide SW 5 (VGX Animal Health, Inc. The Woodlands, TX, USA) ( Although there is not a licensed human DNA vaccine on the market, many human clinical trials for DNA vaccine evaluation are currently being conducted. Researchers are continuing to investigate the mechanisms behind DNA vaccination and vaccine immunity.

While intensive research has been conducted and resulted in promising DNA vaccine candidates, there is no web-based central resource that allows the storage, annotation, comparison, and analysis of experimentally verified DNA vaccines. VIOLIN ( is the first web-based comprehensive vaccine database and analysis system that targets vaccine research [4, 5]. Currently, VIOLIN has included more than 3,200 vaccines for over 180 infectious diseases and many non-infectious diseases (e.g., cancers and arthritis). Each of the curated vaccines has been shown to induce significant protection against a disease in a natural host or at least one laboratory animal model. Many of the vaccines in VIOLIN are verified DNA vaccines. However, the general VIOLIN system does not include any specific information about DNA vaccine plasmids. It does not include tools to specifically query and analyze those protective antigens used in DNA vaccines. The DNA vaccine-specific plasmids and protective antigens may preserve unique patterns that would support further DNA vaccine research and development. A specific query system for DNA vaccines is not available either in the general system. The DNA vaccine research has become its own independent research and has its own community (for example, there is an active DNA Vaccines LinkedIn group ( Therefore, there is a need to develop a DNA vaccine specific resource.

To address the many needs of the DNA vaccine community, we have developed DNAVaxDB (, a web-based DNA vaccine database and analysis system. As a relatively independent program under the umbrella of the VIOLIN vaccine database and analysis system, DNAVaxDB includes the information for a large set of DNA vaccines, DNA vaccine plasmids, and protective antigens used in DNA vaccines. The analysis of these data reveals many unique patterns specific for DNA vaccines.


DNAVaxDB system and database design

DNAVaxDB was developed with a three-tier architecture built on two HP ProLiant DL380 G6 servers that run the Redhat Enterprise Linux (release 5.8) operating system. A query on DNAVaxDB can be submitted from the DNAVaxDB web user interface (presentation tier). The query is then processed using PHP/SQL (middle tier, application server based on Apache) against a relational database (MySQL version 5.5.28) (data tier, database server). Query results are then displayed in the web browser. Furthermore, DNAVaxDB curators and reviewers can submit and review curated vaccine and plasmid data through the VIOLIN data curation system. Two servers regularly back up each other's data. DNAVaxDB is one relatively independent program under the umbrella of the VIOLIN vaccine database and analysis system [4].

Figure 1 shows the DNAVaxDB workflow and system design. For each vaccine, the DNAVaxDB database contains at minimum the following information: (1) General DNA vaccine information, including name, host animal used as model, immunization route, vector, antigen name, stage of vaccine development (e.g. research, clinical trial, or licensed), and efficacy. Demonstration of protection through a challenge experiment or a protective level of antibodies is necessary for inclusion in DNAVaxDB. This information is manually curated to emphasize the details of the paper, particularly the protection experiments. Antigens inserted into the vector are searched for in the NCBI Gene, Protein, and Nucleotide databases. If found, they are added to VIOLIN by using the Gene, Protein, or Nucleotide ID and an internal script to retrieve the information in NCBI, labeled as a "protective antigen," and linked to the DNA vaccine in VIOLIN. (2) DNA vaccine plasmid information. For each plasmid, the database includes name, Vaccine Ontology ID [6, 7], and reference, and, if available, manufacturer, promoter, antibiotics resistance gene, and length. (3) Comprehensive citation information for each vaccine such as title, authors, journal, issue, volume, year, and pages. These references are usually found in PubMed [8], and the information found in VIOLIN is retrieved from PubMed using the PubMed ID (i.e., PMID) and an internal script. Each vaccine and plasmid is assigned a Vaccine Ontology (VO) identifier, linking the two databases.

Figure 1
figure 1

DNAVaxDB workflow and system design. Data, including vaccine information and evidence from peer-reviewed publications, is manually curated into DNAVaxDB. Gene IDs, Protein IDs, or Nucleotide IDs are retrieved from NCBI and used by an internal script to gather detailed information about each protective antigen, which is then stored in DNAVaxDB. DNA and protein sequences can be used for BLAST, COG, and Vaxign analyses, and the latter two are stored in DNAVaxDB. A PubMed ID (PMID) from the relevant paper is also used by an internal script to gather comprehensive citation information for each study. Each vaccine and plasmid is assigned a Vaccine Ontology (VO) ID.

Semi-automatic annotation of vaccine information

A semi-automatic DNAVaxDB annotation system is developed for DNA vaccine and related data curation and analysis by modifying an in-house web-based literature mining and curation system called Limix [4, 9, 10] (Figure 2). This system includes two parts: vaccine and protective antigen data curation based on the existing VIOLIN curation system [4], and a newly developed database specifically for DNA vaccine plasmids. The web-based DNAVaxDB annotation system provides many interactive features for curators to easily obtain information. For example, once an NCBI Gene ID is entered, the system can automatically retrieve all the gene and protein information (e.g., DNA and protein sequences, names, functions) from the NCBI Gene database [11] (Figure 2A and 2B). Similarly, a PMID is sufficient for the system to retrieve all reference information. The curation system also allows a data curator to search literature, copy and edit text, and submit data to database (Figure 2C-2E). After a curator submits data, a domain expert will serve as the data reviewer to review, edit, and approve the curated data through a user-friendly web interface. Immediately after the approval by a reviewer, the curated data submitted will be posted publicly. The DNAVaxDB annotation also stores all the history changes and traces the detailed changes made to each record (Figure 2F).

Figure 2
figure 2

Submission and review processes of DNA vaccines into DNAVaxDB. A) The overall view of the top part of the DNAVaxDB data submission and review page. B) Gene submission form. Once an NCBI Gene, Protein, or Nucleotide ID is entered, an internal script is used to retrieve information such as Taxonomy ID, protein name, and DNA and protein sequences from NCBI to create a new gene entry in VIOLIN. C) Vaccine addition page. Information such as vaccine name, plasmid and antigen information, efficacy, and references can be added on this page. D) Vaccine information section includes all vaccines curated for this pathogen. Any specific vaccine can be further examined and edited. E) Edit Dialog box. Curators and reviewers can use this tool to edit a specific part of a vaccine (e.g., vaccine name). A reference may also be added for many items of information (e.g., vaccine efficacy). F) Show Revision History page. Curators and reviewers can use this tool to see what changes have been made and compare page edits.

Analysis of protective antigens used in DNA vaccine development

For each protective antigen used in DNA vaccine development, the Vaxign software program was used for analysis of subcellular localization and adhesin probability [12, 13]. The analysis of the Clusters of Orthologous Groups (COG) for each protective antigen was also performed.

A DNAVaxDB BLAST search program has been developed to support sequence similarity search on our customized BLAST library that contains over 370 protective antigens used in DNA vaccine development.

DNAVaxDB data query, visualization, and analysis

Three user-friendly web interfaces were developed for users to separately query DNA vaccines, protective antigens, and DNA vaccine plasmids. The three parts of queries are also interlinked. For example, the display of the query results of a DNA vaccine will typically include the plasmid and protective antigen(s) used, and the name or IDs of these plasmid and antigens can then be linked to other web pages specifically for the plasmid and antigen.

To perform some systematic bioinformatics analyses, sometimes we developed MySQL scripts that could be used to directly interact with the backend MySQL database. This activity was often performed to quickly categorize and compare various statistics and characteristics of the DNA vaccines, plasmids, and antigens.

DNAVaxDB data exchange, transfer, and download

The information of the DNA vaccines, protective antigens, and DNA vaccine plasmids from DNAVaxDB is stored in Vaccine Ontology (VO; [6]. The VO is developed using the Web Ontology Language (OWL) that can be automatically processed via software programs [14].


DNAVaxDB statistics

As of September 3, 2013, DNAVaxDB has included 417 DNA vaccines. Each of these DNA vaccines was experimentally verified to be protective or therapeutic with at least one laboratory animal model. In total, these DNA vaccines use 141 DNA vaccine plasmids, and 375 protective antigens have been curated for use in DNA vaccines. Occasionally, more than one antigen is used in a single DNA vaccine. These vaccines are developed against 99 infectious and non-infectious diseases (including arthritis, cancer, and diabetes). Table 1 lists infectious and non-infectious diseases for which DNAVaxDB stores 10 or more vaccines. Some protective antigens are orthologs from different strains. For example, Brucella has 15 protective antigens listed. However, only 10 of them are unique; the other 5 are orthologs of some of the 10 genes from different Brucella strains.

Table 1 Diseases with 10 or more DNA vaccines in DNAVaxDB

Analysis of DNAVaxDB vaccine DNA plasmids

The most popular plasmids found in DNAVaxDB were analyzed (Table 2). Among the most commonly used plasmids were pcDNA3.1, pcDNA3, pVAX1, pVR1012, and pCI. Based on the information available on the manufacturers' websites, peer-reviewed articles, and other reliable sources, certain patterns were established among these popularly used plasmids. The first pattern found was the common use of the human cytomegalovirus/immediate-early (CMV) promoter among almost all the plasmids. The two popular plasmids that did not use this promoter, pCAGGS and pCAGGSP7, utilized the AG/CMV-IE/lac promoter ( The CMV promoter elicits higher expression levels than other promoters, and therefore is commonly used.

Table 2 Plasmids in DNAVaxDB with five or more associated vaccines

Secondly, several commonly used plasmids utilized an SV40 polyadenylation terminator. This type of terminator is commonly used to reduce additional peptide expression or alteration of the expressed peptide [15].

Thirdly, some plasmids have been more frequently used for development of DNA vaccines against one type of pathogens than the others (Table 2). For example, pCAGGS has been used for development of 10 successful viral DNA vaccines; however, this plasmid has not been used for any other types of DNA vaccines stored in DNAVaxDB. Similarly, 10 Gram-negative bacterial DNA vaccines use the plasmid pCMVi-UB, and this plasmid has not been used for DNA vaccines against any Gram-positive bacteria or other types of pathogens (Table 2).

Finally, most of the commonly used plasmids with information available about antibiotic resistance markers employed an ampicillin marker. This is a noteworthy observation, since there is potential hypersensitivity to ampicillin in some patients, and therefore these plasmids would not be acceptable for human use. Kanamycin is a more acceptable marker for human use [15]. Further research is necessary to determine what makes these plasmids so effective in lab animals and what can be done to modify them for human use, such as replacing the ampicillin resistance marker with a kanamycin marker and finding new ways to further increase antigen expression.

Analysis of bacterial DNA vaccines in DNAVaxDB

In total, there are 95 protective bacterial antigens used in DNA vaccines stored in DNAVaxDB. These protective antigens are also cross-described in the VIOLIN Protegen database [16]. It has been reported that most protective antigens for vaccine development are secreted extracellular bacterial proteins, cell surface proteins, or adhesins [16]. Therefore, one important aspect of this study is to analyze these DNA vaccine antigens in terms of their subcellular localization and adhesin probability and compare these with the general patterns when all protective antigens for any types of bacterial vaccines are considered [16].

As shown in Table 3, of the 28 Gram-positive bacterial protective antigens, 18 (64%) have extracellular localization, and 3 (11%) are cell wall proteins. The remaining antigens are distributed between cytoplasmic region (2 or 7%) and cytoplasmic membrane (5 or 18%). In addition, 11 antigens (39%) are adhesins or adhesin-like proteins. Therefore, extracellular and adhesin antigens are superior targets for Gram-positive DNA vaccine construction.

Table 3 Subcellular localization and adhesin probability of bacterial protective antigens

Of the 67 Gram-negative bacterial protective antigens, the extracellular and outer membrane locations are among the major subcellular regions with 11 and 20 protective antigens, respectively (16% and 30%). There are 12 cytoplasmic protein antigens (18%). The numbers of cytoplasmic membrane and periplasmic proteins are small. However, there are 18 antigens with unpredicted subcellular localization. Our study also found that 20 antigens (30%) used in Gram-negative bacterial DNA vaccines are adhesin or adhesin-like proteins (Table 3). In summary, we found that the extracellular or surface or adhesin proteins preferred targets for Gram-negative DNA vaccine construction; meanwhile, cytoplasmic proteins also form a group of frequently used antigens in this type of DNA vaccines.

Although the extracellular, cell surface, and adhesin proteins are preferred bacterial DNA vaccine antigens, the data shown in DNA vaccines against Chlamydophila provides a different view (Table 3 and 4). Fourteen unique protective Chlamydophila antigens from the Gram-negative bacterial pathogen were used for development of experimentally verified DNA vaccines. This species did not have any antigens that were repeatedly used with different adjuvants or in different combination with other antigens. These antigens have many different functions, and in total there are 7 different COG categories. Among the 14 protective Chlamydophila antigens used, 6 were cytoplasmic, 2 were at the cytoplasmic membrane, 3 were at the outer membrane, 1 was extracellular, and 2 were unknown. However, the two protective antigens with unknown localizations are believed to be localized to the cytoplasm; GatC is part of a larger complex also including GatA and GatB, which are both localized to the cytoplasm, and Ssb is a DNA binding protein and DNA is localized to the cytoplasm. Therefore, different from the common pattern of high percentage of extracellular and outer membrane antigens, the majority of protective antigens used for Chlamydophila DNA vaccines are cytoplasmic proteins.

Table 4 Antigens in DNAVaxDB for Chlamydophila spp.

Analysis of viral DNA vaccines in DNAVaxDB

In total, DNAVaxDB contains over 200 DNA vaccines against over 40 viruses. In order to get a closer look at viral DNA vaccines, we chose to analyze human immunodeficiency virus (HIV) and the related diseases simian-human immunodeficiency virus (SHIV) and simian immunodeficiency virus (SIV). HIV is a prevalent problem worldwide. The SHIV and SIV models have been frequently used for HIV vaccine studies. Information about the HIV, SHIV, and SIV vaccines can be found in Table 5. HIV-1 and SHIV89.6P were commonly used strains for challenge, but there were no strains that were used the majority of the time. Overall, the HIV, SHIV, and SIV DNA vaccines found in DNAVaxDB are diverse. Many aspects of the HIV, SHIV, and SIV DNA vaccines do not follow any particular pattern. All fourteen of the vaccines are quite unique and utilize many different techniques in order to obtain protection.

Table 5 HIV, SHIV, and SIV DNA vaccines

There were many different antigens used, the most common being env and gag. Env is an envelope protein, which is synthesized as gp160 and later cleaved into gp120 and gp41 [17]. Gag functions in the construction of virus-like particles (VLPs) [18]. Frequently more than one antigen has been used for HIV vaccine development (Table 5).

Only two plasmids were repeatedly used for at least two vaccines: pV1R and pSW3891. Although pcDNA3.1 is the most common DNA vaccine plasmid used in DNA vaccine development according to the statistics in DNAVaxDB (Table 2), this plasmid is not frequently used in HIV vaccine development. A few studies used pcDNA3.1 for HIV vaccine development, but protection was not evaluated in these investigations. For example, Du and colleagues built two vaccines, pcDNA-Vif and pcDNA-LIGHT, both based on pcDNA3.1. They then measured antibodies and T-cell responses [19]. However, since a protection study was not done, we could not include this paper in DNAVaxDB.

One pattern that did stand out was the division between prime-boost vaccines that employed the DNA vaccine as a prime and DNA vaccination alone. Eight of the vaccines in DNAVaxDB are prime-boost vaccines, and six are DNA vaccines alone. Five out of the 14 HIV DNA vaccines are also mixed with adjuvants to boost host immune responses. After analysis of the two groups, it was discovered that most of the prime-boost vaccines demonstrated high or complete protection of all challenged animals. Looking at the dates of each study, it's also clear that the prime-boost studies were more recent while the older studies were usually DNA vaccination alone.

Analysis of parasite DNA vaccines in DNAVaxDB

DNAVaxDB contains 61 DNA vaccines against 16 protozoan pathogens. The two pathogens with the most vaccines were Plasmodium spp. and Toxoplasma gondii, as seen in Table 1. These two pathogens also had the most protective antigens out of all parasites. Analysis of Plasmodium vaccines showed a wide range of vaccination methods. Variety was seen most commonly in immunization route (i.e. gene gun, intramuscular, etc.) and vaccination regimen (i.e. with or without an adjuvant, with or without boosting). However, as with all vaccines in DNAVaxDB, all these different methods proved efficacious and provided protection, demonstrating that there are multiple ways to stimulate a host's immune response via DNA vaccine.

Analysis of cancer DNA vaccines in DNAVaxDB

Cancer vaccines found in DNAVaxDB were analyzed, as cancer is the major non-infectious disease found in the database. Of the 34 cancer protective antigens, 8 were localized to the cytoplasmic membrane, 3 to the cytoplasm, 1 to the outer membrane, 2 are extracellular, and 20 have an unknown location. Eliminating those with an unknown location, 11 of the 14 protective antigens have locations near or on the outside of the cell. This makes for easier access for antibodies. Another noteworthy observation is that nearly all the vaccines stored in DNAVaxDB are used for the prevention of tumors rather than the treatment of existing tumors. While cancer prevention is important, treatment is more clinically significant [20]. DNA vaccines for cancer treatment need to be able to elicit a tumor-specific response to kill tumor cells and control the growth of new cells. The vaccines that have been discovered for cancer treatment thus far are effective using experimental models. However, in most cases, clinical trials are needed to evaluate the possibilities of using these DNA vaccines to confer protective immunity in humans.

DNAVaxDB data query, display, and data sharing

All information stored in DNAVaxDB is publically available for download and exchange as described in the Materials and Methods section. The curated DNAVaxDB data can be proficiently queried by vaccine, plasmid, or protective antigen as seen in Figure 3.

Figure 3
figure 3

An example of searching a vaccine and its related information in DNAVaxDB. (A) The vaccine name "mic8" is queried in DNAVaxDB. (B) The resulting vaccine in DNAVaxDB. (C) The DNAVaxDB page obtained by clicking on "T gondii DNA vaccine encoding MIC8" in (B). The DNAVaxDB page for the vaccine has information about the vaccine (protective antigen inserted, vector, host response, etc.). (D) The gene information page obtained by clicking on the "Detailed Gene Information Here" link from (C). This gives more information about the protective antigen used in the vaccine. (E) The Ontobee [40] page detailing the hierarchy of VO and showing the place of pVAX1, the plasmid used in this vaccine. This page is obtained by clicking on the VO ID in (C). The plasmid name can also be clicked to access a plasmid-specific page (data not shown).

The Vaccine Ontology (VO) is a community-based ontology that represents the vaccines and vaccine-related terms and the relations among these terms [7, 21]. VO is developed using the machine-processable Web Ontology Language (OWL) format [22]. All the DNA vaccines, DNA vaccine plasmids, and protective antigens used in these DNA vaccines, and the relations among these terms, have been logically represented in VO. The inclusion of these data in VO allows software programs to easily process the data and use them for further application. In addition, the vaccine information in the Vaccine Ontology (VO) has also been used to integrate the data in the DNAVaxDB web project design. For example, the VO information is queried using internal SPARQL [23] scripts against a Resource Description Framework (RDF) triple store [24] that stores the ontology contents. The VO IDs of different vaccines are used as the glue that links the vaccines in the DNAVaxDB database and the general VIOLIN vaccine database.


To the best of our knowledge, DNAVaxDB is the first web-based, publically available database and analysis system that targets the curation and analysis of DNA vaccines, related DNA vaccine plasmids, and protective antigens. DNAVaxDB contains manually-curated DNA vaccines that have been used in the prevention and treatment of a wide variety of infectious diseases, as well as diseases such as cancer. Analysis of DNAVaxDB data has allowed us to reveal specific patterns from the experimentally verified DNA vaccines, plasmids, and protective antigens. These results support better understanding of protective DNA vaccine mechanisms and facilitate rational DNA vaccine design.

A few challenges have been identified throughout the DNA vaccine curation process. One challenge is that many DNA vaccines have been developed, but very few offer significant protection against challenge. Finding vaccines that offered robust protection in animal models could be difficult at times. To make DNAVaxDB valuable, we insisted that all DNA vaccines stored in the database were experimentally verified to induce protection in at least one laboratory animal model. Accuracy will not only provide more precise analysis, but also make DNAVaxDB a more useful tool for researchers.

In most cases, our studies with protective antigens collected in DNAVaxDB are consistent with previous finding that the bacterial surface or extracellular proteins are often good vaccine candidates [10, 16]. We have previously identified another group of microbial genes called "virmugens", which is coined to represent a gene that encodes for a pathogen virulence factor and has been proven feasible in at least one animal model to make a live attenuated vaccine after this gene is knocked out [25]. It has been known that virulence or pathogenic factors can often be used as protective antigens [26]. Our study also found that some virmugens, such as herpes simplex virus 1 gD [27, 28], Brucella melitensis Omp25 [29, 30], and influenza virus M2 [31, 32], are used for DNA vaccine development and function as protective antigens. More specific studies should be performed to investigate the relationships among general protective antigens, protective antigens used in DNA vaccine development, and virmugens.

Although many bacterial DNA vaccines use surface or extracellular proteins as protective antigens, surprisingly almost the half of the Chlamydophila protective antigens used in DNA vaccines are cytoplasmic, Cytoplasmic inner proteins are typically not accessible to antibodies. It suggests that other immune response mechanism(s), such as cytolytic T lymphocytes (CTL) reactivity inducible by both inner and outer proteins, are most likely responding to these antigens [1]. However, antibodies are an important immune response, and the inability to generate such an antibody response would seem to be detrimental to a vaccine. Several of these candidates came from one paper, who also discovered that the outer membrane proteins are not always protective either [33]. More work needs to be done on this pathogen as well as other pathogens to better understand the protective vaccine mechanisms associated with different types of antigens.

As demonstrated in the patterns identified from the HIV, SHIV, and SIV DNA vaccine research, the prime-boost vaccination with DNA priming has been of particular interest recently due to its effectiveness. This vaccination regimen is able to elicit a T cell response that is focused on the antigens encoded in the vaccine. This allows for a focused and potent immune response [34]. The prime-boost method is also cost-effective; DNA vaccines alone are fairly inexpensive to manufacture. However, DNA vaccines must be administered in high doses to be effective alone, which negates their cost effectiveness advantage. By using a DNA vaccine as a priming vaccine, which can be administered in much smaller quantities than if the DNA vaccine was used alone, it will result in a cost advantage. Further, some boosting vaccines, such as protein vaccines, can be expensive, so by using DNA vaccines as a priming vaccine instead of a protein vaccine as both a priming and a boosting vaccine, it provides further savings [35]. Prime-boost vaccines in DNAVaxDB also used a variety of vaccines for the boost. Three of the eight HIV DNA vaccines utilized adenovirus vectors (Table 5). These vectors are popular due to their ability to elicit strong T cell responses, and several different serotypes of adenoviruses have been explored as potential vectors. However, several other vectors, such as poxviruses and modified vaccinia virus Ankara (MVA), have been used to generate recombinant vector vaccines for boosting, and the administration of protein boost vaccines has also been frequently used for boosting host B cell immune responses [34, 3639]. The frequent use of the prime-boost regimen strategy is evident in DNAVaxDB, as the database contains many prime-boost DNA vaccines with different types of vaccines for boosting.

The DNAVaxDB database and analysis system has many applications, with one being the ability to support rational vaccine design. Since DNAVaxDB stores both plasmids and protective antigens used in DNA vaccines, this information can be used to mix and match the components to create the safest and most efficacious vaccines possible. Further, DNA vaccines can be designed based on successful plasmid and protective antigen combinations in other pathogens. We plan to develop new software capabilities in the future to support rational DNA vaccine design.

DNAVaxDB is slated to become a vital source of DNA vaccine information and will support researchers in the areas of immunology, vaccinology, and microbiology. As a central and vital source of DNA vaccines, DNAVaxDB is a timely repository and is expected to have a significant impact for vaccine research and development.





Clusters of Orthologous Groups


Cytotoxic T lymphocyte


Deoxyribonucleic acid


Human immunodeficiency virus


Modified vaccinia virus Ankara


National Center for Biotechnology Information


PubMed identification number


Resource Description Framework


Simian immunodeficiency virus


Simian-human immunodeficiency virus


Web Ontology Language


Vaccine Investigation and Online Information Network


Vaccine Ontology


  1. Liu MA: DNA vaccines: a review. Journal of internal medicine. 2003, 253 (4): 402-410. 10.1046/j.1365-2796.2003.01140.x.

    Article  CAS  PubMed  Google Scholar 

  2. Wolff JA, Malone RW, Williams P, Chong W, Acsadi G, Jani A, Felgner PL: Direct gene transfer into mouse muscle in vivo. Science. 1990, 247 (4949 Pt 1): 1465-1468.

    Article  CAS  PubMed  Google Scholar 

  3. Bergman PJ, Camps-Palau MA, McKnight JA, Leibman NF, Craft DM, Leung C, Liao J, Riviere I, Sadelain M, Hohenhaus AE, Gregor P, Houghton AN, Perales MA, Wolchok JD: Development of a xenogeneic DNA vaccine program for canine malignant melanoma at the Animal Medical Center. Vaccine. 2006, 24 (21): 4582-4585. 10.1016/j.vaccine.2005.08.027.

    Article  CAS  PubMed  Google Scholar 

  4. Xiang Z, Todd T, Ku KP, Kovacic BL, Larson CB, Chen F, Hodges AP, Tian Y, Olenzek EA, Zhao B, Colby LA, Rush HG, Gilsdorf JR, Jourdian GW, He Y: VIOLIN: vaccine investigation and online information network. Nucleic Acids Res. 2008, 36 (Database): D923-928.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  5. He Y, Racz R, Sayers S, Lin Y, Todd TE, Hur J, Li X, Patel M, Zhao B, Chung M, Ostrow J, Sylora A, Dungarani P, Ulysse G, Kochhar K, Bidri B, Strait K, Jourdian GW, Xiang Z: Updates on the Web-based VIOLIN Vaccine Database and Analysis System. Nucl Acids Res. 2014, 42 (1): D1124-D1132. 10.1093/nar/gkt1133.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  6. He Y, Cowell L, Diehl AD, Mobley HL, Peters B, Ruttenberg A, Scheuermann RH, Brinkman RR, Courtot M, Mungall C, Xiang Z, Chen F, Todd T, Colby LA, Rush H, Whetzel T, Musen MA, Athey BD, Omenn GS, Smith B: VO: Vaccine Ontology. The 1st International Conference on Biomedical Ontology (ICBO-2009). 2009, Buffalo, NY, USA, Nature Precedings; 2009:

    Google Scholar 

  7. Lin Y, He Y: Ontology representation and analysis of vaccine formulation and administration and their effects on vaccine immune responses. J Biomed Semantics. 2012, 3 (1): 17-10.1186/2041-1480-3-17.

    Article  PubMed Central  PubMed  Google Scholar 

  8. The US National Library of Medicine. The PubMed database:, accessed on September 5, 2013

  9. Xiang Z, Zheng W, He Y: BBP: Brucella genome annotation with literature mining and curation. BMC Bioinformatics. 2006, 7 (1): 347-10.1186/1471-2105-7-347.

    Article  PubMed Central  PubMed  Google Scholar 

  10. Yang B, Sayers S, Xiang Z, He Y: Protegen: a web-based protective antigen database and analysis system. Nucleic Acids Res. 2011, 39 (Database): D1073-1078. 10.1093/nar/gkq944.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. Maglott D, Ostell J, Pruitt KD, Tatusova T: Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 2011, 39 (Database): D52-57. 10.1093/nar/gkq1237.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  12. Xiang Z, He Y: Genome-wide prediction of vaccine targets for human herpes simplex viruses using Vaxign reverse vaccinology. BMC Bioinformatics. 2013, 14 (Suppl 4): S2-10.1186/1471-2105-14-S4-S2.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. He Y, Xiang Z, Mobley HL: Vaxign: the first web-based vaccine design program for reverse vaccinology and applications for vaccine development. J Biomed Biotechnol. 2010, 2010: 297505-

    PubMed Central  PubMed  Google Scholar 

  14. Xiang Z, Courtot M, Brinkman RR, Ruttenberg A, He Y: OntoFox: web-based support for ontology reuse. BMC Res Notes. 2010, 3: 175-10.1186/1756-0500-3-175.

    Article  PubMed Central  PubMed  Google Scholar 

  15. Williams JA, Carnes AE, Hodgson CP: Plasmid DNA vaccine vector design: impact on efficacy, safety and upstream production. Biotechnology advances. 2009, 27 (4): 353-370. 10.1016/j.biotechadv.2009.02.003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  16. He Y, Xiang Z: Bioinformatics analysis of bacterial protective antigens in manually curated Protegen database. Procedia in Vaccinology. 2012, 6: 3-9.

    Article  CAS  Google Scholar 

  17. Checkley MA, Luttge BG, Freed EO: HIV-1 envelope glycoprotein biosynthesis, trafficking, and incorporation. J Mol Biol. 2011, 410 (4): 582-608. 10.1016/j.jmb.2011.04.042.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  18. Freed EO: HIV-1 gag proteins: diverse functions in the virus life cycle. Virology. 1998, 251 (1): 1-15. 10.1006/viro.1998.9398.

    Article  CAS  PubMed  Google Scholar 

  19. Du J, Wu X, Long F, Wen J, Hao W, Chen R, Kong X, Qian M, Jiang W: Improvement in efficacy of DNA vaccine encoding HIV-1 Vif by LIGHT gene adjuvant. Viral immunology. 2013, 26 (1): 68-74. 10.1089/vim.2012.0073.

    Article  PubMed Central  PubMed  Google Scholar 

  20. Zhang M, Obata C, Hisaeda H, Ishii K, Murata S, Chiba T, Tanaka K, Li Y, Furue M, Chou B, Imai T, Duan X, Himeno K: A novel DNA vaccine based on ubiquitin-proteasome pathway targeting 'self'-antigens expressed in melanoma/melanocyte. Gene therapy. 2005, 12 (13): 1049-1057. 10.1038/

    Article  CAS  PubMed  Google Scholar 

  21. Ozgur A, Xiang Z, Radev DR, He Y: Mining of vaccine-associated IFN-gamma gene interaction networks using the Vaccine Ontology. J Biomed Semantics. 2011, 2 (Suppl 2): S8-10.1186/2041-1480-2-S2-S8.

    Article  PubMed Central  PubMed  Google Scholar 

  22. W3C: OWL 2 Web Ontology Language Quick Reference Guide (Second Edition), W3C Recommendation 11 December 2012. 2012, []

    Google Scholar 

  23. SPARQL query language for RDF. []

  24. Beckett D: The W3C Consortium. RDF/XML syntax specification. 2004, accessed on September 5, 2013, []

    Google Scholar 

  25. Racz R, Chung M, Xiang Z, He Y: Systematic annotation and analysis of "virmugens" - virulence factors whose mutants can be used as live attenuated vaccines. Vaccine. 2013, 31 (5): 797-805. 10.1016/j.vaccine.2012.11.066.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  26. He Y, Rappuoli R, De Groot AS, Chen RT: Emerging vaccine informatics. J Biomed Biotechnol. 2010, 2010: 218590-

    PubMed Central  PubMed  Google Scholar 

  27. Awasthi S, Lubinski JM, Eisenberg RJ, Cohen GH, Friedman HM: An HSV-1 gD mutant virus as an entry-impaired live virus vaccine. Vaccine. 2008, 26 (9): 1195-1203. 10.1016/j.vaccine.2007.12.032.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  28. Hu K, Dou J, Yu F, He X, Yuan X, Wang Y, Liu C, Gu N: An ocular mucosal administration of nanoparticles containing DNA vaccine pRSC-gD-IL-21 confers protection against mucosal challenge with herpes simplex virus type 1 in mice. Vaccine. 2011, 29 (7): 1455-1462. 10.1016/j.vaccine.2010.12.031.

    Article  CAS  PubMed  Google Scholar 

  29. Edmonds MD, Cloeckaert A, Elzer PH: Brucella species lacking the major outer membrane protein Omp25 are attenuated in mice and protect against Brucella melitensis and Brucella ovis. Vet Microbiol. 2002, 88 (3): 205-221. 10.1016/S0378-1135(02)00110-4.

    Article  CAS  PubMed  Google Scholar 

  30. Commander NJ, Spencer SA, Wren BW, MacMillan AP: The identification of two protective DNA vaccines from a panel of five plasmid constructs encoding Brucella melitensis 16M genes. Vaccine. 2007, 25 (1): 43-54. 10.1016/j.vaccine.2006.07.046.

    Article  CAS  PubMed  Google Scholar 

  31. Watanabe S, Watanabe T, Kawaoka Y: Influenza A virus lacking M2 protein as a live attenuated vaccine. J Virol. 2009, 83 (11): 5947-5950. 10.1128/JVI.00450-09.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  32. Tompkins SM, Zhao ZS, Lo CY, Misplon JA, Liu T, Ye Z, Hogan RJ, Wu Z, Benton KA, Tumpey TM, Epstein SL: Matrix protein 2 vaccination and protection against influenza viruses, including subtype H5N1. Emerg Infect Dis. 2007, 13 (3): 426-435. 10.3201/eid1303.061125.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  33. Stemke-Hale K, Kaltenboeck B, DeGraves FJ, Sykes KF, Huang J, Bu CH, Johnston SA: Screening the whole genome of a pathogen in vivo for individual protective antigens. Vaccine. 2005, 23 (23): 3016-3025. 10.1016/j.vaccine.2004.12.013.

    Article  CAS  PubMed  Google Scholar 

  34. Hutnick NA, Myles DJ, Bian CB, Muthumani K, Weiner DB: Selected approaches for increasing HIV DNA vaccine immunogenicity in vivo. Curr Opin Virol. 2011, 1 (4): 233-240. 10.1016/j.coviro.2011.08.003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Lu S: Immunogenicity of DNA vaccines in humans: it takes two to tango. Human vaccines. 2008, 4 (6): 449-452. 10.4161/hv.4.6.6179.

    Article  CAS  PubMed  Google Scholar 

  36. Hill AV, Reyes-Sandoval A, O'Hara G, Ewer K, Lawrie A, Goodman A, Nicosia A, Folgori A, Colloca S, Cortese R, Gilbert SC, Draper SJ: Prime-boost vectored malaria vaccines: progress and prospects. Human vaccines. 2010, 6 (1): 78-83. 10.4161/hv.6.1.10116.

    Article  CAS  PubMed  Google Scholar 

  37. Robert-Guroff M: Replicating and non-replicating viral vectors for vaccine development. Current opinion in biotechnology. 2007, 18 (6): 546-556. 10.1016/j.copbio.2007.10.010.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. McShane H, Hill A: Prime-boost immunisation strategies for tuberculosis. Microbes Infect. 2005, 7 (5-6): 962-967. 10.1016/j.micinf.2005.03.009.

    Article  CAS  PubMed  Google Scholar 

  39. Prud'homme GJ: DNA vaccination against tumors. The journal of gene medicine. 2005, 7 (1): 3-17. 10.1002/jgm.669.

    Article  PubMed  Google Scholar 

  40. Xiang Z, Mungall C, Ruttenberg A, He Y: Ontobee: A linked data server and browser for ontology terms. The 2nd International Conference on Biomedical Ontologies (ICBO): 2011. 2013, CEUR Workshop Proceedings; 279-281[]

    Google Scholar 

Download references

Declaration statement

The publication funding came from NIH-NIAID grant R01AI081062.

This article has been published as part of BMC Bioinformatics Volume 15 Supplement 4, 2014: Selected articles on Computational Vaccinology 2013. The full contents of the supplement are available online at

Author information

Authors and Affiliations


Corresponding author

Correspondence to Yongqun He.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

RR: Data curation and review, data analysis, and manuscript writing. XL, MP: Data curation. ZX: Data processing, software programming, and database administrator. YH: Project design, result analysis and interpretation, and manuscript writing. All authors approved the paper publication.

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit

The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Racz, R., Li, X., Patel, M. et al. DNAVaxDB: the first web-based DNA vaccine database and its data analysis. BMC Bioinformatics 15 (Suppl 4), S2 (2014).

Download citation

  • Published:

  • DOI: