SPODOBASE : an EST database for the lepidopteran crop pest Spodoptera
- Vincent Nègre1, 6,
- Thierry Hôtelier1,
- Anne-Nathalie Volkoff2,
- Sylvie Gimenez2,
- François Cousserans2,
- Kazuei Mita4,
- Xavier Sabau5,
- Janick Rocher2, 7,
- Miguel López-Ferber2,
- Emmanuelle d'Alençon2,
- Pascaline Audant3,
- Cécile Sabourault3,
- Vincent Bidegainberry3,
- Frédérique Hilliou3 and
- Philippe Fournier2Email author
© Nègre et al; licensee BioMed Central Ltd. 2006
Received: 21 December 2005
Accepted: 23 June 2006
Published: 23 June 2006
The Lepidoptera Spodoptera frugiperda is a pest which causes widespread economic damage on a variety of crop plants. It is also well known through its famous Sf9 cell line which is used for numerous heterologous protein productions. Species of the Spodoptera genus are used as model for pesticide resistance and to study virus host interactions. A genomic approach is now a critical step for further new developments in biology and pathology of these insects, and the results of ESTs sequencing efforts need to be structured into databases providing an integrated set of tools and informations.
The ESTs from five independent cDNA libraries, prepared from three different S. frugiperda tissues (hemocytes, midgut and fat body) and from the Sf9 cell line, are deposited in the database. These tissues were chosen because of their importance in biological processes such as immune response, development and plant/insect interaction. So far, the SPODOBASE contains 29,325 ESTs, which are cleaned and clustered into non-redundant sets (2294 clusters and 6103 singletons). The SPODOBASE is constructed in such a way that other ESTs from S. frugiperda or other species may be added. User can retrieve information using text searches, pre-formatted queries, query assistant or blast searches. Annotation is provided against NCBI, UNIPROT or Bombyx mori ESTs databases, and with GO-Slim vocabulary.
The SPODOBASE database provides integrated access to expressed sequence tags (EST) from the lepidopteran insect Spodoptera frugiperda. It is a publicly available structured database with insect pest sequences which will allow identification of a number of genes and comprehensive cloning of gene families of interest for scientific community. SPODOBASE is available from URL: http://bioweb.ensam.inra.fr/spodobase
Lepidoptera represent a diverse and important group of agricultural insect pests that cause widespread economic damage on food and fiber crop plants, fruit trees, forests, and stored grains. They are also important indicators of ecosystem diversity and health. Moreover, lepidopteran insects display experimental advantages such as their large body size, accessible genetics, and extreme diversity. They show a large spectrum of interactions with plants and with numerous parasites or pathogens. Among Lepidoptera, the genus Spodoptera is largely studied due to its wide geographical distribution area. Indeed Spodoptera species are scattered over all continents. Presence of S. frugiperda in the American continent and in the Caribbean area has been studied in detail [1, 2]. S. frugiperda larvae cause severe damage on many cultivated crops including corn, rice and maize. S. littoralis is reported to cause damages in Mediterranean and African subtropical regions as well as in China whereas S. litura is found in India, Indonesia and Australia. In addition to being important agricultural pests these noctuids are biological models studied for several purposes. For example,S. frugiperda is well known through its famous Sf21 cell line and its Sf9 subclone  which is used for numerous heterologous protein productions. S. frugiperda is also used to study pesticide resistance [4, 5] and baculovirus host interaction , whereas S. littoralis is a model species to study pheromone regulations [7–9] or densovirus pathogenicity .
The development of new methods of insect pest management is an important challenge for world economy and health and it will be facilitated by a better knowledge of lepidopteran crop pest genomics. Indeed, genome information provides powerful tools for understanding biological mechanisms and functions and is essential for biology, medical science, and agriculture.
Recent years have shown a tremendous development of genome projects of various species, in particular for insects. Among model organisms, genome sequences have been completed in Drosophila melanogaster , the malaria mosquito, Anopheles gambiae , the honeybee Apis mellifera  and the silkworm Bombyx mori [14, 15]. In the year 2002 an International Lepidoptera Genome Consortium was created, which gathers the cooperative efforts of various laboratories in the world on genomic and transcriptomic studies on insects of scientific and economic importance . The project is organized in a "Bombyx – Plus" scheme, where Bombyx mori represents the core node of the knowledge both in terms of genetics, physiology, and EST sequencing . Around this model, the genomic study of a variety of pests of agronomical importance has been encouraged, as functional genomics analysis were still limited by the lack of relevant genome databases for gene identification. Several EST sequencing projects have already begun, but the results of only a few are available, as for example on Choristoneura fumiferana , Helicoverpa armigera , Plutella xylostella or Manduca sexta . Some other butterflies are also investigated . We have developed resources for Spodoptera frugiperda, for which we have created a genomic BAC library  and a set of Expressed Sequence Tags (ESTs) from the well known Sf9 cell line . Other labs have also reported the development of ESTs collections (Rollie J. Clem, Kansas State University, pers. comm.).
Here we present the database, named SPODOBASE, which provides integrated access to expressed sequence tags (EST) from S. frugiperda. The SPODOBASE currently contains 29,325 sequences from various organs (Sf9 cell line, hemocytes, midgut and fat body tissues). The EST sequences were cleaned and clustered into non-redundant sets (2294 clusters and 6103 singletons). User can retrieve information using text searches, pre-formatted queries, query assistant or blast searches.
This database will enable future functional genomics studies of a variety of biological processes such as immunity, endocrinology, reproduction or behavior. Since several physiological processes have been shown to be conserved through evolution, their study in lepidopteran models will help to further elucidate the function of homologous genes and will provide complements to the model insects Drosophila and Anopheles. For example these two model insects lack the receptors for the largely used Bacillus thuringiensis toxin as well as for most of the chemical pesticides (acetyl cholinesterase type). One can predict that analysis of the lepidopteran crop pests will contribute to sustainable agriculture, protection of the environment and maintenance of biodiversity.
Construction and content
1. Construction of cDNA libraries and sequencing
Four directional cDNA libraries were generated for Spodoptera frugiperda larvae. A Sf9 cell line library has been previously constructed and described . To generate the new libraries, different tissues of last larval instars, circulating hemocytes, fat body and midgut, were collected directly in TRIZOL reagent (InVitroGen). Extracted total RNAs were reverse transcribed using the SMART cDNA library Construction Kit (Clontech) according to manufacturer instructions. The library was built in λ Triplex2. From the phages, excision and circularization of pTriplEx plasmid was heat-induced at lox P sites in order to generate a plasmid library to be sequenced. The clones were robot-picked from agarose plates (CIRAD platform, Montpellier) and stored in 20% glycerol LB medium in 96-wells plates. A total of 72, 126 and 191 plates were seeded for the hemocyte (H), fat body (F) and midgut (M) libraries, respectively.
The 37,344 bacterial clones were then spotted on high density Nylon membranes and hybridized with an oligonucleotide probe encompassing the multiple cloning site in order to detect empty plasmid clones. Hybridization was conducted at high stringency and allowed the elimination of around 30 % clones in the different libraries. After colony picking, a limited sequencing test on 1900 clones from the 3 libraries revealed that the percentage of clones without insert was around 9%, showing an effective but non total rearrangement.
A second hybridization was performed with a probe consisting of a mixture of 40 cDNAs, in order to detect clones corresponding to cDNAs that were abundantly represented within the previously analyzed Sf9 library. We were expecting to increase coverage and decrease the number of sequences corresponding to known housekeeping genes. This hybridization leaded to the elimination of 0.7%, 1.9 % and 4.4 % of the clones in F, M and H libraries, respectively. We observed (See A) that the abundance of these clones was significantly reduced by this procedure, as their percentage in the library decreased from 36 % in the initial Sf9 library to 11 % in the four tissues libraries. Elimination was not total, probably because the complex probe does not detect easily all of the 40 genes, but it was still useful to avoid useless sequencing.
To assess inserts size, DNA was extracted from 96, 48 and 48 clones from the H, F and M libraries, respectively using the Qiagen DNA extraction kit. Inserts were amplified by PCR using primers flanking the insert cloning sites and their size was controlled by agarose gel electrophoresis. We found an average size of 1.1, 1.0 and 0.9 kb for the S. frugiperda cDNAs from the H, F and M libraries, respectively.
The libraries were thus re-assorted in a total of 55 plates for the hemocyte library, 87 plates for the fat body library and 149 plates for the midgut library, stored in 5% glycerol 2YT medium, in duplicate. From those, 5184 (54 plates), 6048 (63 plates) and 5952 (62 plates) clones were subjected to sequencing for hemocyte, fat body and midgut libraries, respectively. The plasmid DNAs were extracted from overnight grown bacterial cultures using an automated plasmid isolation machine BIO ROBOT 8000 (Qiagen). The cDNAs were sequenced using ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction kits on an ABI PRISM 3700 DNA Analyzer (Applied Biosystems) in Insect Genome Laboratory of National Institute of Agrobiological Sciences (NIAS, Japan). All clones were sequenced from both 5' and 3' extremities using forward and reverse primers located in the pTriplex vector, in a region flanking the insert. We thus obtained a total of 10,368, 12,096 and 11,904 sequences for hemocytes, fat body and midgut respectively.
A second midgut cDNA library was made from pooled mRNAs extracted from midguts of 3rd instar larvae fed on artificial diet supplemented with various natural products and xenobiotics. This library generated a set of 2,688 sequences.
2. The SPODOBASE pipeline
2-1 EST quality control
Tissue distribution of the number of clones having produced either 5' or 3' end sequencing or both, and subsequent EST numbers in SPODOBASE.
clones with available
5' seq. only
3' seq. only
Both 5' & 3' seq.
Total nb of ESTs
Sf9L, Sf9 cell Line
Sf1F, Fat body
Total per category
Total nb of clones
2-2 EST clustering
2-3 EST assembling
Distribution of the number of contigs among the clusters. The final number of contigs is given.
nb of clusters
nb of contigs
2-4 EST annotation
To identify similarities with known proteins, the sequences were searched using the BLASTX algorithm against a local non-redundant protein database (NR, NCBI, release 151.0, 1st February 2006) with a cut-off E-value of 1e-10. A total of 18,736 (64 %) sequences were found to share significant similarity with a protein sequence deposited in the NCBI non-redundant database.
As genome data (including ESTs) of Bombyx mori are the most important among Lepidoptera, it represents a model organism within this order. We thus subjected the EST sequences to TBLASTX searches against 116,541 B. mori sequences deposited in the NCBI dbEST database with a cut-off E-value of 1e-10. A total of 21,185 (72%) sequences were found to share significant similarities with silkworm EST sequences.
Thus, 24 % (8141) of the S. frugiperda ESTs do not have a match in BLAST searches against neither NCBI nr nor Bombyx mori databases. To identify those that did not match because they may correspond to untranslated regions, a search for predicted coding regions was performed with the software ESTScan . Indeed, from these 8141 sequences, we identified 3624 sequences (44.5 %) lacking predicted coding regions. Consequently, only 15 % of all sequences should be considered as new sequences. At this stage it should also be emphasized that B. mori ESTs database do not represent the total number of putative silkworm genes, thus the TBLASTX should be conducted against whole B. mori genome when it will be annotated. This observation may also be correlated with the phylogenetic distance which separates the two species. Indeed, although the monophyletic origin of Lepidoptera is well admitted , Bombycoidea and Noctuidea are two well distinct super families among this order, separated by probably more than 60 million years [30–32].
We also compared the 2436 contigs and the 6103 singletons to the Uniprot  protein database (release 6.0, September 2005) using the BLASTX program with a 1e-10 cut-off. We found 1178 contigs (48%) and 1809 singletons (30%) that showed a significant similarity with a Uniprot entry.
2-5- GO assignment of the EST sequences in the SPODOBASE
To define the function of the contigs and singletons present in the SPODOBASE, we used the Gene Ontology (GO) controlled vocabulary , and more particularly GOSlim, a subset of GO terms, which provides a higher level of annotations and allows a more global view of the dataset. To this end, we searched for the GOSlim terms (provided by GOA  released on January 2006) associated with the 1178 contigs and 1809 singletons that showed a significant similarity with a Uniprot entry. These identifiers were further used to select the sequences to be printed on a Spodoptera DNA microarray (R. Feyereisen, pers. comm.).
The database is based on the AceDB database management system version , originally created for the worm Caenorhabditis elegans, and used by many databases: WormBase , crop-related databases available from the UK Crop Plant Bioinformatics Network WWW site , MagnaportheDB , ESTHER , ParaDB , TropGene , etc. This is an object-oriented system capable of storing and retrieving complex biological information. The Web server is an Apache Web server version running on Red Hat Linux version. The Web consultation interface is implemented with Perl/CGI scripts, using modules of the AcePerl Application Programming Interface (API) and the AceBrowser generic web interface . The EST pipeline was created with Perl programming language and Bioperl libraries and used additional programs (PHRED for sequence quality control, BLAST for contaminant detection and annotation step, TGICL for clustering and assembling).
Utility and discussion
1- User interface
For each sequence, series of information are available including the direction of sequencing, the existence of the other direction sequence, the relation to an existing cluster, the 10 best hits of BLASTX against NCBI and Bombyx EST database, and the library where the sequence was found. For each cluster, the software displays the distribution of sequences among the different tissue libraries, and gives the list of sequences belonging to the cluster; it offers the possibility to visualize their alignment and to download the FASTA file comprising all of them. The 10 best hits of BLASTX against Uniprot and G0 annotations are available for each contig and singleton. Users can query database in several ways. Information can be retrieved according to text search or using a query assistant.
1-1- Classical AceDB queries
User can query database with AceDB data queries (Class, Text and AceDB queries). Class query allows the user to retrieve objects by class, with the possibility of restricting the search to names that match a pattern. Text query is a keyword-based search on all the data. AceDB query uses the Ace Query Language (AQL), which was created to formulate complex queries based on several criteria. In order to create an AQL request, the user must know the structure of the object model and learn a specific syntax. However some examples of classical questions written in AQL can be found at the AQLquery top page.
1-2- Query assistant
To help the user for retrieval, we implemented the QueryBuilder tool . This is a step-by-step graphic interface to formulate Ace queries. Five initial choices are proposed, concerning the clusters, the singletons, the libraries, the contigs or the sequences themselves. After this, the retrieval can be directed within a specific field and the chain of characters or numbers to be found are used in combination with the classical Boolean operators.
1-3- BLAST search
Users can search for similarities between their own sequences using BLASTN, TBLASTN or TBLASTX searches against the whole set of S. frugiperda EST sequences.
2- Intended uses
The database provides an overview of S. frugiperda transcripts. One of the major interests of the SPODOBASE consists in the large number of sequences and the existence of 5 different tissues cDNA libraries. The database can be used, among other applications, for functional genomics (primer design for micro-array analysis), to identify the genes expressed predominantly in a given tissue, and to compare genes between different species. On the basis of extensive sequence-based analysis of relationships among noctuids, it has been recently shown  that Spodoptera is relatively close to a group of species called the "pest clade" and including Heliothinae and Noctuinae s. l. Actually SPODOBASE is constructed in such a way that it can welcome large numbers of additional sequences from other different tissues of S. frugiperda, as well as from other Spodoptera species. The implementation of S. littoralis ESTs is already programmed for a near future.
The SPODOBASE represent a major contribution to the genomics of Spodoptera frugiperda. Together with BAC library, existence of various cell lines and expression systems, this makes of S. frugiperda of the most advanced models among agricultural pests in terms of genomic resources. SPODOBASE contains EST sequences that are cleaned, clusterized and annotated. These informations are available to serve insect research community, provide better understanding of the Lepidoptera physiology and identify new molecules targeted against Lepidoptera pests that could be used as safe biopesticides for sustainable agriculture.
Availability and requirements
The database is publicly available at the following URL:http://bioweb.ensam.inra.fr/spodobase. All sequences could be downloaded from SPODOBASE (see Download section). They have also been deposited in dbEST database (accession numbers for midgut library: DV075863 to DV080045 and DY786624 to DY7927772; fat body library: DY773453 to DY780623; hemocytes: DY773453 to DY780623; Sf9 cell line library: DY895775 to DY901596).
bp: base pairs
EST: Expressed Sequence Tags
cDNA: copy DNA
ribosomal protein L8
ribosomal protein S23
cytochrome c oxydase subunit III
ribosomal protein L14
ribosomal protein L23
ribosomal protein S11
ribosomal protein L35A
ribosomal protein L10A
ribosomal protein L24
ribosomal protein S14
ribosomal protein L22
ribosomal protein L37A
ribosomal protein S8
ribosomal protein L32
ribosomal protein S10
ribosomal protein L13A
cytochrome c oxydase subunit II
ribosomal protein L37
ribosomal protein S26
ribosomal protein S3A
ribosomal protein L39
ribosomal protein S25
ribosomal protein L28
ribosomal protein S17
ribosomal protein L31
ribosomal protein S13
ribosomal protein S4
ribosomal protein L27
ribosomal protein L40
ribosomal protein S12
ribosomal protein L36A
ribosomal protein S20
ribosomal protein S24
ribosomal protein L21
ribosomal protein L27A
ribosomal protein L13
ribosomal protein S3
S = 5822
S = 14382
In additional to annual financial support from INRA (SPE department) and University of Montpellier, this work was specifically funded by grants of the Bureau des Ressources Génétiques, of the Ministère de la Recherche (Programme Séquençage à Grande Echelle), and by the INRA programme called AIP-Séquençage. We gratefully thank René Feyereisen (INRA Sophia-Antipolis) for his help in the coordination of the French Lep genome programme and for involvement of several contributors from his lab. We also thank Cyril Berthenet and Ned Lamb (IGH, CNRS) for their technical support.
- Molina-Ochoa J, Carpenter JE, Heinrichs EA, Foster JE: Parasitoids and parasites of Spodoptera frugiperda (Lepidoptera: Noctuidae) in the Americas and Caribbean Basin: an inventory. Florida Entomologist 2003, 86: 254–289. 10.1653/0015-4040(2003)086[0254:PAPOSF]2.0.CO;2View ArticleGoogle Scholar
- Molina-Ochoa J, Lezama-Gutierrez R, Gonzalez-Ramirez M, Lopez-Edwards M, Rodriguez-Vega MA, Arceo-Palacios F: Pathogens and parasitic nematodes associated with populations of fall armyworm (Lepidoptera: Noctuidae) larvae in Mexico. Florida Entomologist 2003, 86: 244–253. 10.1653/0015-4040(2003)086[0244:PAPNAW]2.0.CO;2View ArticleGoogle Scholar
- Vaughn JL, Goodwin RH, Tompkins GJ, McCawley P: The establishment of two cell lines from the insect Spodoptera frugiperda (Lepidoptera; Noctuidae). In Vitro 1977, 13(4):213–217.View ArticlePubMedGoogle Scholar
- Morillo F, Notz A: Resistance of Spodoptera frugiperda (Smith) (Lepidoptera: Noctuidae) to lambdacyhalothrin and methomyl. Entomotropica 2001, 16(2):79–87.Google Scholar
- Yu SJ, Nguyen SN, Abo-Elghar GE: Biochemical characteristics of insecticide resistance in the fall armyworm, Spodoptera frugiperda (J.E. Smith). Pesticide Biochemistry and Physiology 2003, 77(1):1–11. 10.1016/S0048-3575(03)00079-8View ArticleGoogle Scholar
- Simón O, Williams T, López-Ferber M, Caballero P: Genetic structure of a Spodoptera frugiperda nucleopolyhedrovirus population: High prevalence of deletion genotypes. Applied and Environmental Microbiology 2004, 10(9):5579–5588. 10.1128/AEM.70.9.5579-5588.2004View ArticleGoogle Scholar
- Martinez T, Fabrias G, Camps F: Sex pheromone biosynthetic pathway in Spodoptera littoralis and its activation by a neurohormone. J Biol Chem 1990, 265(3):1381–1387.PubMedGoogle Scholar
- Sadek MM, Hansson BS, Rospars JP, Anton S: Glomerular representation of plant volatiles and sex pheromone components in the antennal lobe of the female Spodoptera littoralis . J Exp Biol 2002, 205(Pt 10):1363–1376.PubMedGoogle Scholar
- Iglesias F, Marco P, Francois MC, Camps F, Fabriàs G, Jacquin-Joly E: A new member of the PBAN family in Spodoptera littoralis : molecular cloning and immunovisualisation in scotophase hemolymph. Insect Biochemistry and Molecular Biology 2002, 32(8):901–908. 10.1016/S0965-1748(01)00179-5View ArticlePubMedGoogle Scholar
- El-Mergawy R, Li Y, El-Sheikh M, El-Sayed M, Abol-Ela S, Bergoin M, Tijssen P, Fediere G: Epidemiology and biodiversity of the densovirus MlDNV in the field populations of Spodoptera littoralis and other noctuid pests. Bulletin of Faculty of Agriculture, Cairo University 2003, 54(2):269–281.Google Scholar
- Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD, Amanatides PG, Scherer SE, Li PW, Hoskins RA, Galle RF, George RA, Lewis SE, Richards S, Ashburner M, Henderson SN, Sutton GG, Wortman JR, Yandell MD, Zhang Q, Chen LX, Brandon RC, Rogers YH, Blazej RG, Champe M, Pfeiffer BD, Wan KH, Doyle C, Baxter EG, Helt G, Nelson CR, Gabor Miklos GL, Abril JF, Agbayani A, An HJ, Andrews-Pfannkoch C, Baldwin D, Ballew RM, Basu A, Baxendale J, Bayraktaroglu L, Beasley EM, Beeson KY, Benos PV, Berman BP, Bhandari D, Bolshakov S, Borkova D, Botchan MR, Bouck J, Brokstein P, Brottier P, Burtis KC, Busam DA, Butler H, Cadieu E, Center A, Chandra I, Cherry JM, Cawley S, Dahlke C, Davenport LB, Davies P, de Pablos B, Delcher A, Deng Z, Mays AD, Dew I, Dietz SM, Dodson K, Doup LE, Downes M, Dugan-Rocha S, Dunkov BC, Dunn P, Durbin KJ, Evangelista CC, Ferraz C, Ferriera S, Fleischmann W, Fosler C, Gabrielian AE, Garg NS, Gelbart WM, Glasser K, Glodek A, Gong F, Gorrell JH, Gu Z, Guan P, Harris M, Harris NL, Harvey D, Heiman TJ, Hernandez JR, Houck J, Hostin D, Houston KA, Howland TJ, Wei MH, Ibegwam C, Jalali M, Kalush F, Karpen GH, Ke Z, Kennison JA, Ketchum KA, Kimmel BE, Kodira CD, Kraft C, Kravitz S, Kulp D, Lai Z, Lasko P, Lei Y, Levitsky AA, Li J, Li Z, Liang Y, Lin X, Liu X, Mattei B, McIntosh TC, McLeod MP, McPherson D, Merkulov G, Milshina NV, Mobarry C, Morris J, Moshrefi A, Mount SM, Moy M, Murphy B, Murphy L, Muzny DM, Nelson DL, Nelson DR, Nelson KA, Nixon K, Nusskern DR, Pacleb JM, Palazzolo M, Pittman GS, Pan S, Pollard J, Puri V, Reese MG, Reinert K, Remington K, Saunders RD, Scheeler F, Shen H, Shue BC, Siden-Kiamos I, Simpson M, Skupski MP, Smith T, Spier E, Spradling AC, Stapleton M, Strong R, Sun E, Svirskas R, Tector C, Turner R, Venter E, Wang AH, Wang X, Wang ZY, Wassarman DA, Weinstock GM, Weissenbach J, Williams SM, Woodage T, Worley KC, Wu D, Yang S, Yao QA, Ye J, Yeh RF, Zaveri JS, Zhan M, Zhang G, Zhao Q, Zheng L, Zheng XH, Zhong FN, Zhong W, Zhou X, Zhu S, Zhu X, Smith HO, Gibbs RA, Myers EW, Rubin GM, Venter JC: The genome sequence of Drosophila melanogaster . Science 2000, 287(5461):2185–2195. 10.1126/science.287.5461.2185View ArticlePubMedGoogle Scholar
- Holt RA, Subramanian GM, Halpern A, Sutton GG, Charlab R, Nusskern DR, Wincker P, Clark AG, Ribeiro JM, Wides R, Salzberg SL, Loftus B, Yandell M, Majoros WH, Rusch DB, Lai Z, Kraft CL, Abril JF, Anthouard V, Arensburger P, Atkinson PW, Baden H, de Berardinis V, Baldwin D, Benes V, Biedler J, Blass C, Bolanos R, Boscus D, Barnstead M, Cai S, Center A, Chaturverdi K, Christophides GK, Chrystal MA, Clamp M, Cravchik A, Curwen V, Dana A, Delcher A, Dew I, Evans CA, Flanigan M, Grundschober-Freimoser A, Friedli L, Gu Z, Guan P, Guigo R, Hillenmeyer ME, Hladun SL, Hogan JR, Hong YS, Hoover J, Jaillon O, Ke Z, Kodira C, Kokoza E, Koutsos A, Letunic I, Levitsky A, Liang Y, Lin JJ, Lobo NF, Lopez JR, Malek JA, McIntosh TC, Meister S, Miller J, Mobarry C, Mongin E, Murphy SD, O'Brochta DA, Pfannkoch C, Qi R, Regier MA, Remington K, Shao H, Sharakhova MV, Sitter CD, Shetty J, Smith TJ, Strong R, Sun J, Thomasova D, Ton LQ, Topalis P, Tu Z, Unger MF, Walenz B, Wang A, Wang J, Wang M, Wang X, Woodford KJ, Wortman JR, Wu M, Yao A, Zdobnov EM, Zhang H, Zhao Q, Zhao S, Zhu SC, Zhimulev I, Coluzzi M, della Torre A, Roth CW, Louis C, Kalush F, Mural RJ, Myers EW, Adams MD, Smith HO, Broder S, Gardner MJ, Fraser CM, Birney E, Bork P, Brey PT, Venter JC, Weissenbach J, Kafatos FC, Collins FH, Hoffman SL: The genome sequence of the malaria mosquito Anopheles gambiae . Science 2002, 298(5591):129–149. 10.1126/science.1076181View ArticlePubMedGoogle Scholar
- Mita K, Kasahara M, Sasaki S, Nagayasu Y, Yamada T, Kanamori H, Namiki N, Kitagawa M, Yamashita H, Yasukochi Y, Kadono-Okuda K, Yamamoto K, Ajimura M, Ravikumar G, Shimomura M, Nagamura Y, Shin IT, Abe H, Shimada T, Morishita S, Sasaki T: The genome sequence of silkworm, Bombyx mori . DNA Res 2004, 11(1):27–35. 10.1093/dnares/11.1.27View ArticlePubMedGoogle Scholar
- Xia Q, Zhou Z, Lu C, Cheng D, Dai F, Li B, Zhao P, Zha X, Cheng T, Chai C, Pan G, Xu J, Liu C, Lin Y, Qian J, Hou Y, Wu Z, Li G, Pan M, Li C, Shen Y, Lan X, Yuan L, Li T, Xu H, Yang G, Wan Y, Zhu Y, Yu M, Shen W, Wu D, Xiang Z, Yu J, Wang J, Li R, Shi J, Li H, Li G, Su J, Wang X, Li G, Zhang Z, Wu Q, Li J, Zhang Q, Wei N, Xu J, Sun H, Dong L, Liu D, Zhao S, Zhao X, Meng Q, Lan F, Huang X, Li Y, Fang L, Li C, Li D, Sun Y, Zhang Z, Yang Z, Huang Y, Xi Y, Qi Q, He D, Huang H, Zhang X, Wang Z, Li W, Cao Y, Yu Y, Yu H, Li J, Ye J, Chen H, Zhou Y, Liu B, Wang J, Ye J, Ji H, Li S, Ni P, Zhang J, Zhang Y, Zheng H, Mao B, Wang W, Ye C, Li S, Wang J, Wong GK, Yang H: A draft sequence for the genome of the domesticated silkworm ( Bombyx mori ). Science 2004, 306(5703):1937–1940. 10.1126/science.1102210View ArticlePubMedGoogle Scholar
- Wang J, Xia Q, He X, Dai M, Ruan J, Chen J, Yu G, Yuan H, Hu Y, Li R, Feng T, Ye C, Lu C, Wang J, Li S, Wong GK, Yang H, Wang J, Xiang Z, Zhou Z, Yu J: SilkDB: a knowledgebase for silkworm biology and genomics. Nucleic Acids Res 2005, 33(Database):D399–402. 10.1093/nar/gki116PubMed CentralPubMedGoogle Scholar
- Papanicolaou A, Joron M, McMillan WO, Blaxter ML, Jiggins CD: Genomic tools and cDNA derived markers for butterflies. Mol Ecol 2005, 14(9):2883–2897. 10.1111/j.1365-294X.2005.02609.xView ArticlePubMedGoogle Scholar
- Boguski MS, Lowe TM, Tolstoshev CM: dbEST-database for "expressed sequence tags". Nat Genet 1993, 4(4):332–3. 10.1038/ng0893-332View ArticlePubMedGoogle Scholar
- d'Alençon E, Piffanelli P, Volkoff AN, Sabau X, Gimenez S, Rocher J, Cérutti P, Fournier P: A genomic BAC library and a new BAC-GFP vector to study the holocentric pest Spodoptera frugiperda . Insect Biochemistry and Molecular Biology 2004, 34(4):331–341. 10.1016/j.ibmb.2003.12.004View ArticlePubMedGoogle Scholar
- Landais I, Ogliastro M, Mita K, Nohata J, López-Ferber M, Duonor-Cérutti M, Shimada T, Fournier P, Devauchelle G: Annotation pattern of ESTs from Spodoptera frugiperda Sf9 cells and analysis of the ribosomal protein genes reveal insect-specific features and unexpectedly low codon usage bias. Bioinformatics 2003, 19(18):2343–2350. 10.1093/bioinformatics/btg324View ArticlePubMedGoogle Scholar
- Ewing B, Green P: Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 1998, 8(3):186–194.View ArticlePubMedGoogle Scholar
- Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 1998, 8(3):175–185.View ArticlePubMedGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215(3):403–410. 10.1006/jmbi.1990.9999View ArticlePubMedGoogle Scholar
- Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J: TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 2003, 19(5):651–652. 10.1093/bioinformatics/btg034View ArticlePubMedGoogle Scholar
- Iseli C, Jongeneel CV, Bucher P: ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol 1999, 138–48.Google Scholar
- Whiting MF: Phylogeny of the holometabolous insect orders: molecular evidence. Zoologica Scripta 2002, 31(1):3–15. 10.1046/j.0300-3256.2001.00093.xView ArticleGoogle Scholar
- Merritt TJS, LaForest S, Prestwihc GD, Quattro JM, Vogt RG: Patterns of gene duplication in lepidopteran pheromone binding proteins. J Mol Evol 1998, 46: 272–276. 10.1007/PL00006303View ArticlePubMedGoogle Scholar
- Minet J: Tentative reconstruction of the ditrysian phylogeny (Lepidoptera: Glossata). Entomologica Scandinavica 1991, 22(1):69–95.View ArticleGoogle Scholar
- Gaunt MW, Miles MA: An insect molecular clock dates the origin of the insects and accords with paleontological and biogeographic landmarks. Mol Biol Evol 2002, 19(5):748–761.View ArticlePubMedGoogle Scholar
- Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LS: UniProt: the Universal Protein knowledgebase. Nucleic Acids Res 2004, 32(Database):D115–119. 10.1093/nar/gkh131PubMed CentralView ArticlePubMedGoogle Scholar
- Gene Ontology Consortium: The Gene Ontology (GO) project in 2006. Nucleic Acids Res 34(Database):D322–6. 2006 Jan 1 2006 Jan 1Google Scholar
- Camon E, Magrane M, Barrell D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R: The Gene Ontology Annotation (GOA) Database : sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res 32(Database):D262–6. 2004 Jan 1 2004 Jan 1Google Scholar
- Harris TW, Lee R, Schwarz E, Bradnam K, Lawson D, Chen W, Blasier D, Kenny E, Cunningham F, Kishore R, Chan J, Muller HM, Petcherski A, Thorisson G, Day A, Bieri T, Rogers A, Chen CK, Spieth J, Sternberg P, Durbin R, Stein LD: WormBase: a cross-species database for comparative genomics. Nucleic Acids Res 2003, 31(1):133–137. 10.1093/nar/gkg053PubMed CentralView ArticlePubMedGoogle Scholar
- Dicks J, Anderson M, Cardle L, Cartinhour S, Couchman M, Davenport G, Dickson J, Gale M, Marshall D, May S, McWilliam H, O'Malia A, Ougham H, Trick M, Walsh S, Waugh R: UK CropNet: a collection of databases and bioinformatics resources for crop plant genomics. Nucleic Acids Res 2000, 28(1):104–107. 10.1093/nar/28.1.104PubMed CentralView ArticlePubMedGoogle Scholar
- Martin SL, Blackmon BP, Rajagopalan R, Houfek TD, Sceeles RG, Denn SO, Mitchell TK, Brown DE, Wing RA, Dean RA: MagnaportheDB: a federated solution for integrating physical and genetic map data with BAC end derived sequences for the rice blast fungus Magnaporthe grisea . Nucleic Acids Res 2002, 30(1):121–124. 10.1093/nar/30.1.121PubMed CentralView ArticlePubMedGoogle Scholar
- Cousin X, Hotelier T, Giles K, Toutant JP, Chatonnet A: aCHEdb: the database system for ESTHER, the alpha/beta fold family of proteins and the Cholinesterase gene server. Nucleic Acids Res 1998, 26(1):226–228. 10.1093/nar/26.1.226PubMed CentralView ArticlePubMedGoogle Scholar
- Leveugle M, Prat K, Perrier N, Birnbaum D, Coulier F: ParaDB: a tool for paralogy mapping in vertebrate genomes. Nucleic Acids Res 2003, 31(1):63–67. 10.1093/nar/gkg106PubMed CentralView ArticlePubMedGoogle Scholar
- Ruiz M, Rouard M, Raboin LM, Lartaud M, Lagoda P, Courtois B: TropGENE-DB, a multi-tropical crop information system. Nucleic Acids Res 2004, 32(Database):D364–367. 10.1093/nar/gkh105PubMed CentralView ArticlePubMedGoogle Scholar
- Stein LD, Thierry-Mieg J: Scriptable access to the Caenorhabditis elegan s genome sequence and other ACEDB databases. Genome Res 1998, 8(12):1308–1315.PubMed CentralPubMedGoogle Scholar
- Mitchell A, Mitter C, Regier JC: Systematics and evolution of the cutworm moths (Lepidoptera: Noctuidae): evidence from two protein-coding nuclear genes. Systematic Entomology 2006, 31: 21–46. 10.1111/j.1365-3113.2005.00306.xView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.