RiboaptDB: A Comprehensive Database of Ribozymes and Aptamers
© Thodima et al. 2006
Published: 26 September 2006
Catalytic RNA molecules are called ribozymes. Aptamers are DNA or RNA molecules that have been selected from vast populations of random sequences through a combinatorial approach known as SELEX. The selected oligonucleotide sequences (~200 bp in length) recognize a broad range of specific ligands by forming binding pockets. These novel aptamer sequences can bind to nucleic acids, proteins, or small organic and inorganic chemical compounds, and have many potential uses in medicine and technology.
Comprehensive sequence information on aptamers and ribozymes generated by in vitro selection methods is included in RiboaptDB. Such unnatural data generated by in vitro methods are not available in the public 'natural' sequence databases such as GenBank and EMBL. The amount of sequence data generated by in vitro selection experiments has been accumulating exponentially. RiboaptDB contains 4212 sequences from 423 citations: 370 artificial ribozyme sequences and 3842 aptamer sequences. We have included a general search feature, feature-wise searches on individual fields, an online submission form for new data, and a local BLAST search.
This database, besides serving as a storehouse of sequences that may have diagnostic or therapeutic utility in medicine, provides valuable information for computational and theoretical biologists. RiboaptDB is extremely useful for garnering information about in vitro selection experiments as a whole and for better understanding the distribution of functional nucleic acids in sequence space. The database is updated regularly and is publicly available at http://mfgn.usm.edu/ebl/riboapt/.
Until about 25 years ago, all known enzymes were proteins. It was then discovered that some RNA molecules also have enzymatic properties; that is, they catalyze covalent changes in the structure of substrates (most of which are also RNA molecules) [1–3]. Catalytic RNA molecules are called ribozymes. Since the discovery of ribozymes in living organisms, there has been considerable interest in new synthetic ribozymes made in the laboratory. The Tang and Breaker lab first isolated self-cleaving RNAs from random-sequence RNAs using an in vitro selection method. A large number of self-cleaving RNAs with good enzymatic activity have since been produced [5–7]. Some of these synthetic ribozymes had novel structures, while others were similar to the naturally occurring hammerhead ribozyme [2, 8].
Aptamers are DNA or RNA molecules with a desired binding affinity, selected by SELEX (Systematic Evolution of Ligands by Exponential enrichment). SELEX is an iterative in vitro process that isolates binding aptamers from a random pool and amplifies the isolated sequences by the polymerase chain reaction after each round of isolation [9–16]. The selected oligonucleotide sequences (~200 bp in length) recognize specific ligands by forming binding pockets and can bind to nucleic acids, proteins, small organic or inorganic chemical compounds, and even viruses [17–25].
Aptamers are a promising class of compounds, both for target validation and therapy. As designer drugs, they exhibit high specificity, high affinity, and modifiable bioavailability [26–30]. The ability to generate inhibitors with such properties against a variety of target proteins will be invaluable as the human genome and proteome are deciphered [12, 31–37].
RiboaptDB is useful not only for identifying available aptamers and artificial ribozymes, but also for acquiring information about in vitro selection experiments, such as the type of nucleic acid, the type of target and the experimental conditions, and for better understanding the distribution of functional nucleic acids in a given sequence space. As with other types of sequences, the number of sequences generated by in vitro selection experiments has been accumulating exponentially [10, 14]. The sheer number and diversity of selection experiments have risen to the point where it is now essential to gather all the sequence data into a comprehensive, continuously updated database. General sequence databases such as GenBank, EMBL and DDBJ do not maintain a complete collection of artificial nucleic acid sequences such as aptamers and ribozymes. Another resource, the 'Aptamer database', also contains a large amount of this type of data but is not regularly updated with new data [38, 39].
The "sequence" table is the key table in the database, to which all other tables are related directly or indirectly. It contains the sequence ID and relates directly to its child tables, "aptamer" and "ribozyme", which contain the corresponding sequence information. The other important tables are "publication", which stores citation information such as title, journal name, authors and PubMed ID, and "experiment", which stores experiment details such as template type and experimental conditions. Target-specific information, namely the target name and its category ('organic', 'inorganic', 'nucleic', 'peptide', 'protein' or 'other'), is obtained from the "target" table. If any information about non-canonical base pairs is available, it can be retrieved through the "non-canonical" table.
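The table layout described above can be sketched as a small relational schema. The following is only an illustration built with Python's sqlite3 module: the table names follow the text (with "non-canonical" written as non_canonical), but every column name, the foreign keys placed on "sequence", and the inserted row are assumptions made for the example and are not taken from the actual RiboaptDB implementation.

```python
import sqlite3

# Hypothetical schema: table names follow the text, all column names are assumed.
SCHEMA = """
CREATE TABLE publication (
    publication_id INTEGER PRIMARY KEY,
    title TEXT, journal TEXT, authors TEXT, pubmed_id TEXT
);
CREATE TABLE experiment (
    experiment_id INTEGER PRIMARY KEY,
    template_type TEXT, conditions TEXT
);
CREATE TABLE target (
    target_id INTEGER PRIMARY KEY,
    name TEXT,
    category TEXT CHECK (category IN
        ('organic', 'inorganic', 'nucleic', 'peptide', 'protein', 'other'))
);
-- Key table: every other table is reachable from a sequence ID.
CREATE TABLE sequence (
    sequence_id INTEGER PRIMARY KEY,
    publication_id INTEGER REFERENCES publication(publication_id),
    experiment_id INTEGER REFERENCES experiment(experiment_id),
    target_id INTEGER REFERENCES target(target_id)
);
-- Child tables holding the sequence data itself.
CREATE TABLE aptamer (
    sequence_id INTEGER PRIMARY KEY REFERENCES sequence(sequence_id),
    residues TEXT
);
CREATE TABLE ribozyme (
    sequence_id INTEGER PRIMARY KEY REFERENCES sequence(sequence_id),
    residues TEXT
);
CREATE TABLE non_canonical (
    sequence_id INTEGER REFERENCES sequence(sequence_id),
    base_pair TEXT
);
"""


def build_demo_db():
    """Create an in-memory database holding one placeholder aptamer record."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Placeholder values only; none of this is real RiboaptDB data.
    conn.execute(
        "INSERT INTO publication VALUES "
        "(1, 'Placeholder title', 'Placeholder journal', 'A. Author', '0')"
    )
    conn.execute("INSERT INTO experiment VALUES (1, 'RNA', 'placeholder conditions')")
    conn.execute("INSERT INTO target VALUES (1, 'placeholder ligand', 'organic')")
    conn.execute("INSERT INTO sequence VALUES (1, 1, 1, 1)")
    conn.execute("INSERT INTO aptamer VALUES (1, 'GGAUCC')")
    return conn


def aptamers_with_targets(conn):
    """Join the key table to its child, target and publication tables."""
    query = """
        SELECT s.sequence_id, a.residues, t.name, t.category, p.pubmed_id
        FROM sequence AS s
        JOIN aptamer AS a ON a.sequence_id = s.sequence_id
        JOIN target AS t ON t.target_id = s.target_id
        JOIN publication AS p ON p.publication_id = s.publication_id
        ORDER BY s.sequence_id
    """
    return conn.execute(query).fetchall()
```

Keeping the sequence ID in one central table means that an aptamer or ribozyme record can be linked to its citation, experiment and target with plain joins, which is one plausible reading of "related directly or indirectly".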
RiboaptDB is a relatively small database but is, nonetheless, essentially complete. The data were sourced from a previous compilation and from exhaustive searching of the primary literature. The database currently holds 4212 sequences from 423 citations.
Of these 4212 sequences, 370 are artificial ribozyme sequences and 3842 are aptamer sequences. The database is updated every month as new literature on aptamers and artificial ribozyme sequences appears. The initial collection of data is done by searching NCBI PubMed for literature with keywords such as 'artificial ribozymes', 'ribozyme', 'aptamers' and 'SELEX'. The usefulness of a database is governed by the accuracy of the data it contains. The data in this database are compiled manually from previously published, peer-reviewed articles and then verified.
The general search option provides an interface for a variety of queries to the database. It can be used to search by sequence, experiment, target, author, publication and non-canonical base pair information, restricted to ribozymes, aptamers or both, and to natural or artificial sequences.
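The kind of filtering the general search performs can be illustrated with a minimal in-memory sketch. The Record fields, the search function and its parameters are all hypothetical names chosen for this example, and the records are placeholders; the real search interface runs against the database and may behave differently.

```python
from dataclasses import dataclass


@dataclass
class Record:
    # Hypothetical flat view of one database entry.
    kind: str      # "ribozyme" or "aptamer"
    origin: str    # "natural" or "artificial"
    sequence: str
    target: str
    author: str


def search(records, field, keyword, kind="both", origin=None):
    """Return records whose `field` contains `keyword` (case-insensitive).

    kind   -- "ribozyme", "aptamer" or "both"
    origin -- "natural", "artificial" or None for no restriction
    """
    keyword = keyword.lower()
    hits = []
    for rec in records:
        if kind != "both" and rec.kind != kind:
            continue
        if origin is not None and rec.origin != origin:
            continue
        if keyword in getattr(rec, field).lower():
            hits.append(rec)
    return hits


# Placeholder records, not real RiboaptDB entries.
DEMO = [
    Record("aptamer", "artificial", "GGAUCC", "ATP", "A. Author"),
    Record("ribozyme", "artificial", "CUGAUGA", "RNA substrate", "B. Author"),
    Record("ribozyme", "natural", "CUGAUGAG", "RNA substrate", "A. Author"),
]

atp_aptamers = search(DEMO, "target", "atp", kind="aptamer")
artificial_by_author = search(DEMO, "author", "a. author", origin="artificial")
```

Here the sequence type and origin act as restrictions applied before the keyword match, mirroring the way the text describes combining a searched field with the ribozyme/aptamer and natural/artificial choices.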
As an alternative to the general search option on the top menu, a search option on the side menu of the home page searches the whole database for a specific keyword. A table-specific search is also available on the side menu of each related page. The user can also save the selected sequences to a text file for further study.
The idea behind combining ribozyme and aptamer data in one database is to increase the chance of generating ribozymes with modified and novel properties. One example is combining the 'target identification' of aptamers with the 'catalytic activity' of ribozymes in a commercial 'riboswitch' application [42–45].
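Saving selected sequences to a text file might look like the sketch below. The text does not state the file layout, so FASTA is used here purely as an assumption; the write_fasta function, its parameters and the identifiers are hypothetical.

```python
import io


def write_fasta(selected, handle, width=60):
    """Write (identifier, residues) pairs to `handle` in FASTA layout.

    FASTA is an assumed output format; `width` is the maximum line length
    used when wrapping the residues.
    """
    for seq_id, residues in selected:
        handle.write(f">{seq_id}\n")
        for start in range(0, len(residues), width):
            handle.write(residues[start:start + width] + "\n")


# Placeholder selection, written to an in-memory buffer instead of a file.
buffer = io.StringIO()
write_fasta([("demo_1", "GGAUCC"), ("demo_2", "CUGAUGAG")], buffer)
exported_text = buffer.getvalue()
```

Passing an open file object (for example from open("selected.txt", "w")) in place of the buffer would write the same text to disk.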
The RiboaptDB project is young. With respect to future work, the database needs to be maintained and developed regularly, ensuring that our links to external databases remain up to date and that newly published data are added. As with all databases, random errors will initially have occurred through human error during data accumulation or will be present in the original experimental data. The database will be assessed for errors and inconsistencies, thus maintaining, as far as possible, the overall veracity of our data.
The goal of the RiboaptDB constructors was to collect all ribozyme and aptamer sequences that have appeared to date and to annotate them correctly and in detail. Ease of access to the data is of great importance, and the bespoke search system and the inclusion of a BLAST search greatly facilitate this. The better the organisation of the data, the easier the work will be for researchers dealing with aptamers and ribozymes.
RiboaptDB was created and is maintained in the Department of Biological Sciences at the University of Southern Mississippi. It is publicly available at http://mfgn.usm.edu/ebl/riboapt/.
SELEX: Systematic Evolution of Ligands by Exponential enrichment
BLAST: Basic Local Alignment Search Tool
EMBL: European Molecular Biology Laboratory
DDBJ: DNA Data Bank of Japan
This work was supported by a Dean's Research Initiative award of the University of Southern Mississippi to Youping Deng and by the Mississippi Functional Genomics Network (DHHS/NIH/NCRR Grant# 2P20RR016476-04).
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.