MYCbase: a database of functional sites and biochemical properties of Myc in both normal and cancer cells
BMC Bioinformatics volume 18, Article number: 224 (2017)
Myc is an essential gene having multiple functions such as in cell growth, differentiation, apoptosis, genomic stability, angiogenesis, and disease biology. A large number of researchers dedicated to Myc biology are generating a substantial amount of data in normal and cancer cells/tissues including Burkitt’s lymphoma and ovarian cancer.
MYCbase (http://bicresources.jcbose.ac.in/ssaha4/mycbase) is a collection of experimentally supported functional sites in Myc that can influence the biological cellular processes. The functional sites were compiled according to their role which includes mutation, methylation pattern, post-translational modifications, protein-protein interactions (PPIs), and DNA interactions. In addition, biochemical properties of Myc are also compiled, which includes metabolism/pathway, protein abundance, and modulators of protein-protein interactions. The OMICS data related to Myc- like gene expression, proteomics expression using mass-spectrometry and miRNAs targeting Myc were also compiled in MYCbase. The mutation and pathway data from the MYCbase were analyzed to look at the patterns and distributions across different diseases. There were few proteins/genes found common in Myc-protein interactions and Myc-DNA binding, and these can play a significant role in transcriptional feedback loops.
In this report, we present a comprehensive integration of relevant information regarding Myc in the form of MYCbase. The data compiled in MYCbase provides a reliable data resource for functional sites at the residue level and biochemical properties of Myc in various cancers.
Myc is a multifunctional protein and is believed to regulate expression of 15% of all genes through binding on Enhancer Box sequences (E-boxes) CACGTG and recruiting histone acetyltransferases (HATs) . It has a major role to play in the cell cycle, cell growth, differentiation, apoptosis, transformation, genomic stability and angiogenesis . Since it is a key regulator of all essential activities in the cell, its over-expression is responsible for causing different types of cancers including ovarian cancer  and hepatocellular carcinoma . The role of Myc in the development of Burkitt’s lymphoma has been well documented. In many human malignant tumors, the overexpression of Myc by gene amplification, proviral insertions, as well as chromosomal translocation has been observed . Along with these types of mutations, non-synonymous point mutations have also been widely reported in the scientific literature . These mutations result in deregulation of genes involved in cell proliferation as most genes under the control of Myc are vital for survival . c-Myc protein is composed of an N-terminal domain (NTD), a C-Terminal domain (CTD) and a central region which is intrinsically disordered. The NTD, consisting of short motifs called the Myc boxes, for example, Myc box I (MbI) and Myc box II (MbII), is involved in most of the known Protein-protein Interactions (PPIs) . On the other hand, the CTD, composed of the basic helix-loop-helix and leucine zipper region (bHLH-LZ), is involved in interaction with Max to form Myc-Max heterodimers . Myc-Max dimer is responsible for transactivation of many genes which lead to proliferation and cancer . The central region, being disordered, may be looked into to find short sequence motifs important in mediating PPIs .
Other relevant information
With the advent of new technologies such as high throughput proteomics and genomics and development of epigenetics, much new information about Myc has been generated. The methylation pattern of the c-Myc gene has also been linked to colorectal cancer  which gives an insight into epigenetic regulation. Myc is stabilized by post-translational modification (PTM), most importantly by phosphorylation at Serine 62 (S62) . This event primes c-Myc for the second phosphorylation at Threonine 58 (T58) which enhances its degradation through recruitment of FBXW7. The role of PTMs, especially phosphorylation and ubiquitination, in the regulation of biological processes mediated by Myc has been of interest to the scientific community for several years. Understanding the PTMs may, therefore, open new avenues into targeting c-Myc for inhibition of cell proliferation. Myc protein also regulates genes involved in several pathways including lipogenesis, glycolysis, glucose and glutamate import, lactate export, nucleotide biosynthesis and glutaminolysis . Since Myc is involved in 70% of the cancers occurring in humans, it has the potential of becoming a drug target. In fact, many small chemicals targeting Myc-Max dimers have been designed to prevent transcription of proliferation-related genes . Also, over-expression of Myc has been blocked by using some anti-G quadruplex compounds and BET inhibitors . Other strategies that have been developed to target Myc are with the use of miRNAs such as miR-26a used in animal models .
Need for the database
Over the last few decades, a substantial number of publications and conferences have been dedicated to specific key essential genes including p53 and Myc. Although an extensive amount of research is being done in the field of Myc biology, a gap remains in understanding and targeting the deregulated Myc in the disease state. The primary concern with anti-Myc therapy lies in the fact that it continues to be essential for normal proliferating tissues. Myc is also involved in a vast network of protein-protein interactions (PPIs) as well as protein-DNA interactions and modulates many signalling pathways which make it even harder to inhibit without causing serious side effects. A deeper understanding of these aspects of Myc may guide researchers into effectively targeting Myc for anti-cancer therapy. The foremost challenge lies in collecting the extensive amount of information from the vast reserve of scientific literature and secondary databases. Even though existing databases such as UniProt  and GeneCards  do provide information on various aspects of Myc such as gene sequence, size, an overview of interactors and PTMs, they lack precise details such as mutated residues and their link with PTMs or the region of Myc involved in PPI. These can only be obtained after the meticulous search of the scientific literature or browsing dedicated databases. It is therefore of utmost importance to accumulate all relevant data in a user-friendly platform in the form of a database from where the user may achieve new insights into disease biology and treatment.
In order to get an overall idea about the functional sites of Myc along with its properties, an attempt has been made to construct a database named MYCbase. MYCbase can be seen as a repository of all aspects of Myc relevant to its biochemical characterization including mutations, PPIs, small chemical drug molecule targeting, metabolic pathways, methylation pattern and others. We also focus on various analysis and conclusions that can be derived from this database which may help the scientific community in designing and troubleshooting their experiments in the field of Myc biology. To the best of our knowledge, there is no existing database, which compiles this variety and quantity of information regarding Myc, with relevance to cancer, studies under a single platform.
Construction and content
Database schema and implementation
MYCbase comprises eleven categories of information about Myc protein and gene, collected from different resources as shown in Fig. 1. It is a tertiary database containing 2223 entries from various aspects of MYC. The data compiled can be divided into three broad categories. First, the manually curated data from PubMed literature, such as mutation (352), metabolism and pathway (126) and methylation pattern (29). Second, the partially curated data like protein-protein interactions (PPIs) (925), where the cell lines used in all the experiments, the region of Myc with which they interact and the outcome of the interactions were manually curated from scientific literature and added to the existing information derived from a specialized database. Third, the data derived from other specialized databases, including gene expression information of Myc (115), Mass-spectrometry related data of Myc protein (41), miRNAs interacting with c-Myc gene along with MYCN and MYC-Max (202), modulators of protein-protein interactions or PPIMs (30), DNA interactions of Myc (131), Myc protein abundance data in various cell lines and tissues (168) and finally post-translational modifications present in Myc (104).
Manually curated tables
Mutation: Around 446 different abstracts were downloaded from PubMed till November 2015 by using the keywords 'c-myc gene' and 'c-myc gene mutations'. All relevant publications were sorted to extract information on c-myc gene mutations. The manually extracted information was placed in categories such as the location of mutation, residue mutated, type of mutation, cell line or sample used, methods, the disease affected, follow-up method, PubMed ID, and frequency of mutations. The functional sites mutated at the residue level are also linked to the PTM table so that any modifications at the site of mutations can also be seen easily.
Methylation pattern: PubMed was searched for (Myc[Title]) AND (Methylation[Title]) from which 42 relevant hits were obtained till April 2016. Only 29 publications had relevant information about c-myc methylation with disease states. These findings were arranged under the region explored, the nucleotide sequence, methylation status, cell line used, an outcome of methylation and relation to any disease.
Pathways/Metabolism: PubMed was searched for Myc in connection with metabolism and pathway which gave 126 till November 2015 results with information pertinent to the database. All papers were thoroughly examined and results were grouped under pathway, enzyme, regulatory action, comments, cell line and PubMed ID.
Partially curated tables
Protein-Protein Interactions (PPIs): The already existing database for PPIs Biological General Repository of Interaction Datasets (BioGRID)  was searched to collect the PPIs of Myc protein which contained information of 925 interactions. This information was arranged under protein names, type of interaction, UniProt ID of the interactor, experimental technique, PubMed ID, interaction domain, cell line used and the function. The cell line, interaction domain and result of the functions were manually extracted from original publications by comprehensive PubMed search. Some interactions, for an example, Myc-Max, have more than one entry (46), using different methods, cell lines, and references. The presence of many entries under the same interaction adds confidence as it has been reported by more than one group of researchers.
Derived from other specialized databases
miRNA interaction: miRNAs that interact with (or target) mRNA of the c-myc to regulate its gene expression were collected from different publically available databases such as TarBase v7.0, miRTarBase release 6.1, miRecords TransmiR and miRGen v3 [21,22,23,24,25]. These contain experimentally validated interaction which totals to 202 appropriate results including miRNAs which target MYCN and MYC-Max along with the ones targeting c-myc gene. They were grouped under miRNA name, the database used, tissue, cell line, methods used, PubMed ID, MiRBase Accession numbers, the gene they target and relevant comments.
DNA interaction: As c-Myc protein is a transcription factor with a DNA binding site it controls the expression of many genes. We have collected interaction between Myc as a transcription factor with other genes from TRANSFAC database . These 109 results were categorized into the gene, location within the gene, binding site identifier, binding reaction, effect and quality score. The ChIP-seq data from ENCODE project  were also included in MYCbase.
Protein- Protein Interaction Modulators (PPIM): TIMBAL  database was used to collect information regarding the modulators of Myc protein. A total of 30 modulators were collected and arranged in a table having target PPI, chemical name, smile, complex description, target UniProt ID, PDB ID, assay description, assay type, confidence description and type of interaction columns. The chemicals detail information was collected and hyperlinked from PubChem database.
Gene Expression: Gene Expression Omnibus (GEO), NCBI  was used to identify 255 c-myc gene expression datasets. Information regarding the title of the dataset, GSE number, cell line, method used to find out the results, platform used for the experiment, disease-associated and PubMed ID of related publication were compiled.
Mass Spectrometry-based proteomics data: PRoteomics IDEntifications (PRIDE), EMBL-EBI  was used to extract all proteomics data where Myc was identified. This search using (P01106 accession number) revealed 44 hits. The results were assembled according to the title, project ID, cell lines used, type of experiment done, disease-associated and PubMed ID.
Protein abundance: Myc protein abundance data was derived from PaxDb: Protein Abundance Database  which resulted in around 168 hits. The search results were arranged according to the source of the protein, the method used, abundance, rank, interaction consistency score, coverage and PubMed ID.
PTM(s): All known PTMs of Myc along with their functional sites of modifications at the residue level and enzymes responsible were elucidated in MYCbase. There are 104 appropriate entries downloaded from iPTMnet . Their original sources of the information along with the PubMed IDs were reported.
Search Section: The search option, available on the Home page, allows the user to search for one or multiple datasets using gene symbols, disease names, and tissue types. Users can either search using the ‘All’ option which gives results for all eleven categories or selects a category in which the keyword is to be searched. A list of keyword examples for each category is displayed on the home page as shown in Fig. 2a. The search will generate a table giving the number of entries matching the query in each of the eleven categories along with the list of all information matching the particular keywords in the database, as shown in Fig. 2b. The links to PubMed and other sites such as PubChem and UniProt, where ever applicable, is also given for further reference and understanding of the user.
Browse Section: MYCbase allows the user to browse the database and acquire information present in all the eleven categories. These are: i) Mutations, ii) Protein-Protein Interactions (PPI), iii) Gene expression, iv) Mass Spectrometry data (PRIDE), v) Protein-Protein Interaction Modulators (PPIM), vi) miRNA, vii) Pathway, viii) Methylation Pattern, ix) Protein abundance, x) DNA interaction, and xi) Post-translational modifications (PTMs). All information under each of these eleven categories can be accessed by clicking on them. In addition to this, a statistical representation in the form of a pie-chart is given to outline the percentage of data distribution under each category. Hovering the mouse over each sector of the pie-chart expands to show the number of entries under each category as shown in Fig. 2c and d.
Help Section: It provides an outline of the sources from which the data has been assembled. This section also gives the new user an idea of how to use MYCbase. Any queries related to MYCbase can be sent to the address mentioned on this page. Download Section: A dedicated download page helps the users to download all information present in the eleven categories available in MYCbase in .xls format. Team Section: The team page acknowledges the authors responsible for creating MYCbase.
The different kinds of mutations present in Myc gene can be represented in the form of a pie chart as shown in Fig. 3a. While translocation including the one present in Burkitt’s lymphoma remains to be one of the most documented mutations in the scientific literature, 43 out of 352 entries in MYCbase, some point mutations giving rise to other kinds of cancers were reported. From the 227 point mutations present in MYCbase the top most mutated residues were identified. Mutation proportions for the top nine most mutated residues were calculated by dividing the number of mutations at each site for the particular disease by the total number of mutations reported for that site. Top ten diseases caused by mutations present in the Myc gene were also derived from MYCbase (Table 1). A heat map was constructed using heatmap.3 function in R with default parameters to show the mutation proportions of nine most mutated residues for three diseases (Fig. 4). From this figure, we can conclude that mutations in S62 and T58 are present in all three diseases. Burkitt’s lymphoma, though being characterized by the most common translocation t (8; 14) (q24; q32), also shows mutations in all of the nine top mutated residues. The S62 and T58 being important sites for phosphorylation which is involved in proteasomal degradation of Myc may give a major clue as to why they are also most frequently mutated. For this reason, we have interlinked the sites of mutations and the sites of PTMs in MYCbase to give an idea as to how mutations in these regions may affect the regulation of Myc. Myc, as we see from MYCbase, plays an important role in regulating different cellular pathways (Fig. 3b). Myc is majorly seen to stimulate many genes involved in nucleotide synthesis, ribosome biogenesis, and translation. It is represented by 76 out of 126 entries in MYCbase under pathway/metabolism category. Myc is also seen to drive the expression of other genes that are involved in glucose import, glycolysis, amino acid uptake and catabolism, particularly of glutamine, supporting increased protein synthesis required for a growing cell.
The region of interaction of PPIs is a new addition to data already present in existing databases. From this, we get to know that little information is available for the central disordered region of c-Myc protein. It would be of interest to explore interaction motifs present in these regions which may be essential for PPIs. It was observed that few proteins which are Myc interactors in PPI table are also present as genes regulated by Myc in DNA interactions which are transcription targets of Myc. These target proteins can be speculated to be involved in transcriptional feedback loops. To identify these common targets we created a Venn diagram with the help of Venny 2.1 using data present in the two tables from MYCbase (Fig. 5a) . We have identified eight proteins namely Bromodomain-containing protein 7 (BRD7), Cyclin-dependent kinase 4 (CDK4), Eukaryotic translation initiation factor 4E (EIF4E), Zinc finger protein GLI1 (GLI), Heat shock protein HSP 90α (HSP90AA1), Galectin-1 (LGALS1), DNA replication licensing factor MCM7 (MCM7) and S-phase kinase-associated protein 2 (SKP2) as the eight common targets. These targets are therefore involved in more than one type of relation with Myc and hence may prove to be involved in regulation of Myc and the coordination of its multiple functions. Most of these common targets are involved in cell cycle regulation and transcriptional activities. They can be further explored in future to establish the feedback loops. We also found out the Gene Ontology (GO) terms for each of the proteins under the PPI and DNA interaction categories using AmiGO 2 Term Enrichment Services for Biological processes . The results from this were filtered such that any GO terms below level three and above level five were removed and analyzed using Venny 2.1 to find the common biological processes for the two molecular relations (Fig. 5b). We found 27 common GO terms for PPIs and DNA interactions. Some of the common GO terms were the regulation of cell cycle arrest (GO:0071156), cell cycle phase transition (GO:0044770) and regulation of cell proliferation (GO:0042127).
A case study of breast cancer using MYCbase
We analyzed data regarding breast cancer in MYCbase by performing a simple keyword search with “Breast Cancer” in all options on the home page. The results we received, summarized the role of Myc in breast cancer. Gene amplification, including duplication, is the most important type of mutation that has been reported in breast cancer (in 7 out of 11 cases present in MYCbase). This is consistent with the knowledge that MYC amplification plays an important role in breast cancer development, progression and also associated with poor outcome [35, 36]. Breast cancer-associated gene, BRCA1, which functions as a tumour suppressor, binds to c-Myc protein and repress transcription is represented in the MYCbase PPI table with validated experimental techniques . It is also represented in DNA interactions, as Myc is able to activate transcription of BRCA1 thus confirming that the loss of BRCA1 plays an important role in breast cancer . Two missense mutations present in MYC found in breast cancer corresponds to S62 and T58, which, as already mentioned, are important sites for PTMs leading to proteasomal degradation of Myc. This study thus shows that MYCbase gives us a better representation of various aspects of Myc from a single platform.
The recent development in proteomic, transcriptomic and whole genome sequencing methodologies have led to the generation of extensive amounts of data and research articles related to c-Myc. MYCbase was developed by compiling the literature and existing secondary databases. MYCbase contains information of various functional sites at the residue level and biochemical aspects at a single platform, using which researchers can gain complete knowledge of the role of Myc in disease biology. MYCbase will be updated regularly to incorporate any updates in the secondary databases as well as in scientific literature.
Myc is a master regulator and can regulate of over 1500 coding and non-coding genes that include approximately 48 validated transcription factors [39, 40]. Many different methods have been used to identify these DNA, as well as, protein level interactions. For over a decade high throughput methods such as ChIP-seq and AP-MS are being used to provide a better understanding of Myc as a “hub” for these complex interactions. The data available in ENCODE project is the largest available repository for ChIP-seq data of human TFs and hence enriches MYCbase with high quality DNA-Myc interaction data. Myc may also be involved in feed forward loops which play a regulatory role both at transcriptional and post-transcriptional levels (miRNA mediated) . MYCbase highlights the miRNAs that Myc, as a transcription factor, regulates (data from TransmiR) along with miRNAs that target MYC gene (data from miRGen v3). These miRNAs are of importance to regulate the expression of Myc protein, as well as, control downstream signalling processes. In addition, miRNAs which target MYCN and MYC-Max and have also been explored with miRGen v3  and are included in MYCbase. ChIPBase is another database that incorporates ChIP-seq data which may be used to find out miRNAs that regulate Myc [42, 43]. The complex PPI network of Myc may be of interest to identify targets for anti-Myc therapy, which stands as a promising aspect for further studies by manipulating it experimentally. Till date, however, only PPIMs against Myc-Max and some bromodomain-containing proteins regulating Myc (BET inhibitors) have been identified [15, 16]. Therefore, there is the need to understand this complex Myc network to identify novel PPIs which can be targeted by small molecule inhibitors for proper regulation of Myc in disease states.
The data analysis of MYCbase has enabled us in drawing some conclusions on metabolic pathways and mutations at the residue level. First, Myc plays a major role in nucleotide biosynthesis which is essential for proliferating cells involved in tumorigenesis. A possible mechanism by which Myc regulates nucleotide biosynthesis is postulated to be through regulation of phosphoribosyl-pyrophosphate synthetase 2 (PRPS2) which results in promotion of increased nucleotide biosynthesis . Second, Myc is also involved in glutamine synthesis and import pathways which are upregulated by overexpression of Myc to supply increased energy demands in a cancer cell . Third, mutations present in Myc gene were studied for decades and particular emphasis is given to translocation of Myc gene present in Burkitt’s lymphoma. But, silent mutations present in Myc also have a major role to play in cancer. It was reported recently that silent mutations disrupt the interaction surface mediating PPIs . This may open up a new front in the importance of exploring the regions of interaction involved in PPIs in Myc through the idea of “edgetic” perturbation . Fourth, the mutations in T58 and S62 sites in the top three diseases caused mutations of Myc gene including Burkitt’s lymphoma are also important PTM sites. It has been seen that decreased T58 and increased S62 phosphorylation is present in human cancer cell lines associated with increased c-Myc protein stability . Furthermore, the eight proteins which were identified as common targets both in Myc-protein interactions and Myc-DNA binding may be exploited in feedback regulations of Myc. Networks generated from such studies may help to elucidate at which level Myc can be regulated for controlling cancer progression.
MYCbase is a tertiary database compiling various aspects of Myc relevant to cancer biology. Information present here has been accumulated from peer-reviewed literature and secondary databases. It is an open access database and information present can be downloaded freely by the users. In addition to the development of MYCbase, analysis of compiled data provided valuable insights regarding the role of Myc in cancer. An example of a case study using breast cancer incorporating the application of MYCbase has also been described.
Availability of data and materials
Database homepage: http://bicresources.jcbose.ac.in/ssaha4/mycbase. These data are freely available without restrictions for use by academics.
Affinity purification followed by mass spectrometry
Bromodomains and extra-terminal motif proteins
basic helix-loop-helix and leucine zipper region
Chromatin Immunoprecipitation followed by sequencing
Myc box I
Myc box II
Protein-protein interaction modulators
Phosphoribosyl-pyrophosphate synthetase 2
Graves JA, Rothermund K, Wang T, Qian W, Van Houten B, Prochownik EV. Point mutations in c-Myc uncouple neoplastic transformation from multiple other phenotypes in rat fibroblasts. PLoS One. 2010;5(10):e13717.
Oster SK, Ho CS, Soucie EL, Penn LZ. The myc oncogene: MarvelouslY Complex. Adv Cancer Res. 2002;84:81–154.
Chen CH, Shen J, Lee WJ, Chow SN. Overexpression of cyclin D1 and c-Myc gene products in human primary epithelial ovarian cancer. Int J Gynecol Cancer. 2005;15(5):878–83.
Takahashi Y, Kawate S, Watanabe M, Fukushima J, Mori S, Fukusato T. Amplification of c-myc and cyclin D1 genes in primary and metastatic carcinomas of the liver. Pathol Int. 2007;57(7):437–42.
Dalla-Favera R, Bregni M, Erikson J, Patterson D, Gallo RC, Croce CM. Human c-myc onc gene is located on the region of chromosome 8 that is translocated in Burkitt lymphoma cells. Proc Natl Acad Sci U S A. 1982;79(24):7824–7.
Bhatia K, Huppi K, Spangler G, Siwarski D, Iyer R, Magrath I. Point mutations in the c-Myc transactivation domain are common in Burkitt's lymphoma and mouse plasmacytomas. Nat Genet. 1993;5(1):56–61.
Trumpp A, Refaeli Y, Oskarsson T, Gasser S, Murphy M, Martin GR, Bishop JM. c-Myc regulates mammalian body size by controlling cell number but not cell size. Nature. 2001;414(6865):768–73.
Ponzielli R, Katz S, Barsyte-Lovejoy D, Penn LZ. Cancer therapeutics: targeting the dark side of Myc. Eur J Cancer. 2005;41(16):2485–501.
Beaulieu ME, McDuff FO, Bedard M, Montagne M, Lavigne P. Methods for the expression, purification, preparation, and biophysical characterization of constructs of the c-Myc and Max b-HLH-LZs. Methods Mol Biol. 2013;1012:7–20.
Amati B, Littlewood TD, Evan GI, Land H. The c-Myc protein induces cell cycle progression and apoptosis through dimerization with Max. EMBO J. 1993;12(13):5083–7.
Sarkar D, Patra P, Ghosh A, Saha S. Computational Framework for Prediction of Peptide Sequences That May Mediate Multiple Protein Interactions in Cancer-Associated Hub Proteins. PLoS One. 2016;11(5):e0155911.
Sharrard RMRJ, Rogers S, Shorthouse AJ. Patterns of methylation of the c-myc gene in human colorectal cancer progression. Br J Cancer. 1992;65(5):667–72.
Yada M, Hatakeyama S, Kamura T, Nishiyama M, Tsunematsu R, Imaki H, Ishida N, Okumura F, Nakayama K, Nakayama KI. Phosphorylation-dependent degradation of c-Myc is mediated by the F-box protein Fbw7. EMBO J. 2004;23(10):2116–25.
Dang CV. Therapeutic targeting of Myc-reprogrammed cancer cell metabolism. Cold Spring Harb Symp Quant Biol. 2011;76:369–74.
Clausen DM, Guo J, Parise RA, Beumer JH, Egorin MJ, Lazo JS, Prochownik EV, Eiseman JL. In vitro cytotoxicity and in vivo efficacy, pharmacokinetics, and metabolism of 10074-G5, a novel small-molecule inhibitor of c-Myc/Max dimerization. J Pharmacol Exp Ther. 2010;335(3):715–27.
Delmore JE, Issa GC, Lemieux ME, Rahl PB, Shi J, Jacobs HM, Kastritis E, Gilpatrick T, Paranal RM, Qi J, et al. BET bromodomain inhibition as a therapeutic strategy to target c-Myc. Cell. 2011;146(6):904–17.
Kota J, Chivukula RR, O'Donnell KA, Wentzel EA, Montgomery CL, Hwang HW, Chang TC, Vivekanandan P, Torbenson M, Clark KR, et al. Therapeutic microRNA delivery suppresses tumorigenesis in a murine liver cancer model. Cell. 2009;137(6):1005–17.
I. S. Vlachos, M. D. Paraskevopoulou, D. Karagkouni, G. Georgakilas, T. Vergoulis, I. Kanellos, I-L. Anastasopoulos, S. Maniou, K. Karathanou, D. Kalfakakou, A. Fevgas, T. Dalamagas and A. G. Hatzigeorgiou. DIANA-TarBase v7.0: indexing more than half a million experimentally supported miRNA:mRNA interactions. Nucl. Acids Res. 2014;43(Database issue):D153-9. doi:10.1093/nar/gku1215.
Chou CH, Chang NW, Shrestha S, Hsu SD, Lin YL, Lee WH, Yang CD, Hong HC, Wei TY, Tu SJ, Tsai TR, Ho SY, Jian TY, Wu HY, Chen PR, Lin NC, Huang HT, Yang TL, Pai CY, Tai CS, Chen WL, Huang CY, Liu CC, Weng SL, Liao KW, Hsu WL, Huang HD. miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database. Nucleic Acids Res. 2016;44(D1):D239-47. doi:10.1093/nar/gkv1258.
Feifei Xiao, Zhixiang Zuo, Guoshuai Cai, Shuli Kang, Xiaolian Gao, Tongbin Li. miRecords: an integrated resource for microRNA–target interactions. Nucleic Acids Res. 2009 January; 37(Database issue): D105–D110. Published online 2008 November 7. doi: 10.1093/nar/gkn851
Juan Wang, Ming Lu, Chengxiang Qiu, Qinghua Cui. TransmiR: a transcription factor–microRNA regulation database. Nucleic Acids Res. 2010 January; 38(Database issue): D119–D122. Published online 2009 September 28. doi: 10.1093/nar/gkp803
Georgakilas G, Vlachos IS, Zagganas K, Vergoulis T, Paraskevopoulou MD, Kanellos I, Tsanakas P, Dellis D, Fevgas A, Dalamagas T, et al. DIANA-miRGen v3.0: accurate characterization of microRNA promoters and their regulators. Nucleic Acids Res. 2016;44(D1):D190–195.
ENCODE Project: https://www.encodeproject.org/ Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74.
Gene Expression Omnibus (GEO), NCBI: http://www.ncbi.nlm.nih.gov/geo/
PRoteomics IDEntifications (PRIDE), EMBL-EBI: http://www.ebi.ac.uk/pride/archive/
PaxDb, Protein Abundance Database: http://pax-db.org/
Venny: Oliveros JC. An interactive tool for comparing lists with Venn's diagrams. 2007-2015.http://bioinfogp.cnb.csic.es/tools/venny/index.html
AmiGO 2 version: 2.5.3 (tomodachi): http://amigo2.geneontology.org/amigo
Aulmann S, Adler N, Rom J, Helmchen B, Schirmacher P, Sinn HP. c-myc amplifications in primary breast carcinomas and their local recurrences. J Clin Pathol. 2006;59(4):424–8.
Corzo C, Corominas JM, Tusquets I, Salido M, Bellet M, Fabregat X, Serrano S, Sole F. The MYC oncogene in breast cancer progression: from benign epithelium to invasive carcinoma. Cancer Genet Cytogenet. 2006;165(2):151–6.
Li H, Lee TH, Avraham H. A novel tricomplex of BRCA1, Nmi, and c-Myc inhibits c-Myc-induced human telomerase reverse transcriptase gene (hTERT) promoter activity in breast cancer. J Biol Chem. 2002;277(23):20965–73.
Menssen A, Hermeking H. Characterization of the c-MYC-regulated transcriptome by SAGE: identification and analysis of c-MYC target genes. Proc Natl Acad Sci U S A. 2002;99(9):6274–9.
Zeller KI, Jegga AG, Aronow BJ, O'Donnell KA, Dang CV. An integrated database of genes responsive to the Myc oncogenic transcription factor: identification of direct genomic targets. Genome Biol. 2003;4(10):R69.
Zeller KI, Zhao X, Lee CW, Chiu KP, Yao F, Yustein JT, Ooi HS, Orlov YL, Shahab A, Yong HC, et al. Global mapping of c-Myc binding sites and target gene networks in human B cells. Proc Natl Acad Sci U S A. 2006;103(47):17834–9.
El Baroudi M, Cora D, Bosia C, Osella M, Caselle M. A curated database of miRNA mediated feed-forward loops involving MYC as master regulator. PLoS One. 2011;6(3):e14742.
Yang JH, Li JH, Jiang S, Zhou H, Qu LH. ChIPBase: a database for decoding the transcriptional regulation of long non-coding RNA and microRNA genes from ChIP-Seq data. Nucleic Acids Res. 2013;41(Database issue):D177–187.
Zhou KR, Liu S, Sun WJ, Zheng LL, Zhou H, Yang JH, Qu LH. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data. Nucleic Acids Res. 2017;45(D1):D43–50.
Cunningham JT, Moreno MV, Lodi A, Ronen SM, Ruggero D. Protein and nucleotide biosynthesis are coupled by a single rate-limiting enzyme, PRPS2, to drive cancer. Cell. 2014;157(5):1088–103.
DeBerardinis RJ, Lum JJ, Hatzivassiliou G, Thompson CB. The biology of cancer: metabolic reprogramming fuels cell growth and proliferation. Cell Metab. 2008;7(1):11–20.
Engin HB, Kreisberg JF, Carter H. Structure-Based Analysis Reveals Cancer Missense Mutations Target Protein Interaction Interfaces. PLoS One. 2016;11(4):e0152929.
Meyer MJ, Das J, Wang X, Yu H. INstruct: a database of high-quality 3D structurally resolved protein interactome networks. Bioinformatics. 2013;29(12):1577–9.
Wang X, Cunningham M, Zhang X, Tokarz S, Laraway B, Troxell M, Sears RC. Phosphorylation regulates c-Myc's oncogenic activity in the mammary gland. Cancer Res. 2011;71(3):925–36.
The authors thank Bioinformatics Centre of Bose Institute for providing the infrastructure for the studies carried out for this paper. They would also like to thank Arijita Sarkar of Bose Institute for the help received in searching miRNA databases. The authors are grateful to the ENCODE Consortium and the ENCODE production laboratory for generating the particular datasets of MYC as a target in ChIP-seq experiments.
DC is supported by Bose Institute Junior Research Fellowship. Bose Institute is not involved in the design or conclusion of the study.
SS conceived and designed the database. DC, SDM, SA and AB collected the data. TJ wrote the code. DC and SS performed the analysis and drafted the manuscript. All the authors read and approved the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Chakravorty, D., Jana, T., Das Mandal, S. et al. MYCbase: a database of functional sites and biochemical properties of Myc in both normal and cancer cells. BMC Bioinformatics 18, 224 (2017). https://doi.org/10.1186/s12859-017-1652-6