- Open Access
GIW and InCoB are advancing bioinformatics in the Asia-Pacific
BMC Bioinformatics volume 16, Article number: I1 (2015)
GIW/InCoB2015 the joint 26th International Conference on Genome Informatics (GIW) and 14th International Conference on Bioinformatics (InCoB) held in Tokyo, September 9-11, 2015 was attended by over 200 delegates. Fifty-one out of 89 oral presentations were based on research articles accepted for publication in four BMC journal supplements and three other journals. Sixteen articles in this supplement and six articles in the BMC Systems Biology GIW/InCoB2015 Supplement are covered by this introduction. The topics range from genome informatics, protein structure informatics, image analysis to biological networks and biomarker discovery.
The International Conference on Bioinformatics (InCoB) is annual conference of the Asia-Pacific Bioinformatics Network , while GIW is the annual conference of the Association of Asian Societies for Bioinformatics (AASBi) . Both GIW and InCoB are intimately associated with the development, growth and maturation of bioinformatics in Asia. Twenty-six years after the first GIW and fourteen years after the first InCoB, bioinformatics is now a well-established and diverse discipline within life and computer sciences. Hence, the differences in thrust and focus areas of the two conferences (e.g. genome informatics for GIW) faded over time. In the past five years, GIW and InCoB had to cope with increasingly tight travel budgets of researchers who are subject to stringent key performance indicators often requiring their work presented in talks to be published as articles in journals with sufficiently high impact factors. As both conferences draw submissions from a largely overlapping clientele, APBioNet and AASBi member society Japanese Society for Bioinformatics (JSBi) decided to organize a joint GIW/InCoB2015 conference  rather than competing for submissions and potential delegates. The immediate effects of joining forces were the ability to offer manuscript submission to seven journal tracks (Bioinformatics , BMC Genomics , BMC Bioinformatics , BCM Medical Genomics , BMC Systems Biology , IEEE/ACM Transactions on Computational Biology and Bioinformatics  and Journal of Computational Biology and Bioinformatics ) and savings in logistics, operational and administrative costs that allowed us to sustain moderate registration fees.
As bioinformaticians we are familiar with network topologies. If we treat conference stakeholders similar to vertices and their connections in dynamic networks and try to optimize them we can achieve a state of scale-free conferences. APBioNet considered GIW/InCoB2015 a test run for a future, potentially larger, joint multi-partner supraregional bioinformatics conference in Asia. The expected economy of scale and richness in diversity of topics will be beneficial for all stakeholders in the conference: participants, invited speakers, organizers/hosts, publishers, sponsors, funders, exhibitors, venues, transport and accommodations.
Manuscript submission and review
All submitted manuscripts were reviewed by at least two reviewers before a decision on rejection, major revision or minor revision was reached. Details on the reviewing process, acceptance, the list of manuscripts selected for the Best Paper Awards and the names of the program committee members are available in the Introduction of the BMC Genomics GIW/InCoB2015 supplement . An overview of the 16 articles in this supplement  and six articles in BMC Systems Biology supplement  is given in the next sections.
Biomarkers and disease networks
The wider application of biomarkers in diagnosing complex diseases and their progression is still limited by suboptimal accuracy and stability. Two studies addressing these issues apply network and algorithm approaches to cardioembolic stroke and Alzheimer's disease data. Wong et al.  constructed protein-protein interaction networks based on temporal gene expression profiles derived from microarray time course data of cardioembolic stroke. Pathway analyses combined with a stroke-relevant scoring function revealed network-based biomarkers that were common or distinct for three post-stroke time points. Vandewater et al.  tackled the problem of combinatorial solution space in finding robust blood-based biomarker and demographic feature combinations that are predictive for the progression of cognitive impairment towards Alzheimer's disease. An adaptive genetic algorithm with logistic regression approach resulted in a 30 feature model that showed superior performance in predicting progression to both mild cognitive impairment and Alzheimer's disease.
Another common challenge in biomarker discovery for complex diseases is that high-throughput transcriptome, proteome and metabolome data are amenable for sub-typing (e.g. cancer) and sub-processes in the disease but require more than Gene Ontology-based functional enrichment to understand the underlying molecular mechanisms. Wu et al.  designed a large-scale text-mining system for thyroid cancer sub-classification and detailed molecular pathway understanding which appears to be sufficiently generic to be applied to other complex diseases.
Copy number alterations (CNAs) are known to contribute to the deregulation of gene expression in cancers . Piccetti et al.  developed a methodology related to learning cooperative regulation networks from gene expression data  to derive for each gene and sample of bladder cancer with CNAs a network-wide deregulation score. The approach is expected to provide useful information on shared and different deregulation types among cancer subtypes as well as individual patients. Taguchi  applied principal component analysis with unsupervised feature extraction to identify deregulated genes and aberrantly methylated promoters in the endocrine inhibitor vinclozolin exposed rat F3 cell lineages representing different developmental stages. His approach yielded candidate chemokine signaling pathways and several leucine-rich repeat proteins that may play a role in transgenerational-mediated epigenetic diseases.
Transcriptional regulation and protein-protein interactions
Multicellular human and unicellular yeast share core eukaryotic pathways that can be exploited in humanized yeast cell  disease models and study of transcriptional regulation. Wu et al.  implemented a one-stop web-based yeast-associated genes mining tool YAGM for biological associations that include TF binding, regulation, mutant phenotype, physical and genetic interactions among others.
Functional redundancy of TFs in yeast has been studied experimentally using knock-out strains indicating that only a small percentage of binding sites regulated by a particular TF are affected when the TF is knocked-out. Wu et al.  reanalyzed the functional redundancy for molecular features that may explain why one TF can compensate for another one. Features that rendered a TF redundant included low expression levels, TF binding sites in close vicinity of transcription start sites, few bound TFs among other factors. Benchmarking of new or modified computational methods of TF binding predictions can become time consuming when done for all methods. Lai et al.  developed a cooperative TF pair evaluator that incorporated fourteen different methods and their performance indices.
Protein-protein interactions (PPIs) can change spatially and temporally depending on the context. When PPIs are clustered, the temporal and/or spatial differences that delineate molecular biological information can be lost. Stoney et al.  designed a clustering strategy that can accommodate context dependent PPIs utilizing Gene Ontology annotations or sequence homology to determine functional communalities. The approach allows PPIs to be represented in multiple pathways rather than one node with multiple connections, a prerequisite for exploring functional organization and changes.
The work of Konishi  also addresses clustering in the presence of a temporal dimension. The scaled principal component analysis of mammary gland development microarray time course data showed robustness to noise and separation of groups across all time points.
Protein structures and post-translational modifications
Post-translational modification (PTM) or the absence of it affects the structure of a protein. Potential PTM-mediated changes in protein-protein binding characteristics may alter the regulation of protein networks but also make PTM a drugable target in diseases pathways. For example, aberrant neddylation has been reported to be involved in several cancers, cardiac and neurodegenerative diseases . Yavuz et al.  developed a new neddylation site prediction method using a support vector machine algorithm. The feature selection includes protein properties beyond sequence conservation such as hydrophobicity, disorder state and various physicochemical properties. The second PTM prediction method and tool reported in this supplement utilizes maximal dependence decomposition of potential motifs associated with O-linked glycosylation catalyzed by O-GlcNAc transferase .
Sowmya and Ranganathan  investigated generic features of PPIs that govern binding at the interface of proteins on a quantitative level. Among nine feature classes: interface area, interface polar abundance, interface charged residues percentage, and solvation free energy gain upon interface formation, binding energy turned out to be significantly different. The finding has the potential to improve the characterization of protein complexes at the single-residue level and improve docking methods for predicting protein-protein binding. Large-scale prediction of protein-binding affinities based on sequence properties was addressed by Srinivasulu et al.  who introduced an SVM with support vector regression feature selection. Fourteen features of physiochemical properties were found to be informative for predicting the affinities of heterodimeric protein complexes. The last paper in this section deals with the issue of antigenic protein surface residues in conformational B-cell epitope prediction. Ren et al.  developed a weighted SVM algorithm that has been implemented with together with data pre-processing steps as the positive-unlabeled prediction pipeline (PUPre). PUPre performance was evaluated using unbound antigen structures of antigens that circumvents constraints of bound antigen structures. PUPre outperformed existing methods when tested on the three antigens with known epitopes, and it will be interesting to see if the method will be implemented as web tool for use in prediction of unknown B-cell epitopes by a wider user community.
Biological image classification is gaining more and more importance beyond cell imaging and diagnostics. Artificial neural networks (ANN) have been successfully applied in the taxonomic classification of algae from images . Here, Kien et al.  developed an automated ANN-based species identification technique for copepod zooplankton using dorsal microscopy images. The methodology shows promise to reduce the time spent on the taxonomic analysis of plankton in aquatic ecosystem studies. The focus of the second bioimaging paper is on single-cell Raman spectra of bacteria . The authors present an improved method that transforms the Raman spectrum into a discrete spectrum that results into rapid spectra comparison and higher classification accuracy.
Light-RCV  is another next-generation sequencing reads and alignment viewer. The genome-wide read coverage at base level is facilitated by a memory dump of the coverage which does not require lengthy sequential loading of a specific base position. Kimura and Koike  used the Burrows-Wheeler transform to identify the exact position of break points in genomic rearrangements. The exact position is determined without using split reads by applying discordant pairs and a lossless dictionary of reads. When applied to heterogeneous cancer genome sequence samples previously unknown somatic breakpoints were detected. Genomics is increasingly routinely applied in environmental sciences. For example in soil microbiome studies 16S rDNA sequencing is used to determine taxonomic distribution of bacteria in samples. Chen et al.  designed a sequencing pipeline with reduced primer bias and post-processing that resulted in longer 16S rDNA sequences during a proof-of-principle sequencing of dioxin-containing soil samples. In microbial metagenomic studies the number of mixed bacterial populations is termed richness. Jayasundara et al.  improved the estimation of richness for viral strains using quasispecies spectra obtained with a new probabilistic method.
In 2012 the US Presidential Commission for the Study of Bioethical Issues released a report on "privacy and progress in whole genome sequencing" . The analysis results and recommendations range from policies and ethics to technical issues such as computational access. Database developers and providers face the conundrum of promoting data access and sharing data while providing protection, security and privacy. In the past three years a few papers explored fuzzy encryption to deal with privacy issues when searching genotype and single-nucleotide polymorphism (SNP) data [39, 40]. Shimizu et al.  proposed an additive-homomorphic cryptosystem for protecting user and database privacy. The results of a case-study on searching a large chemical compound database have been encouraging and may result in future applications in drug discovery-related searching of sensitive data and searching for SNPs in personal genome databases.
The articles in the two GIW/InCoB2015 supplements of BMC Bioinformatics and BMC Systems Biology represent a variety of new bioinformatics methods that will enable new studies to advance our biological and biomedical knowledge on molecular and system levels. The next opportunity to present and publish innovative bioinformatics tools and novel bioinformatics-driven research in the framework of InCoB and collaborating partners is coming soon. InCoB2016 will be held from September 21-23, 2016 in Singapore .
- Association of Asian Societies for Bioinformatics
- artificial neural network
- Asia-Pacific Bioinformatics Network
- copy number alteration
- International Conference on Genome Informatics
- Hidden Markov Model
- IEEE Communications Society/Association for Computing Machinery
- International Conference on Bioinformatics
- Japanese Society for Bioinformatics
- protein-protein interaction
- post-translational modification
- single-nucleotide polymorphism
- transcription factor.
APBioNet. 2015, [http://www.apbionet.org/]
AASBi. 2015, [http://http:/www.aasbi.org/]
GIW/InCoB2015. 2015, [http://incob.apbionet.org/incob15]
Bioinformatics. 2015, [http://bioinformatics.oxfordjournals.org/]
Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Genomics. 2015, [http://www.biomedcentral.com/bmcgenomics/supplements/16/S12]
Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Bioinformatics. 2015, [http://www.biomedcentral.com/bmcbioinformatics/supplements/16/S18]
Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Medical Genomics. 2015, [http://www.biomedcentral.com/bmcmedgenomics/supplements/8/S4]
Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Systems Biology. 2015, [http://www.biomedcentral.com/bmcsystbiol/supplements/9/S6]
IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2015, [http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8857]
Journal of Bioinformatics and Computational Biology. 2015, [http://www.worldscientific.com/toc/jbcb/13/05]
Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Genomics. 2015, [http://www.biomedcentral.com/bmcgenomics/supplements/16/S12/I1]
Wong YH, Wu CC, Lai HY, Jheng BR, Weng HY, Chang TH, et al: Identification of network-based biomarkers of cardioembolic stroke using a systems biology approach with time series data. BMC Syst Biol. 2015, 9 (Suppl 6): S4-
Vandewater L, Brusic V, Wilson W, Macaulay L, Zhang P: An adaptive genetic algorithm for selection of blood-based biomarkers for prediction of Alzheimer's disease progression. BMC Bioinformatics. 2015, 16 (Suppl 18): S1-
Wu C, Schwartz JM, Brabant G, Peng SL, Nenadic G: Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events. BMC Syst Biol. 2015, 9 (Suppl 6): S5-
Cheung VG, Spielman RS: Genetics of human gene expression: mapping DNA variants that influence gene expression. Nat Rev Genet. 2009, 10: 595-604.
Picchetti T, Chiquet J, Elati M, Neuvial P, Nicolle R, Birmelé E: A model for gene deregulation detection using expression data. BMC Syst Biol. 2015, 9 (Suppl 6): S6-
Elati M, Neuvial P, Bolotin-Fukuhara M, Barillot E, Radvanyi F, Rouveirol C: LICORN: learning cooperative regulation networks from gene expression data. Bioinformatics. 2007, 23 (18): 2407-2414.
Taguchi YH: Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage. BMC Bioinformatics. 2015, 16 (Suppl 18): S16-
Franssens V, Bynens T, Van den Brande J, Vandermeeren K, Verduyckt M, Winderickx J: The benefits of humanized yeast models to study Parkinson's disease. Oxid Med Cell Longev. 2013, 2013: 760629-
Wu WS, Wang CC, Jhou MJ, Wang YC: YAGM: a web tool for mining associated genes in yeast based on diverse biological associations. BMC Syst Biol. 2015, 9 (Suppl 6): S1-
Wu WS, Lai FJ: Functional redundancy of transcription factors explains why most binding targets of a transcription factor are not affected when the transcription factor is knocked out. BMC Syst Biol. 2015, 9 (Suppl 6): S2-
Lai FJ, Chang HT, Wu WS: PCTFPeval: a web tool for benchmarking newly developed algorithms for predicting cooperative transcription factor pairs in yeast. BMC Bioinformatics. 2015, 16 (Suppl 18): S2-
Stoney RA, Ames RM, Nenadic G, Robertson DL, Schwartz JM: Disentangling the multigenic and pleiotropic nature of molecular function. BMC Syst Biol. 2015, 9 (Suppl 6): S3-
Konishi T: Principal component analysis for designed experiments. BMC Bioinformatics. 2015, 16 (Suppl 18): S7-
Kandala S, Kim IM, Su H: Neddylation and deneddylation in cardiac biology. Am J Cardiovasc Dis. 2014, 4 (4): 140-58.
Yavuz AS, Sözer NB, Sezerman OU: Prediction of neddylation sites from protein sequences and sequence-derived properties. BMC Bioinformatics. 2015, 16 (Suppl 18): S9-
Kao HJ, Huang CH, Bretaña NA, Lu CT, Huang KY, Weng SL, et al: A two-layered machine learning method to identify protein O-GlcNAcylation sites with O-GlcNAc transferase substrate motifs. BMC Bioinformatics. 2015, 16 (Suppl 18): S10-
Sowmya G, Ranganathan S: Discrete structural features among interface residue-level classes. BMC Bioinformatics. 2015, 16 (Suppl 18): S8-
Srinivasulu YS, Wang JR, Hsu KT, Tsai MJ, Charoenkwan P, Huang WL, Huang HL, Ho SY: Characterizing informative sequence descriptors and predicting binding affinities of heterodimeric protein complexes. BMC Bioinformatics. 2015, 16 (Suppl 18): S14-
Ren J, Liu Q, Ellis J, Li J: Positive-unlabeled learning for the prediction of conformational B-cell epitopes. BMC Bioinformatics. 2015, 16 (Suppl 18): S12-
Coltelli P, Barsanti L, Evangelista V, Frassanito AM, Gualtieri P: Water monitoring: automated and real time identification and classification of algae using digital microscopy. Environ Sci Process Impacts. 2014, 16 (11): 2656-2665.
Kien LL, Li-Lee C, Dhillon SK: Automated identification of copepods using digital image processing and artificial neural network. BMC Bioinformatics. 2015, 16 (Suppl 18): S4-
Sun S, Wang X, Gao X, Ren L, Su X, Bu D, et al: Condensing Raman spectrum for single-cell phenotype analysis. BMC Bioinformatics. 2015, 16 (Suppl 18): S15-
Chang CW, Lee WB, Chen-Deng A, Liu T, Tseng JT, Chang DTH: Light-RCV: a lightweight read coverage viewer for next generation sequencing data. BMC Bioinformatics. 2015, 16 (Suppl 18): S11-
Kimura K, Koike A: Analysis of genomic rearrangements by using the Burrows-Wheeler transform of short-read data. BMC Bioinformatics. 2015, 16 (Suppl 18): S5-
Chen YL, Lee CC, Lin YL, Yin KM, Ho CL, Liu T: Obtaining long 16S rDNA sequences using multiple primers and its application on dioxin-containing samples. BMC Bioinformatics. 2015, 16 (Suppl 18): S13-
Jayasundara D, Saeed I, Chang BC, Tang SL, Halgamuge SK: Accurate reconstruction of viral quasispecies spectra through improved estimation of strain richness. BMC Bioinformatics. 2015, 16 (Suppl 18): S3-
Presidential Commission for the Study of Bioethical Issues. 2012. Privacy and progress in whole genome sequencing. 2015, [http://bioethics.gov/cms/sites/default/files/PrivacyProgress508.pdf]
Hormozdiari F, Joo JW, Wadia A, Guan F, Ostrosky R, Sahai A, et al: Privacy preserving protocol for detecting genetic relatives using rare variants. Bioinformatics. 2014, 30 (12): i204-211.
He D, Furlotte NA, Hormozdiari F, Joo JW, Wadia A, Ostrovsky R, et al: Identifying genetic relatives without compromising privacy. Genome Res. 2014, 24 (4): 664-672.
Shimizu K, Nuida K, Arai H, Mitsunari S, Attrapadung N, Hamada M, et al: Privacy-preserving search for chemical compound databases. BMC Bioinformatics. 2015, 16 (Suppl 18): S6-
International Conference on Bioinformatics 2016 (InCoB2016). 2015, [http://incob16.apbionet.org/]
We thank all reviewers for their timely submission of reports. We also thank NOVARTIS Foundation (Japan) for the Promotion of Science, Level Five Ltd., Nabe International Co. Ltd., Amelieff Co. Ltd., and EpiVax Inc. for sponsorships given to GIW/InCoB2015. We are grateful for the material support from AIST Tokyo Waterfront, JSBi and Human Genome Center (University of Tokyo). The excellent support by staff of the APBioNet Secretariat and CBRC was appreciated. Finally, we thank Isobel Peters (BioMed Central) for her support during the prepublication phase.
This article has been published as part of BMC Bioinformatics Volume 16 Supplement 18, 2015: Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/16/S18.
The authors declare that they have no competing interests.
CS and SR wrote the introduction. CS and SMY (Program Committee Co-chairs) managed the review, revision and decision processes. TWT, PH and SR supported the post-acceptance and editorial processing, respectively. All authors have read and approved the final manuscript.
About this article
Cite this article
Schönbach, C., Horton, P., Yiu, S. et al. GIW and InCoB are advancing bioinformatics in the Asia-Pacific. BMC Bioinformatics 16, I1 (2015). https://doi.org/10.1186/1471-2105-16-S18-I1
- Unsupervised Feature Extraction
- Temporal Gene Expression Profile
- Maximal Dependence Decomposition
- Neddylation Site