Proceedings of the 2017 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) Conference
© The Author(s). 2017
Published: 28 December 2017
The XIVth Annual MidSouth Computational Biology and Bioinformatics Society was held in Little Rock, AR From March 23-25st 2017 and was co-hosted by University of Arkansas at Little Rock, University of Arkansas for Medical Sciences, Little Rock, AR and National Center for Toxicological Research, Jefferson, AR. The fourteenth annual conference entitled “Make them Safer Make them Better: Bioinformatics and the Development of Therapeuticals”. There were 220 conference registrants and 129 abstracts submitted, including 62 oral and 67 poster presentations.
The conference was co-chaired by Cesar M. Compadre, Ph.D. from UAMS and William Slikker Jr., Ph.D. from FDA/NCTR. The program was co-chaired by Shraddha Thakkar, Ph.D., and Weida Tong, Ph.D., from Division of Bioinformatics and Biostatics of FDA/NCTR Conference committee members were Darin E. Jones, Ph.D., Assistant Professor and Mary Yang, Ph.D., from UALR, Miss Ujwani Nukala, MS, from university of Arkansas at Little Rock, AR. For 2018–9, Dr. Ramin Homayouni, Ph.D. from University of Memphis, Memphis, TN was chosen as President-Elect and Bindu Nanduri, Ph.D., from Mississippi State University, Starkville, MS as President.
Keynote speakers were:
Day 1: March 23rd 2017
“Pharmacogenetic and Genomic applications for Safety and Therapeutic Efficacy Assessment in drug development programs” by Prof. Dr. Jürgen Borlak, Hannover Medical School, Hannover, Germany.
Day 2: March 24th 2017
“The Top 5 Greatest Bioinformatics Graphs Never Published” by Wendell Jones, Ph.D., Q2 Solutions | EA Genomics, Morrisville, North Carolina.
Day 3: March 25th 2017
“Targeting Undruggable Protein Tyrosine Phosphatases” by John Lazo, Ph.D., Professor in Pharmacology, University of Virginia, Charlottesville, VA.
The conference program included four workshops:
Workshop 1: MedDRA, by Anna Zhao-Wong, Ph.D.
Workshop 2: Next-Generation sequencing using Galaxy, by Binsheng Gong, Ph.D.
Workshop 3: PubChem, by Yanli Wang, Ph.D.
Workshop 4: Next-Generation Sequencing and Bioinformatics by Wenming Xiao, Ph.D.
There were 9 breakout sessions. Topics and facilitators were:
Breakout Session I: Metagenomics and the Microbiome, Carl E. Cerniglia, Ph.D.
Breakout Session II: Biomedical Informatics, Fred Prior, Ph.D.
Breakout Session III: Machine Learning and Chemoinformatics, Joshua Swamidass, MD, Ph.D.
Breakout Session IV: Drug Design and Development, Cesar Compadre, Ph.D.
Breakout Session V: Biomarker and high-throughput data analysis, Mary Yang, Ph.D.
Breakout Session VI: Genomics and therapeutics development, Zhichao Liu, Ph.D.
Breakout Session VII: Reproducible Genomics and Toxicogenomics, Joshua Xu, Ph.D.
Breakout Session VIII: in silico and in vivo Adverse Reaction Detection, Minjun Chen, Ph.D.
Breakout Session IX: Systems pharmacology and Bioinformatics, Jake Chen, Ph.D.
Best Paper Award, MCBIOS 2017: Cory Giles et al. “ALE: Automated Label Extraction from GEO metadata” .
This year MCBIOS launched the “MCBIOS Young Scientist Excellence” awards to recognize students and postdoctoral fellows that exhibit scientific excellence in the field of Bioinformatics. Student and postdoctoral fellows to go through a rigorous award application and the top candidates give a plenary presentation on the first day of the conference. To be able to compete, students submitted an abstract with separate descriptions of the innovations in the research and their specific roles in carrying out the work. Candidates were first evaluated by the MCBIOS board members and then by a panel of judges (including keynote speakers), who evaluated applications for the quality and impact of the research. The quality of professional presentation is the primary consideration for receiving the award as well as creativity, dedication, and multidisciplinary contributions demonstrated by the candidates. The idea was to select candidates with demonstrated multidisciplinary contributions and initiative.
Post-Doctoral winners of the MCBIOS Young Scientist Excellence award
First Place (tie):
Harsh Dweep, Ph.D.
Institution: US FDA/NCTR.
Title of the presentation: Defining a Landscape of Genes, miRNA and their Associations in Hepatocarcinogenesis: A Study of Thioacetamide with Multiple Doses and Time Intervals.
Tanmay Bera, Ph.D.
Institution: US FDA/NCTR.
Title: Developing Image Analysis Methods to Identify the Species of Food Contaminating Beetles.
Name: Suxing Liu, Ph.D.
Institution: Arkansas State University.
Title: Novel Low Cost 3D Surface Model Reconstruction System for Plant Phenotyping.
Name: Suguna Devi Sakkiah.
Institution: US FDA/NCTR.
Title: Development and Validation of Estrogen Receptor Beta Binding Prediction Model Using Large Sets of Chemicals.
Student winners of the MCBIOS Young Scientist Excellence award
Name: Brian Delavan.
University: University of Arkansas of Little Rock (UALR) and US FDA/NCTR.
Title: Drug Repurposing for LEOPARD Syndrome by Integrating Chemical Structure and Genomics based Approaches.
Name: Ujwani Nukala.
University: UALR and UAMS.
Title: Development of novel vitamin E analogs, Tocoflexols with enhanced bioavailability using in silico approaches.
Name: Shahin Boluki.
University: Texas A&M University.
Title: Prior Construction for Optimal Bayesian Classification Using Unlabeled Data.
Poster session award:
The poster session was held at the end of the first day of the meeting. Student and post-doctoral presenters presented their work at the poster sessions and it was judged for presentation quality by a panel of MCBIOS professional members that attended the conference.
Name: Priyam Patel
University: University of Memphis.
Title: Automated Bioinformatics Analysis Package using System On Chip (ABAPSoC).
Second Place (tie):
Name: Alan Amaya
University: East Tennessee State University.
Title: Analyzing the Protein-Protein Interaction Network of Tnf-Alpha.
Name: Bryan Naidenov
University: Oklahoma State University.
Title: Novel Gene Discovery by Genome Completion through De Novo Assembly of Long-Reads.
Name: Anqi Walbaum.
Title: Computational Modeling of the Interaction of Novel Quaternary Ammonium Molecules with the α9β10 Nicotinic Acetylcholine Receptor.
Post-Doctoral Winners for the poster presentation
Name: Patrick Apopa, Ph.D.
Title: The Role of Fastkd3 Gene Expression In Oncogenesis of Non-Small Cell Lung Carcinoma.
Name: Shuzhen Sun, Ph.D.
University: Oklahoma State University.
Title: SNP Variable Selection by Generalized Graph Domination.
Third Place (tie)
Name: Thidathip Wongsurawat, Ph.D.
Title: R-loop Forming Structure Prediction in Viral Genomes.
Name: Dianke Yu
Title: Roles of Long Non-Coding RNAs In Acetaminophen-Induced Liver Injury.
Selecting papers for the MCBIOS XI proceedings
From the work presented at MCBIOS 2017, a total of 24 papers were submitted to be considered for publication in this year’s Proceedings, and 15 papers were accepted (63% acceptance rate). At least 2 reviewers anonymously peer-reviewed all submitted papers and acceptable papers were quantitatively ranked on the basis of three evaluation criteria: Novelty (1–5), Impact (1–5) and Clarity (1–3). Editors that were co-authors of submitted papers were not permitted to handle their own papers editorially. Papers generally fell into three categories:
Networks and microbial communities
Zongliang Yue et al. applied a systems pharmacology framework to reposition drugs for polygenic diseases, in this case Parkinson’s disease (PD) . Integration of GWAS and gene expression data in the context of regulatory networks enabled the identification of PD-specific modules that can be targeted through drug repositioning with a score based on known drug-gene activity profiles.
Hyundoo Jeong et al. report CUFID-query, software for local network alignment, whereby one can query network modules against larger networks . CUFID-query will detect conserved functional modules within a large network that are expected to perform similar functions to the queried subnetwork on the basis of how nodes are interconnected.
Quang Minh Tran et al. address a challenging problem in metagenomics by identification and quantification of microbial genomes from unknown bacteria from environmental samples using next-generation sequencing information . For this analysis they used 16S rRNA instead of whole genome. They demonstrated in the manuscript that, accurate and robust predictions can be made at different read coverage and percentage of unknown bacteria.
Genomics & Transcriptomics
Li and Yang report on an approach to identify orthologous long non-coding RNAs (lncRNAs) . Unlike proteins, lncRNAs tend to have much less sequence conservation, and are harder to identify. Focusing on lncRNAs in human vs rat brain tissue, they identify 140 new lncRNAs not present in the existing databases.
Keqin Liu et al. investigated the role of non-significantly mutated cancer genes that remain underexplored due to hard significance thresholds . The study demonstrates that non-significantly mutated genes in endometrial cancer can effectively classify histological subtypes, predict clinical outcomes, and are enriched in relevant signaling pathways. The results suggest that less significant gene mutations should be considered along with the more significant ones.
Se-ran Jun et al. present an in-silico study of the Zika Virus genome, focusing specifically on the contrast between the Brazilian strain, unique for causing microcephaly, and the Asian and African strains . They use a robust set of Zika viral genome data and compare their findings to those of other groups that have worked with different subsets of this data and address the occasional divergence in the results/conclusions that currently exist in the Zika literature.
Chun-Chi Chen et al. develop a novel method for predicting piRNAs in genome sequences . The method classifies piRNAs based on shared sequence motifs and identifies predictive features using n-gram models. They demonstrate the algorithm using evaluations in three species – Homo sapiens, Rattus norvegicus, and Mus musculus.
Ethan Rath et al. aimed to identify trans-acting sRNAs that can be substrates of RNaseIII by comparing the RNase III gene deleted mutants with the Streptococcus pyogenes wild-type using RNA-seq data . To achieve that, they developed a custom script that can detect reads that support the intergenic regions of the S. pyogenes genome. With their analysis they were able to identify the novel sRNAs to expand understanding of the regulatory elements involved in S. Pyogenes.
Cory Giles et al. developed a tool for automated extraction of labels such as gender, tissue, etc. for GEO data . They also present a tool for predicting missing labels using probabilistic measures via gene expression data. They find that first assigning labels using heuristic text-extraction approaches enables the creation of larger training datasets for downstream machine learning models, and achieves better label prediction.
The manuscript by Recep Erol et al. reports an improved computational approach to detect malignancy in skin lesions from dermatoscopic images . Their method utilized Level Set Propagation (LSP) to detect abrupt lesion boundaries. The texture features of the lesions were then used in several different machine learning classifiers and evaluated for accuracy of malignancy detection. Using a fully-connected multi-hidden layer Neural Net classifier they achieved a specificity of 78%.
Maxwell et al. compared Deep learning Neural Networks (DNN) with standard multi-label classification methods for classifying chronic diseases such as diabetes, hypertension and fatty liver from anonymous medical records from over 110,300 for intelligent health risk prediction. DNNS had the highest accuracy compared to SVM and MLKNN classifiers .
Shahin Boluki et al. introduce a new method called Maximal Knowledge-Driven Information Prior (MKDIP), which utilizes an Optimal Bayesian Classifier framework to integrate gene regulatory and pathway knowledge for phenotypic classification . The performance of MKDIP was favorable compared to several Bayesian and non-Bayesian classification methods using two well-known pathways and a gene expression dataset on non-small cell lung cancer.
Mohsen Sharifi et al. develop a machine-learning method to predict which drugs could cause a potentially life-threatening arrhythmia known as Torsade de Pointes (TdP) . The method uses 3-dimensional spectral data-activity relationships (3D–SDAR) to identify molecular features responsible for the structure-activity relationship between drugs and the hERG receptor. The contribution of this method is to enable new drugs to be screened early against their potential to cause cardiac arrhythmias, which is a major cause for eventual failure of new drugs.
Visanu Wanchai et al. present a web-friendly interface that provides all available bacterial organisms from major public databases taken from several annotation sources including draft genomes . The tool offers users analysis and visualization capabilities whereby quality scores can be utilized as metrics for downstream assessments of bacterial genome comparisons. Quality scores are calculated using the following methods: assembly quality, number of rRNA, and tRNA genes and the occurrence of conserved functional domains.
Mikailov et al. report on a new parallelization method to remove limitations of multi-threading and Message Passing Interface parallelization techniques, scale bioinformatics applications performing sequence search and alignment across a HPC cluster, and adds checkpointing capabilities. This method, referred to as a “dual segmentation” method, is based on segmentation of both query and reference database combining partial solutions published earlier. Applying this method, BLAST run time fell from 27 days to <4 h .
The 15th Annual MCBIOS conference will be held in Starkville, Mississippi from March 29th-31st, 2018.
We would like to thank the many anonymous peer reviewers who helped to ensure the quality of these Proceedings. MCBIOS is a regional affiliate of the International Society for Computational Biology (http://www.ISCB.org). For information regarding MCBIOS and our future meetings, see http://www.MCBIOS.org.
Funding for the publication of this editorial was authorized by and obtained from the Mid-South Computational Biology and Bioinformatics Society. Funding for the conference was made possible in part by, the Food and Drug Administration through grant 1R13FD005931 and Arkansas INBRE program, supported by NIH/NIGMS grant # P20GM103429 (formerly P20RR016460). Views expressed in written conference materials or publications and by speakers and moderators do not necessarily reflect the official policies of Department of Health and Human Services.
About this supplement
This article has been published as part of BMC Bioinformatics Volume 18 Supplement 14, 2017: Proceedings of the 14th Annual MCBIOS conference. The full contents of the supplement are available online at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-18-supplement-14.
All authors of this paper served as editors for these proceedings, with JDW serving as Senior Editor. All authors helped write this editorial. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Giles CB, Brown CA, Ripperger M, Dennis Z, Roopnarinesingh X, Porter H, Perz A, Wren JD. ALE: automated label Extraction from GEO metadata. BMC Bioinformatics. 2017;S1Google Scholar
- Yue Z, Arora I, Zhang EY, Laufer V, Bridges SL, Chen JY. Repositioning drugs by targeting network modules: a Parkinson’s disease case study. BMC Bioinformatics. 2017;S2Google Scholar
- Jeong H, Qian X, Yoon B. CUFID-query: accurate network querying through random walk based network flow estimation. BMC Bioinformatics. 2017;S12Google Scholar
- Tran Q, Pham D, Phan V. Using 16S rRNA gene as marker to detect unknown bacteria in microbial communities. BMC Bioinformatics. 2017;S14Google Scholar
- Li D, Yang MQ. Identification and functional annotation of conserved lncRNAs in human and rat brain. BMC Bioinformatics. 2017;S3Google Scholar
- Liu K, He L, Liu Z, Xu J, Liu Y, Kuang Q, Wen Z, Li M. Mutation status coupled with RNA-sequencing data can efficiently identify important non-significantly mutated genes serving as diagnostic biomarkers of endometrial cancer. BMC Bioinformatics. 2017;S4Google Scholar
- Jun S, Wassenaar TM, Wanchai V, Patumcharoenpol P, Nookaew I, Ussery DW. Suggested mechanisms for Zika virus causing microcephaly: what do the genomes tell us? BMC Bioinformatics. 2017;S7Google Scholar
- Chen C, Qian X, Yoon B. Effective computational detection of piRNAs using n-gram models and support vector machine. BMC Bioinformatics. 2017;S9Google Scholar
- Rath EC, Pitman S, Cho KH, Bai Y. Identification of streptococcal small RNAs that are putative targets of RNase III through bioinformatics analysis of RNA sequencing data. BMC Bioinformatics. 2017;S10Google Scholar
- Erol R, Bayraktar M, Kockara S, Kaya S, Halic T. Texture based skin lesion abruptness quantification to detect malignancy. BMC Bioinformatics. 2017;S5Google Scholar
- Maxwell A, Li R, Yang B, Weng H, Ou A, Hong H, Zhou Z, Gong P, Zhang C: Deep Learning Architectures for Multi-label Classification of Intelligent Health Risk Prediction BMC Bioinformatics 2017:S11.Google Scholar
- Boluki S, Esfahani MS, Qian X, Dougherty ER. Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors. BMC Bioinformatics. 2017;S6Google Scholar
- Sharifi M, Buzatu D, Harris S, Wilkes J. Development of models for predicting torsade de pointes cardiac arrhythmias using perceptron neural networks. BMC Bioinformatics. 2017;S8Google Scholar
- Wanchai V, Patumcharoenpol P, Nookaew I, Ussery DW. dBBQs : dataBase of bacterial quality scores. BMC Bioinformatics. 2017;S13Google Scholar
- Mikailov M, Luo F, Barkley S, Valleru L, Whitney S, Liu Z, Thakkar S, Tong W, Petrick N. Scaling bioinformatics applications on HPC. BMC Bioinformatics. 2017;S15Google Scholar