Skip to main content

Using HPC for teaching and learning bioinformatics software: Benefits and challenges

Background

We present our work on using the XSEDE high-performance computing (HPC) network to support and facilitate hands-on bioinformatics tasks for participants of our Essentials of Next Generation Sequencing (NGS) workshop, as well as for students and other learners. In the summer of 2012, the University of Kentucky hosted the NGS workshop, attended by faculty and students from across the Commonwealth who were introduced to the laboratory and bioinformatic components of next-generation sequencing and sequence analysis. Participants used next-generation technology to sequence real genetic material, then used a variety of bioinformatics software tools to assemble those sequences, compare and align them to other sequences, predict genes, and visualize the genome. Due to the success of the 2012 workshop, the second workshop, planned for this summer, is expected to be larger in scale and to include even more participants. It will furthermore include several additional bioinformatics tools and tasks. Since participants will be simultaneously running intensive bioinformatics computing tasks, the resources required will exceed the capacity of the single twelve-core server used to support the workshop last year. One particular resource that appears promising to meet our intensive computational needs is the XSEDE grid computing network, a follow-on to the TeraGrid project designed specifically for "e-Science" and scientific computing. Many of the systems in the XSEDE network already support some of the software used within our workshop; however, many of the programs we will demonstrate have not previously been installed on tested on the XSEDE network. We will describe our experiences porting these applications to, and deploying them on, XSEDE. We will also discuss the challenges that the HPC approach presents for teaching and learning, particularly the complexities of navigating between time-sharing systems and remote job scheduling.

Acknowledgements

This work was partially supported by NIGMS Grant 1R01GM086888-01, Kentucky NSF-EPSCoR Grant 0814194, NSF Grant EF-0523661, and USDA-NRA Grant 2005-35319-16141.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Tyler Parke.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Parke, T., Farman, M., Farnsworth, E. et al. Using HPC for teaching and learning bioinformatics software: Benefits and challenges. BMC Bioinformatics 14, A18 (2013). https://doi.org/10.1186/1471-2105-14-S17-A18

Download citation

Keywords

  • Sequence Analysis
  • Generation Sequencing
  • Next Generation Sequencing
  • Software Tool
  • Genetic Material