TarPan: an easily adaptable targeted sequencing panel viewer for research and clinical use

Ashby, Cody; Rutherford, Michael; Bauer, Michael A.; Peterson, Erich A.; Wang, Yan; Boyle, Eileen M.; Wardell, Christopher P.; Walker, Brian A.

doi:10.1186/s12859-020-3477-y

Software
Open access
Published: 15 April 2020

TarPan: an easily adaptable targeted sequencing panel viewer for research and clinical use

Cody Ashby ORCID: orcid.org/0000-0002-9361-0283^1,2,
Michael Rutherford^1,2,
Michael A. Bauer^1,2,
Erich A. Peterson¹,
Yan Wang²,
Eileen M. Boyle²,
Christopher P. Wardell^1,2 &
…
Brian A. Walker³

BMC Bioinformatics volume 21, Article number: 144 (2020) Cite this article

2220 Accesses
2 Citations
5 Altmetric
Metrics details

Abstract

Background

The study of cancer genomics continually matures as the number of patient samples sequenced increases. As more data is generated, oncogenic drivers for specific cancer types are discovered along with their associated risks. This in turn leads to potential treatment strategies that pave the way to precision medicine. However, significant financial and analytical barriers make it infeasible to sequence the entire genome of every patient. In contrast, targeted sequencing panels give reliable information on relevant portions of the genome at a fiscally responsible cost. Therefore, we have created the Targeted Panel (TarPan) Viewer, a software tool, to investigate this type of data.

Results

TarPan Viewer helps investigators understand data from targeted sequencing data by displaying the information through a web browser interface. Through this interface, investigators can easily observe copy number changes, mutations, and structural events in cancer samples. The viewer runs in R Shiny with a robust SQLite backend and its input is generated from bioinformatic algorithms reliably described in the literature. Here we show the results from using TarPan Viewer on publicly available follicular lymphoma, breast cancer, and multiple myeloma data. In addition, we have tested and utilized the viewer internally, and this data has been used in high-impact peer-reviewed publications.

Conclusions

We have designed a flexible, simple to setup viewer that is easily adaptable to any type of cancer targeted sequencing, and has already proven its use in a research laboratory environment. Further, we believe with deeper sequencing and/or more targeted application it could be of use in the clinic in conjunction with an appropriate targeted sequencing panel as a cost-effective diagnostic test, especially in cancers such as acute leukemia or diffuse large B-cell lymphoma that require rapid interventions.

Background

The field of cancer genomics has, and continues, to rapidly advance. Large-scale genomic studies have revealed a host of information about a wide variety of cancer types that have led to genetic-driven targeted therapy. However, despite these advancements, due to cost it currently remains infeasible to run whole genome or exome sequencing for every patient. To that end, genomic researchers and clinicians have employed the use of targeted sequencing panels that reveal driver mutations, copy number and structural events relevant to their specific cancer of interest that may be used for diagnostic, prognostic, and theranostic purposes.

Tools such as Integrative Genome Viewer [1] and the Integrated Genome Browser [2] have been designed to view large-scale genomic data and are well-suited for that purpose. However, while it is certainly possible, it is arduous to analyze targeted sequencing samples using conventional tools developed with whole-genome or whole-exome data in mind. For example, to analyze somatic events such as mutations, copy number changes, and structural variants would require the user to load multiple tracks and bam files and manually move the browser to each region of interest. Further, the hardware requirements to load all this data can be burdensome. In response to this problem, tools have begun to appear to aid researchers in these types of analysis such as CNSpecter [3], which is a browser tool that aims to facilitate the inspection of somatic copy number changes. Still, there remains the need for a viewer that can allow a researcher to easily inspect mutations, copy number changes, and structural variants in one application. In response to this problem we have developed the Targeted Panel (TarPan) Viewer.

Implementation

Tool development

TarPan Viewer is written in R Shiny and utilizes argument-driven python scripts to manage targeted panel sequencing data within SQLite databases. The complete design diagram for the tool is shown in Fig. 1. These scripts handle all database functions including creating, updating, and deleting databases and the targeted panel sequencing entries within them.

R Shiny was utilized for its simplicity and ability to harness the power of R’s vast libraries and rich visualization capabilities. Most important is its ability to scale, ensuring smooth access to accommodate a considerable number of users and sizable datasets. R Shiny along with SQLite met our need for cross-platform portability and reliability, no matter the file size.

Installation

TarPan is available (with example data) on GitHub (https://github.com/tcashby/tarpan) and can be installed on all platforms capable of running R Shiny including Windows, macOS, and Linux. Besides R Shiny, all R dependencies must be installed using the included script. Finally, a Python environment with the pandas module [4] is required to use the import scripts. TarPan may either be run in local or server mode depending on the version of R Shiny installed by the user.

Data preparation

TarPan accesses an SQLite database file generated from user provided input and is converted to SQLite tables by a series of custom python scripts. A single database (see Supplemental Figure 1 for database schema) file containing example data is included in the repository. Users may view and interact with the example database, or their specific databases using a standard SQLite database viewer.

The first step is creation of a non-populated database that contains the tables and required information that applies to all samples imported into this specific database. The required data to create an initial database are the output name of the database, the reference genome used (hg19 and hg38 are supported), the pipeline version used, a Browser Extensible Data (BED) file containing the targeted regions of the panel, and a BED file that contains the gene group annotations. All samples that were processed using the same required data may be grouped within the same database. Finally, the user must specify the name of the database in the configuration (config.yml) file for it to appear in TarPan Viewer.

The scripts are designed to take as input standardized formats such as BED and Variant Call Format (VCF). This allows the inclusion of a wide variety of algorithms and tools dependent on user preference. Non-standard formats must be converted according to the instructions specified in the installation document. After conversion of user data to SQLite tables, they are accessed by the RSQLite R package by TarPan and visualized using in a Shiny compatible browser of the user’s choice.

Testing pipeline software

While TarPan is designed to accept common data formats (e.g. BED and VCF files) from a variety of tools, initial testing was performed with the following tools:

Copy number: depths provided by CNVKit (ver 0.9.5) [5]
Somatic variants: provided by Strelka2 (ver. 2.9.10) [6]
Structural variants: provided by Manta (ver. 1.5.0) [7]
Annotations: provided by Variant Effect Predictor (ver. 95.3) [8]

Results and discussion

TarPan user interface

Overview

The TarPan user interface is presented via any web browser. Options for data filtering and view specific options are provided on the left in a panel and the user may switch between views by clicking on tabs at the top. An image showing the interface for TarPan viewer is provided in Fig. 2.

Database selection

If the user has more than one database containing samples, they may select the current one by the dropdown in the selection pane.

Data filtering

TarPan offers the user the ability to filter based on all data available or on specific chromosomes. This filtering applies to all views in TarPan, including the copy number, mutation and structural variant views. Further, if a blacklist BED file is provided to the tool, those regions are hidden by default. If the user wishes to see these blacklist regions they may select the “Show blacklist genes” option.

Copy number calculation and visualization

By default, TarPan will use the ratio values provided by the tool used for depth comparison. If these values are not provided, the tool will calculate a normalized ratio based on chromosomes 1–22. Further adjustments may be made by the user at the whole chromosome level by selecting the “Use specific chromosome for normalization” option and selecting which chromosomes to use. This is especially useful for samples with highly aberrant chromosomes.

Copy number ratio is calculated by the following formula:

$$ R=\frac{T/{T}_{nf}}{N/{N}_{nf}} $$

Where R is the copy number ratio, T is the read depth for a given interval in the tumor sample, N is the read depth for the corresponding interval in the normal sample, T_nf is the tumor normalization factor generated by taking the mean of all given interval depths in the tumor sample and N_nf is the normal normalization factor generated by taking the mean of all given interval depths in the normal sample.

A user may select any combination of chromosomes using checkboxes for manual correction of normalization. Whenever manual correction is employed, only intervals located on the chromosomes that the user selects are used for the normalization. This is useful in cases where certain chromosomes seldom have any copy number aberrations (CNAs).

In addition, there are three views provided for copy number inspection by TarPan. The first is the Whole Genome View in which all intervals on chromosomes are visible and positioned according to their genomic position. The second is Chromosome View in which all intervals on a single user selected chromosome are visible and positioned according to their genomic position. The third is Chromosome Continuous View in which all intervals on a single user selected chromosome are visible and positioned equidistantly in order to more easily view patterns for specific genes. An example of all three views are shown in Fig. 3.

Finally, the user may see visible SNP differences by clicking the “Show SNPs for these regions” option. This shows a plot with the SNPs that were seen as heterozygous in the normal and how they compare in the tumor sample. This data provides the user with the ability to identify where there is loss of heterozygosity in regions where the copy number is more than 1 as well as more confidence in regions of deletion.

Structural variant visualization

Structural variants may be inspected by selecting the structural variants tab. This provides a view of all structural variants that have PASS or “.” in the FILTER column of the VCF file. Optionally the user may choose to view all structural variants detected by the variant caller by selecting the “Show SVs that failed filter” checkbox. Variants are presented in a table, which is searchable and sortable.

Mutation visualization

Mutations may be inspected by selecting the mutation tab. This provides a view of all mutations that had PASS or “.” in the FILTER column of the VCF file. Optionally the user may choose to view all mutations detected by the variant caller by selecting “Show mutations that failed filter”. The algorithm detects annotations produced by the variant effect predictor (VEP) and extracts information such as gene symbol, consequence, amino acid protein code, nucleotide codon code, etc. if available. Otherwise, the raw VCF is displayed. Mutations are presented in a table, which is searchable and sortable.

Auto CN and auto CN groups

Although a strength of TarPan is in the ability of the user to easily manually inspect Copy Number (CN) regions, the user may click the Auto CN and Auto CN Groups tab to view a table containing calls made by the viewer itself. Auto CN shows the user a list of all normalized depth ratios for each interval in the targeted region bed file provided. Auto CN Groups shows the user a list of copy number calls for the entire group for each interval in the gene group BED file provided.

Circos visualization

A Circos-style visualization is provided by the RCircos package [9]. Structural variants, copy number and mutations appear on this Circos plot. Users may choose to hide/show any of these features by selecting the appropriate option. In addition, in the case of translocations users may choose to show the partner chromosome by selecting the “Show Inter-chromosomal SVs” option.

TarPan case studies

Visualization of multiple myeloma data using TarPan

We currently have used TarPan for the visualization and analysis of 100 multiple myeloma samples which are available under European Genome Archive (EGA) accession number EGAS00001002859. In addition, there have been several peer reviewed publications [10,11,12,13] where TarPan was used for screening, validation, or analysis. Examples showing important multiple myeloma events in TarPan are shown such as bi-allelic inactivation of TP53 (Fig. 4a), gain of 1q (Fig. 4b), translocations to the Ig regions (Fig. 4c) and complex structural events involving the proto-oncogene MYC (Fig. 4d). Data from two publicly available multiple myeloma samples are provided as example data in the GitHub repository.

Visualization of follicular lymphoma data using TarPan

Whole exome sequencing data was downloaded for patient TCRBOA7 from the open-access Texas Cancer Research Biobank [14]. This patient sample was obtained from a white male aged 61–70 years old, diagnosed with B-cell follicular lymphoma (FL). To analyze this data in TarPan we created a pseudo-targeted panel by limiting the exome to 490 genes used by the FL custom panel reported by Bouska et al. [15] plus two genes reported as mutated in TCRBOA7 [14]. We then ran the sample through the best practices pipeline described above and visualized the result in TarPan. We observed all previously reported mutations (NFE2L3, KMT2D, CREBBP) [5]. We also observed a deletion event in 1p (shown in Fig. 5a) that affected four genes on the pseudo-targeted panel which was also shown by ichorCNA [16] on WGS data (shown in Supplemental Figure 3).

Visualization of breast cancer cell line using TarPan

Whole genome sequencing data was obtained for the publicly available breast cancer cell line HCC2218C from Illumina Basespace with matching baseline sample B-lymphoblastoid cell line HCC2218BL. To analyze this data with TarPan we created a pseudo-targeted panel based on 79 genes referenced in [17]. We found one mutation in the gene MIEN1. We compared the copy number data from the panel to the results of SNP array data [18]. We observed similar features including a massive amplification on chromosome 17 (Shown in Fig. 5b) that impacted genes ERBB2 (HER2) and GRB7, both of which can be seen in the previously reported data [18] and both of which have been reported as overexpressed or amplified in breast cancer [19, 20].

Visualization of lung cancer data using TarPan

We obtained BAM files for a formalin-fixed paraffin-embedded lung cancer tissue specimen along with its paired normal from the Johann Lab at the UAMS Cancer Institute. The panel used for these samples contained 72 genes of interest in lung cancer. The germline sample was sequenced to a mean depth of 1322 and the tumor was sequenced to a mean depth of 7365. We identified 8 mutations in 7 genes: LRP1B, NFE2L2, GRM8, NTRK2, TP53, MUC16, and SMARCA4. As this panel was strictly a mutation panel and did not include copy number regions we used low-depth WGS with ichorCNA (shown in Supplemental Figure 3) to calculate the depth ratio for targeted genes. This sample included a deletion in in 17p, which in conjunction with the mutation suggests bi-allelic inactivation of TP53 (shown in Fig. 5c).

TarPan use cases

Targeted panel visualization

The primary intended use for TarPan is to visualize data generated from targeted panel sequencing experiments. These can be mutation panels, copy number panels, structural variant panels or any combination thereof. Two of the case studies above show data generated from this use case. The multiple myeloma case study shows the results from a mutation + copy number + translocation targeted panel and the lung cancer case study shows the results from a mutation targeted panel. However, the user may also use other sequencing modalities to refine the results by providing the input to TarPan. An example of this is shown with the lung cancer sample by using low depth WGS to capture copy number depth ratios for genes of interest.

Whole-exome or genome visualization

While the main focus of TarPan is in the visualization of targeted panel sequencing data, as shown above in the follicular lymphoma and breast cancer case studies there are also applications in the field of conventional whole genome and exome sequencing. For example, if a researcher is interested in a smaller subset of genes to answer a specific question, they can easily reduce the data to a more manageable size and use TarPan to visualize the result. A similar use case is in the rapid prototyping of targeted sequencing panels and visualizing the type of information they might provide. For example, a user could limit a more comprehensive exome or genome sample to regions of interest and observe an approximation of what the data would look like on a targeted panel.

Database queries

As TarPan Viewer uses a simple SQLite database file as input, it is also possible for researchers familiar with SQL to easily batch query the entire database. For example, the user could query the copy number or mutation status of a captured interval across all samples.

TarPan future development

There are several potential future developments planned for TarPan. The first is the ability to process and import tumor-only samples, which would be especially useful whenever a paired normal is unavailable such as in some cell line data. The second is a user annotation view that would allow users to manually annotate features of interest. The third is the inclusion of genomic data of different species, such as mouse. Currently TarPan only works on human genomic data (hg19 and hg38). Allowing custom genomic data would open up many new use cases for the viewer. Finally, we think the tool could easily be adapted to potentially aid in cost-effective clinical diagnostic tests using targeted sequencing by automatically generating reports or summaries (e.g. cancer subtypes or risk scores), which could be manually validated by a trained scientist.

Conclusion

TarPan Viewer was developed to aid researchers in visual inspection and exploration of targeted panel sequencing data. Conventional genomic viewers designed for whole-genome or exome data are ill-suited to visualizing data from targeted panel sequencing data. In contrast, TarPan provides easily understood copy number, mutation and structural variant plotting and searchable tables of information relevant to a sample of interest. Its fast and responsive interface runs in a standard browser on any operating system capable of running R Shiny. The tool and scripts are easy to install, requiring only basic knowledge of R and Python, and step-by-step instructions are provided. The included example datasets demonstrate common use cases and highlight TarPan’s key features. The viewer has been well-utilized internally by trained bioinformaticians and cancer biologists to annotate samples. We believe that the described use cases and features illustrate its distinct usefulness. TarPan fills a clear need in the greater scientific community, helps advance cancer research, and is a step closer to the realization of personalized medicine.

Availability of data and materials

The source code and installation instructions for TarPan can be found at: https://github.com/tcashby/tarpan.

·Project name: TarPan Viewer

·Project home page: https://github.com/tcashby/tarpan

·Operating system(s): Platform Independent

·Programming language(s): Python, R

·Other requirements: Python 3, R 3.5 or higher

·License: GPL v3

·Any restriction to use by non-academics: non-applicable

Abbreviations

BED:: Browser Extensible Data (file type)
CN:: Copy Number
CNA:: Copy Number Aberrations
CNV:: Copy Number Variant
EGA:: European Genome Archive
FL :: Follicular Lymphoma
Ig:: Immunoglobulin
SNP:: Single Nucleotide Polymorphism
SQL:: Structured Query Language
SV:: Structural Variant
TarPan Viewer:: Targeted Panel Viewer
UAMS:: University of Arkansas for Medical Sciences
VCF:: Variant Call Format (file type)
VEP:: Variant Effect Predictor
WES:: Whole-Exome Sequencing
WGS:: Whole-Genome Sequencing
YML:: YAML Ain’t Markup Language (file type)

References

Thorvaldsdottir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14:178–92.
Article CAS Google Scholar
Nicol JW, Helt GA, Blanchard SG, Raja A, Loraine AE. The integrated genome browser: free software for distribution and exploration of genome-scale datasets. Bioinformatics. 2009;25:2730–1.
Article CAS Google Scholar
Markham JF, Yerneni S, Ryland GL, Leong HS, Fellowes A, Thompson ER, De Silva W, Kumar A, Lupat R, Li J, Ellul J, Fox S, Dickinson M, Papenfuss AT, Blombery P. CNspector: a web-based tool for visualisation and clinical diagnosis of copy number variation from next generation sequencing. Sci Rep. 2019;9:6426.
Article Google Scholar
McKinney W. Data Structures for Statistical Computing in Python; 2010. p. 51–6.
Google Scholar
Talevich E, Shain AH, Botton T, Bastian BC. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput Biol. 2016;12:e1004873.
Article Google Scholar
Saunders CT, Wong WSW, Swamy S, Becq J, Murray LJ, Cheetham RK. Strelka: accurate somatic small-variant calling from sequenced tumor-normal sample pairs. Bioinformatics. 2012;28:1811–7.
Article CAS Google Scholar
Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, Källberg M, Cox AJ, Kruglyak S, Saunders CT. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. 2016;32(8):1220–2.
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, Flicek P, Cunningham F. The Ensembl variant effect predictor. Genome Biol. 2016;17:122.
Article Google Scholar
Zhang H, Meltzer P, Davis S. RCircos: an R package for Circos 2D track plots. BMC Bioinformatics. 2013;14:244.
Article Google Scholar
Walker BA, Mavrommatis K, Wardell CP, Ashby TC, Bauer M, Davies FE, Rosenthal A, Wang H, Qu P, Hoering A, Samur M, Towfic F, Ortiz M, Flynt E, Yu Z, Yang Z, Rozelle D, Obenauer J, Trotter M, Auclair D, Keats J, Bolli N, Fulciniti M, Szalat R, Moreau P, Durie B, Stewart AK, Goldschmidt H, Raab MS, Einsele H, et al. Identification of novel mutational drivers reveals oncogene dependencies in multiple myeloma. Blood. 2018;132:587–97.
Article CAS Google Scholar
Walker BA, Mavrommatis K, Wardell CP, Ashby TC, Bauer M, Davies F, Rosenthal A, Wang H, Qu P, Hoering A, Samur M, Towfic F, Ortiz M, Flynt E, Yu Z, Yang Z, Rozelle D, Obenauer J, Trotter M, Auclair D, Keats J, Bolli N, Fulciniti M, Szalat R, Moreau P, Durie B, Stewart AK, Goldschmidt H, Raab MS, Einsele H, et al. A high-risk, double-hit, group of newly diagnosed myeloma identified by genomic analysis. Leukemia. 2018;33:159–70.
Article Google Scholar
Thanendrarajan S, Tian E, Qu P, Mathur P, Schinke C, van Rhee F, Zangari M, Rasche L, Weinhold N, Alapat D, Bellamy W, Ashby C, Mattox S, Epstein J, Yaccoby S, Barlogie B, Hoering A, Bauer M, Walker BA, Davies FE, Morgan GJ. The level of deletion 17p and bi-allelic inactivation of TP53 has a significant impact on clinical outcome in multiple myeloma. Haematologica. 2017;102:e364–7.
Article CAS Google Scholar
Ashby C, Tytarenko RG, Wang Y, Weinhold N, Johnson SK, Bauer M, Wardell CP, Schinke C, Thanendrarajan S, Zangari M, van Rhee F, Davies FE, Sawyer JR, Morgan GJ, Walker BA. Poor overall survival in hyperhaploid multiple myeloma is defined by double-hit bi-allelic inactivation of <i>TP53</i&gt. Oncotarget. 2019;10:732–7.
Article Google Scholar
Becnel LB, Pereira S, Drummond JA, Gingras M-C, Covington KR, Kovar CL, Doddapaneni HV, Hu J, Muzny D, McGuire AL, Wheeler DA, Gibbs RA. An open access pilot freely sharing cancer genomic data from participants in Texas. Sci Data. 2016;3:160010.
Article Google Scholar
Bouska A, Zhang W, Gong Q, Iqbal J, Scuto A, Vose J, Ludvigsen M, Fu K, Weisenburger DD, Greiner TC, Gascoyne RD, Rosenwald A, Ott G, Campo E, Rimsza LM, Delabie J, Jaffe ES, Braziel RM, Connors JM, Wu C-I, Staudt LM, D’amore F, McKeithan TW, Chan WC. Combined copy number and mutation analysis identifies oncogenic pathways associated with transformation of follicular lymphoma. Leukemia. 2017;31:83–91.
Article CAS Google Scholar
Adalsteinsson VA, Ha G, Freeman SS, Choudhury AD, Stover DG, Parsons HA, Gydush G, Reed SC, Rotem D, Rhoades J, Loginov D, Livitz D, Rosebrock D, Leshchiner I, Kim J, Stewart C, Rosenberg M, Francis JM, Zhang C-Z, Cohen O, Oh C, Ding H, Polak P, Lloyd M, Mahmud S, Helvie K, Merrill MS, Santiago RA, O’Connor EP, Jeong SH, et al. Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors. Nat Commun. 2017;8:1324.
Article Google Scholar
Smith NG, Gyanchandani R, Shah OS, Gurda GT, Lucas PC, Hartmaier RJ, Brufsky AM, Puhalla S, Bahreini A, Kota K, Wald AI, Nikiforov YE, Nikiforova MN, Oesterreich S, Lee AV. Targeted mutation detection in breast cancer using MammaSeq™. Breast Cancer Res. 2019;21:22.
Article Google Scholar
Samorodnitsky E, Jewell BM, Hagopian R, Miya J, Wing MR, Lyon E, Damodaran S, Bhatt D, Reeser JW, Datta J, Roychowdhury S. Evaluation of hybridization capture versus amplicon-based methods for whole-exome sequencing. Hum Mutat. 2015;36:903–14.
Article CAS Google Scholar
Tan M, Yu D. Molecular mechanisms of ErbB2-mediated breast Cancer Chemoresistance. In: Advances in experimental medicine and biology, vol. 608; 2007. p. 119–29.
Google Scholar
Bivin WW, Yergiyev O, Bunker ML, Silverman JF, Krishnamurti U. GRB7 expression and correlation with HER2 amplification in invasive breast carcinoma. Appl Immunohistochem Mol Morphol. 2017;25:553–8.
Article CAS Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

No funding was received for this study.

Author information

Authors and Affiliations

Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA
Cody Ashby, Michael Rutherford, Michael A. Bauer, Erich A. Peterson & Christopher P. Wardell
Cancer Institute: Myeloma Center, University of Arkansas for Medical Sciences, Little Rock, AR, USA
Cody Ashby, Michael Rutherford, Michael A. Bauer, Yan Wang, Eileen M. Boyle & Christopher P. Wardell
Division of Hematology Oncology, Indiana University, Indianapolis, IN, USA
Brian A. Walker

Authors

Cody Ashby
View author publications
You can also search for this author in PubMed Google Scholar
Michael Rutherford
View author publications
You can also search for this author in PubMed Google Scholar
Michael A. Bauer
View author publications
You can also search for this author in PubMed Google Scholar
Erich A. Peterson
View author publications
You can also search for this author in PubMed Google Scholar
Yan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Eileen M. Boyle
View author publications
You can also search for this author in PubMed Google Scholar
Christopher P. Wardell
View author publications
You can also search for this author in PubMed Google Scholar
Brian A. Walker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CA developed and implemented TarPan Viewer with code review from EAP and MR. MR and CA designed and wrote the TarPan import scripts. CA wrote the manuscript with input from MR, EMB, MAB, EAP, CPW and BAW. CA, MAB, and YW downloaded and processed the samples. CA and BAW were involved in the interpretation of the data. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Cody Ashby.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Additional file 2.

Additional file 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Ashby, C., Rutherford, M., Bauer, M.A. et al. TarPan: an easily adaptable targeted sequencing panel viewer for research and clinical use. BMC Bioinformatics 21, 144 (2020). https://doi.org/10.1186/s12859-020-3477-y

Download citation

Received: 06 February 2020
Accepted: 31 March 2020
Published: 15 April 2020
DOI: https://doi.org/10.1186/s12859-020-3477-y

TarPan: an easily adaptable targeted sequencing panel viewer for research and clinical use

Abstract

Background

Results

Conclusions

Background

Implementation

Tool development

Installation

Data preparation

Testing pipeline software

Results and discussion

TarPan user interface

Overview

Database selection

Data filtering

Copy number calculation and visualization

Structural variant visualization

Mutation visualization

Auto CN and auto CN groups

Circos visualization

TarPan case studies

Visualization of multiple myeloma data using TarPan

Visualization of follicular lymphoma data using TarPan

Visualization of breast cancer cell line using TarPan

Visualization of lung cancer data using TarPan

TarPan use cases

Targeted panel visualization

Whole-exome or genome visualization

Database queries

TarPan future development

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary information

Additional file 1.

Additional file 2.

Additional file 3.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us