Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management

Table 2 Pipeline performance evaluation on a synthetic dataset containing two mutations

% of reads from amplicon A	SD1			SD2
	Trimming pipeline		MiSeq pipeline	Trimming pipeline		MiSeq pipeline
	First step of variant calling	Second step of variant calling	MiSeq pipeline	First step of variant calling	Second step of variant calling	MiSeq pipeline
36.75	1	2	1	1	1	1
100	2	2	2	2	2	2
90	2	2	2	2	2	2
80	2	2	2	2	2	2
70	2	2	2	2	2	2
60	2	2	2	2	2	2
50	1	2	2	1	2	2
40	1	2	1	1	2	2
30	1	2	1	1	2	2
20	1	2	1	1	2	1
10	1	2	1	1	2	1
0	0	0	0	0	0	0

Results for both SD1 (two single nucleotide mutations) and SD2 (a single nucleotide insertion and a single nucleotide mutation) synthetic datasets are reported. The number of mutations found with trimming pipeline (during the first and second variant calling step) is reported. MiSeq pipeline performances for SD1 are comparable with the first step of variant calling of trimming pipeline and can identify the second mutation only if the percentage of reads from amplicon A (not affected by ADO) is above 50 %. For lower percentages, only trimming pipeline with the second step of variant calling can correctly identify the second mutation, even if amplicon A reads percentage lowers to 10 %. In SD2, trimming pipeline performances are identical to SD1, while MiSeq performances slightly improve, being able to identify the second mutation in two additional configurations (30 % and 40 %)

ISSN: 1471-2105