TY - JOUR AU - Meacham, Frazer AU - Boffelli, Dario AU - Dhahbi, Joseph AU - Martin, David IK AU - Singer, Meromit AU - Pachter, Lior PY - 2011 DA - 2011/11/21 TI - Identification and correction of systematic error in high-throughput sequence data JO - BMC Bioinformatics SP - 451 VL - 12 IS - 1 AB - A feature common to all DNA sequencing technologies is the presence of base-call errors in the sequenced reads. The implications of such errors are application specific, ranging from minor informatics nuisances to major problems affecting biological inferences. Recently developed "next-gen" sequencing technologies have greatly reduced the cost of sequencing, but have been shown to be more error prone than previous technologies. Both position specific (depending on the location in the read) and sequence specific (depending on the sequence in the read) errors have been identified in Illumina and Life Technology sequencing platforms. We describe a new type of systematic error that manifests as statistically unlikely accumulations of errors at specific genome (or transcriptome) locations. SN - 1471-2105 UR - https://doi.org/10.1186/1471-2105-12-451 DO - 10.1186/1471-2105-12-451 ID - Meacham2011 ER -