Skip to main content

Table 2 The summary of real and simulated datasets used in the experiment along with the corresponding reference genome links

From: S-conLSH: alignment-free gapped mapping of noisy long reads

Dataset

Type

Platform

# of reads

Reference genome

H. sapiens-real

Real

PacBio RS II P5/C3 release

290,992

hg38

E. coli-real

Real

PacBio RS II P5/C3 release

300

Escherichia coli str. K-12 substr. MG1655

A. thaliana-real

Real

PacBio RS II P5/C3 release

3,448,228

TAIR10

O. sativa-real

Real

PacBio RS II P5/C3 release

590,268

Build 4.0

S. cerevisiae-real

Real

PacBio RS II P5/C3 release

594,243

S288C (assembly R64)

H. sapiens-sim

Simulated

PBSIM

146,932

hg38