Skip to main content

Table 2 Cluster ranking. AptaCluster (Freq.), AptaCluster (Div.), and APTANI (Freq.), APTANI (Div.) represent the cluster ranking of frequency and diversity (the number of non-redundant sequences) of AptaCluster and APTANI, respectively. Sequences with a frequency of less than 10 were excluded before the clustering analysis. Because FASTAptamer and APTANI did not finish with all sequence data. *: This sequence is filtered as the frequency is less than 10. **: The ranking of clusters is tied; however, the sequences are not grouped in the same cluster. ***: These sequences did not include any motifs estimated by AptaTRACE, thus the sequences are not grouped into any clusters

From: FSBC: fast string-based clustering for HT-SELEX data

Sequence information    Cluster ranking      
SequenceIDRankingFrequencyBindingFASTAptamerAptaClusterAptaClusterAPTANIAPTANIAptaTRACEFSBC
      (Freq.)(Div.)(Freq.)(Div.) (lmin=5)
aggaggggGACTTaggactgggtttagggseq1692237Yes675787015
agggTATGGACTTCgacgtctcggctgaaseq22420057Yes1517151569911
cgcacaggaaggTATGGACTTCgacgtttseq3638750Yes2464655829011
ggTATGGACTTCgacgtcttctgacctaaseq4826753Yes15817268218811
gaaaTATGGACTTCgatacgccggctgagseq52551483Yes60229112740102626 11
agtatctatccGACTTggatttacgttcgseq6845984Yes5469921280561993626 NA***5
tatccGACTTggatggctgagcaaggctaseq710091415Yes731944901252622038626 55
aggaggggGACTTaggactgggtttatgaseq82814784YesNANANANANANANA
gcaggtgtggtttgctgaggTGGGCCctgseq91583447No1121125426
tttggtttgctgTATGGtgggctctgttaseq10870095No78108916 416
gtgagggtgAGGACaggttagcgtggtggseq111051669No911916916 754
ggtgaggcgGACGTatcttttagcaaatcseq121245038No101213135201141
tcgcttgaacggggaactactccaGACGTseq132320380No142123452270NA***41
gTGGGCgcacttagacggggtgatcgtaaseq14375831No75335767833871739NA***37
ACTTAtttgtcttaagtggcgggtcaatgseq15398771No782385564602188847
gggtccCTTCGgggtgacgatggtatctaseq16520504No10746612087417582253NA***11
ggtGTGGGgagggtcgtattgtgtcctgtseq173847126No388456859849921566
cttatttgtgtttagtggcgggcGTTTGtseq182932441No5053911044323NA***92
ctatttgTTCTAgtggcggtcatctaaggseq194400031No509134485920432253NA***88