Fig. 6 | BMC Bioinformatics

Fig. 6

From: microclass: an R-package for 16S taxonomy classification

Effect of unknown taxa on r-scores. The four histograms show the distribution of r-scores. Green marks all positive r-scores, and black marks scores more negative than any observed in the contax.full data set. The transition from yellow to red indicates gradually smaller probabilities (from around 10⁻¹ at yellow to 10⁻⁸ at dark red) of observing the corresponding r-score in the training set; red colors correspond to probabilities below 10⁻⁵. The upper left panel shows r-scores when all classified taxa are present in the training data, i.e. there are no unknown taxa. In the upper right panel each genus is unknown, i.e. when a sequence from genus A is classified, the training data contain no sequences from that genus. In the lower panels the same procedure has been repeated, but the training data lack all sequences from the same order and the same phylum, respectively.
