Fig. 4From: Improvements in viral gene annotation using large language models and soft alignmentsDistribution of min–max normalized pooling distances and soft alignment scores for highly similar (94–99%, blue) and dissimilar (5–10%, orange) sequences. Distances from low- and high-similarity sequences were combined prior to min–max normalizationBack to article page