Fig. 1From: GRASP2: fast and memory-efficient gene-centric assembly and homolog search for metagenomic sequencing dataOverview of the GRASP2 algorithm. a The reads are overlapped to construct an assembly string graph. b Bubbles and tips are identified and subsequently removed from the assembly graph. c Unbranched paths are collapsed into single edges. Each collapsed edge is considered as a unitig and receives an arbitrary label. d Ungapped alignment is performed between the reference protein sequence and each of the unipath. e High-scoring unipaths are treated as anchors to initialize gapped alignments. f Candidate paths are generated based on the identified anchor unipaths. Gapped alignment is performed between the reference protein sequence against the candidate paths to recruit homolog pathsBack to article page