From: ProtPlat: an efficient pre-training platform for protein classification based on FastText
# protein sequence
# protein families
< 100
5474
< 200
7433
< 300
8775
< 500
10,523
≥ 500
7249