Table 2 Top 50 InterPro superfamilies/domains that have been mapped to clusters with one-to-one correspondence

From: Clustering protein sequences with a novel metric transformed from sequence similarity scores and sequence alignments with neural networks

| InterPro family/domain ID | Type | Number of proteins in the benchmark dataset | Description |
|---|---|---|---|
| IPR001128 | Family | 507 | Cytochrome P450 |
| IPR000685 | Family | 398 | Ribulose bisphosphate carboxylase, large chain |
| IPR002198 | Family | 290 | Short-chain dehydrogenase/reductase SDR |
| IPR004000 | Family | 255 | Actin/actin-like |
| IPR002423 | Family | 226 | Chaperonin Cpn60/TCP-1 |
| IPR001023 | Family | 221 | Heat shock protein Hsp70 |
| IPR002085 | Family | 181 | Zinc-containing alcohol dehydrogenase superfamily |
| IPR000173 | Family | 177 | Glyceraldehyde 3-phosphate dehydrogenase |
| IPR001175 | Family | 169 | Neurotransmitter-gated ion-channel |
| IPR000910 | Family | 169 | HMG1/2 (high mobility group) box |
| IPR001353 | Family | 147 | 20S proteasome, A and B subunits |
| IPR000894 | Family | 141 | Ribulose bisphosphate carboxylase, small chain |
| IPR000298 | Family | 135 | Cytochrome c oxidase, subunit III |
| IPR001019 | Family | 135 | Guanine nucleotide binding protein (G-protein), alpha subunit |
| IPR000568 | Family | 134 | H+-transporting two-sector ATPase, A subunit |
| IPR001400 | Family | 133 | Somatotropin hormone |
| IPR000883 | Family | 131 | Cytochrome c oxidase, subunit I |
| IPR001364 | Family | 131 | Hemagglutinin, HA1/HA2 chain |
| IPR00970 | Family | 130 | Secreted growth factor Wnt protein |
| IPR001664 | Family | 127 | Intermediate filament protein |
| IPR000847 | Domain | 127 | Bacterial regulatory protein, LysR |
| IPR001659 | Family | 124 | Phycobilisome protein |
| IPR001694 | Family | 123 | Respiratory-chain NADH dehydrogenase, subunit 1 |
| IPR001811 | Family | 119 | Small chemokine, interleukin-8 like |
| IPR000215 | Family | 118 | Proteinase inhibitor I4, serpin |
| IPR001926 | Family | 114 | Pyridoxal-5'-phosphate-dependent enzyme, beta subunit |
| IPR000515 | Family | 113 | Binding-protein-dependent transport systems inner membrane component |
| IPR001424 | Family | 112 | Copper/Zinc superoxide dismutase |
| IPR001804 | Family | 111 | Isocitrate/isopropylmalate dehydrogenase |
| IPR001691 | Domain | 109 | Glutamine synthetase, catalytic domain |
| IPR000934 | Domain | 105 | Metallophosphoesterase |
| IPR001189 | Family | 105 | Manganese and iron superoxide dismutase |
| IPR001041 | Domain | 105 | Ferredoxin |
| IPR001099 | Family | 104 | Naringenin-chalcone synthase |
| IPR001450 | Domain | 102 | 4Fe-4S ferredoxin, iron-sulfur binding domain |
| IPR001427 | Family | 102 | Pancreatic ribonuclease |
| IPR000484 | Family | 100 | Photosynthetic reaction centre protein |
| IPR000954 | Family | 98 | Aminotransferase class-III |
| IPR001576 | Family | 93 | Phosphoglycerate kinase |
| IPR000230 | Family | 93 | Ribosomal protein S12, bacterial and chloroplast form |
| IPR002068 | Domain | 91 | Heat shock protein Hsp20 |
| IPR001750 | Domain | 90 | NADH/Ubiquinone/plastoquinone (complex I) |
| IPR000836 | Domain | 90 | Phosphoribosyltransferase |
| IPR001993 | Family | 90 | Mitochondrial substrate carrier |
| IPR001236 | Family | 85 | Lactate/malate dehydrogenase |
| IPR002210 | Family | 83 | Papillomavirus major capsid L1 (late) protein |
| IPR001395 | Family | 81 | Aldo/keto reductase |
| IPR000943 | Family | 80 | Sigma-70 factor |
| IPR002226 | Family | 80 | Catalase |
| IPR001766 | Domain | 80 | Fork head transcription factor |