Skip to main content

Table 1 Results from the nine protein families.

From: Discovering co-occurring patterns and their biological significance in protein families

Protein name

Pfam ID

Co-occurrence cluster count

Size of the best cluster

PDB ID of the best cluster

Average APC distance of the best cluster

Average pairwise distance

Lipocalin

PF00061

6

4

2CZT

16.77 Ã…

19.26 Ã…

Bacterial rhodopsins

PF01036

2

2

1JGJ

16.52 Ã…

22.51 Ã…

Bacterial antenna complex

PF00556

4

5

1IJD

0 Ã…

19.92 Ã…

Cytochrome c oxidase subunit I

PF00115

2

25

3OM3

26.78 Ã…*

30.00 Ã…

Photosynthetic reaction centre protein family

PF00124

2

7

1PSS

27.87 Ã…

30.19 Ã…

Leptin

PF02024

2

14

1AX8

15.73 Ã…

18.37 Ã…

G-alpha subunit

PF00503

3

8

4G5O

15.78 Ã…

27.45 Ã…

Protein kinase domain

PF00069

2

2

3OZ6

15.32 Ã…

27.51 Ã…

Tyrosine kinase

PF07714

2

8

4HW7

14.43 Ã…

24.99 Ã…

  1. Displays the Co-occurrence Cluster with the lowest average eigenvector distance, and are used to verify the algorithm's effectiveness with a PDB structure. The shorter distance in the comparison is bolded. * means that one or more APCs were not found.