Table 1 Distribution of one protein in the PPI to multiple nodes in the pathway.

From: New challenges for text mining: mapping between text and manually curated pathways

Distribution  Frequency  Ratio
1             12,718     0.0737
2             16,180     0.0937
3             31,769     0.1841
4             49,408     0.2863
5              3,403     0.0197
6             18,205     0.1055
7              3,454     0.0200
8              6,435     0.0373
9              4,797     0.0278
10             2,082     0.0121
11             2,125     0.0123
12                35     0.0002
13               262     0.0015
18                46     0.0003
>20           21,655     0.1255
A single node in the PPI network tends to correspond to multiple nodes in the manually constructed pathway, according to its state transitions. The distribution is the number of pathway nodes to which a protein in the pairs extracted from MEDLINE corresponds. The frequency is the number of times protein names with that distribution occur in the extracted pairs. For example, the frequency 16,180 for distribution 2 means that protein names mapped to two nodes in the pathway occur 16,180 times in the pairs extracted from MEDLINE. On average, a single protein in the extracted pairs can be associated with 7.81 nodes in the manually curated pathway.
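
The table does not define the Ratio column. A minimal sketch, assuming each ratio is that row's frequency divided by the sum of all frequencies, reproduces the listed values to within the last printed digit. The variable names are illustrative and the frequencies are copied from the table above.

```python
# Frequencies copied from Table 1, keyed by distribution
# (number of pathway nodes a protein name maps to).
frequencies = {
    "1": 12718, "2": 16180, "3": 31769, "4": 49408, "5": 3403,
    "6": 18205, "7": 3454, "8": 6435, "9": 4797, "10": 2082,
    "11": 2125, "12": 35, "13": 262, "18": 46, ">20": 21655,
}

# Assumption: Ratio = frequency / total number of occurrences in the table.
total = sum(frequencies.values())
ratios = {dist: freq / total for dist, freq in frequencies.items()}

# Print the table with the recomputed ratio, four decimal places.
for dist, freq in frequencies.items():
    print(f"{dist:>3}  {freq:>6,}  {ratios[dist]:.4f}")
```

Under this assumption the recomputed ratios sum to 1. The reported average of 7.81 nodes per protein cannot be recomputed from the table alone, because the ">20" row does not give the exact node counts.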