Skip to main content

Advertisement

Table 4 Dataset statistics

From: Automated assessment of biological database assertions using the scientific literature

Articles statistics      
  Mentions
# articles Mention Avg Max Max entity  
1,135,611 Gene 6.15 2040 PMC100320  
  Disease 11.01 1272 PMC100785  
Object mentions in #documents      
Mention # uniq men. Avg Max Max entity  
Gene 54,447,840 24.56 61,248 NAT2  
Disease 55,850,078 1402 273,007 Heavy chain  
Relational statements      
Type Avg Min Max Correct Incorrect
Etiology 141.86 1 12,296 989 1002
PPIs 14.11 1 1626 1758 2899