Skip to main content

Table 4 Dataset statistics

From: Automated assessment of biological database assertions using the scientific literature

Articles statistics

     
 

Mentions

# articles

Mention

Avg

Max

Max entity

 

1,135,611

Gene

6.15

2040

PMC100320

 
 

Disease

11.01

1272

PMC100785

 

Object mentions in #documents

     

Mention

# uniq men.

Avg

Max

Max entity

 

Gene

54,447,840

24.56

61,248

NAT2

 

Disease

55,850,078

1402

273,007

Heavy chain

 

Relational statements

     

Type

Avg

Min

Max

Correct

Incorrect

Etiology

141.86

1

12,296

989

1002

PPIs

14.11

1

1626

1758

2899