Skip to main content

Table 3 The distribution of words and sentences in the scheme-annotated CRA corpus

From: A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment

S1

OBJ

METH

RES

CON

        
 

61483

39163

89575

35564

Words

       
 

2145

1396

3203

1241

Sentences

       
 

27%

17%

40%

16%

Sentences

       

S2

BKG

OBJ

METH

RES

CON

REL

FUT

     
 

36828

23493

41544

89538

30752

2456

1174

Words

    
 

1429

674

1473

3185

1082

95

47

Sentences

    
 

18%

8%

18%

40%

14%

1%

1%

Sentences

    

S3

HYP

MOT

BKG

GOAL

OBJT

EXP

MOD

METH

OBS

RES

CON

 
 

2676

4277

28028

10612

15894

22444

1157

17982

17402

75951

29362

Words

 

99

172

1088

294

474

805

41

637

744

2582

1049

Sentences

 

1%

2%

14%

4%

6%

10%

1%

8%

9%

32%

13%

Sentences