Skip to main content

Advertisement

Table 3 The distribution of words and sentences in the scheme-annotated CRA corpus

From: A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment

S1 OBJ METH RES CON         
  61483 39163 89575 35564 Words        
  2145 1396 3203 1241 Sentences        
  27% 17% 40% 16% Sentences        
S2 BKG OBJ METH RES CON REL FUT      
  36828 23493 41544 89538 30752 2456 1174 Words     
  1429 674 1473 3185 1082 95 47 Sentences     
  18% 8% 18% 40% 14% 1% 1% Sentences     
S3 HYP MOT BKG GOAL OBJT EXP MOD METH OBS RES CON  
  2676 4277 28028 10612 15894 22444 1157 17982 17402 75951 29362 Words
  99 172 1088 294 474 805 41 637 744 2582 1049 Sentences
  1% 2% 14% 4% 6% 10% 1% 8% 9% 32% 13% Sentences