Table 1 Basic textual statistics of the selected DCC records, showing the mean value per record and the boundaries of the second and third quartile (top) and the total count in the dataset (bottom)

From: Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods

| Letter category | # sentences | # words | # unique words | Word length |
| --- | --- | --- | --- | --- |
| General Practitioner entries | 2.1 (1, 2) | 17.8 (8, 23) | 16.8 (8, 22) | 5.9 (5, 6.5) |
| | 2034 | 33840 | 11080 | |
| Specialist letters | 3.8 (2, 4) | 30.0 (9, 34) | 24.9 (8, 30) | 7 (5.8, 7.7) |
| | 2737 | 29674 | 10207 | |
| Radiology reports | 3.6 (2, 4) | 19.1 (7, 26) | 16.8 (6, 23) | 7.2 (6.1, 8) |
| | 3939 | 28614 | 6371 | |
| Discharge letters | 4.1 (2, 5) | 33.8 (14, 45) | 27.8 (13, 38) | 6.1 (5.5, 6.6) |
| | 3057 | 33458 | 6351 | |