
Table 2 Description of parallel corpora obtained from MEDLINE data

From: Combining MEDLINE and publisher data to create parallel corpora for the automatic translation of biomedical text

 

                      ENFR                    ENES
                      English     French      English     Spanish
Number of citations   14,815      14,817      3,371       3,371
Number of sentences   137,938     130,692     33,167      32,085
Number of words       2,699,851   2,863,638   676,092     760,863