Skip to main content

Table 2 ProteinNet summary statistics

From: ProteinNet: a standardized data set for machine learning of protein structure

Data set Cutoff date Structures Sequences
ProteinNet 7 2006/5/1 34,557 4,817,827
ProteinNet 8 2008/5/5 48,087 15,756,117
ProteinNet 9 2010/5/3 60,350 24,688,095
ProteinNet 10 2012/5/1 73,116 63,477,198
ProteinNet 11 2014/5/1 87,573 173,908,140
ProteinNet 12 2016/5/1 104,059 332,283,871
  1. Cutoff dates for inclusion of sequence and structure data, based on the start of prior CASP experiments, are shown along with the number of sequences and structures in each ProteinNet set. Numbers are for non-redundant entries