Skip to main content

Table 2 ProteinNet summary statistics

From: ProteinNet: a standardized data set for machine learning of protein structure

Data set

Cutoff date

Structures

Sequences

ProteinNet 7

2006/5/1

34,557

4,817,827

ProteinNet 8

2008/5/5

48,087

15,756,117

ProteinNet 9

2010/5/3

60,350

24,688,095

ProteinNet 10

2012/5/1

73,116

63,477,198

ProteinNet 11

2014/5/1

87,573

173,908,140

ProteinNet 12

2016/5/1

104,059

332,283,871

  1. Cutoff dates for inclusion of sequence and structure data, based on the start of prior CASP experiments, are shown along with the number of sequences and structures in each ProteinNet set. Numbers are for non-redundant entries