From: Speeding disease gene discovery by sequence based candidate prioritization
Feature | Median in control set | Median in disease set | Significance |
---|---|---|---|
Gene length | 19 k | 27 k | P < 0.001 |
cDNA length | 2,126 bp | 2,442 bp | P < 0.001 |
Protein length | 383 aa | 494 aa | P < 0.001 |
3' UTR length | 446 bp | 488 bp | P < 0.01 |
Exon number | 8 | 10 | P < 0.001 |
Distance to neighbouring gene | 46 kb | 52 kb | P < 0.01 |
Protein identity with BRH in mouse | 80% | 87% | P < 0.001 |
Gene encodes signal peptide | 17% | 35% | P < 0.0001 (calculated using the chi squared test) |
5' CpG islands | 12% | 16% | P < 0.028 (calculated using the chi squared test) |