From: A statistical approach for 5′ splice site prediction using short sequence motifs and without encoding sequence data