Skip to main content

Advertisement

Springer Nature is making SARS-CoV-2 and COVID-19 research free. View research | View latest news | Sign up for updates

Sequence Analysis In Promoter Regions

A gene's promoter region is a prominent factor in determining that gene's expression networks and cycle. The significant elements which determine the promoter's effect are generally discovered through laboratory methods. This project's aim is to develop high throughput sequence analysis methods for identifying the important promoter features and classifying promoters according to the occurrence of these features.

Saccharomyces Cervisiae, as one of the first fully sequenced organisms which also has publicly available expression data, was chosen as a case study for this work. Motifs of over-represented sequence segments were sought using a bespoke implementation of the Smith-Waterman algorithm running on the Cambridge-Cranfield high performance computer facility. The algorithm identifies all pairs of sequence segments whose similarity surpasses a given threshold. The found segments are then grouped by similarity, and those groups with a many members are the motifs which are over-represented.

The motifs identified using this approach are compared to known transcription factor binding sites from the S. Cervisiae Promoter Database. Most of the motifs produced are expected to correspond with binding sites; others could be new discoveries. Having validated the method on S. Cervisiae, it can then be used with some confidence to analyse promoters from other species, whose characteristics are less well understood.

Author information

Correspondence to Jeremy Austin.

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Austin, J. Sequence Analysis In Promoter Regions. BMC Bioinformatics 6, P2 (2005). https://doi.org/10.1186/1471-2105-6-S3-P2

Download citation

Keywords

  • Gene Expression and Regulation
  • Algorithm Development