Extracting Genetic Pathways From Text and Grounding at the Spatio-Temporal Level

Sinclair, Gail; Webber, Bonnie; Davidson, Duncan

doi:10.1186/1471-2105-6-S3-P26

Volume 6 Supplement 3

BioSysBio: Bioinformatics and Systems Biology Conference

Poster presentation
Open access
Published: 21 September 2005

Extracting Genetic Pathways From Text and Grounding at the Spatio-Temporal Level

Gail Sinclair¹,
Bonnie Webber¹ &
Duncan Davidson¹

BMC Bioinformatics volume 6, Article number: P26 (2005) Cite this article

1707 Accesses
Metrics details

The molecular biology literature is believed to contain a wealth of information that has not yet made it into any structured database. Of particular current interest is information about genetic pathways, and there are many ongoing studies into automatically extracting such pathways from the molecular biology literature. In the area of development, however, pathways must be linked to changes in the developing tissues, usually described in terms of cellular processes and where they are happening. Thus our aim is to extract complementary information regarding the tissue location in which pathways are active, along with the biological process they are active in and the stage of embryonic development in which the process occurs.

Temporal information in developmental biology text has a rather different character than newswire (e.g. 5 pm, last year). With respect to murine developmental staging, there are at least two separate ways of explicitly specifying the developmental stage of the embryo – Theiler stages (TS), and days post coitum/embryonic day (d.p.c./E). These cannot be simply mapped to one another as can days, weeks and years. Stages can also be referred to implicitly, by the state of the embryo or the processes currently taking place within it, e.g. tubulogenesis = circa TS20 to birth.

To start, we have collected a corpus of articles about one aspect of kidney development, annotating instances of information about specific gene expression in tissues and about the processes involved. (Inter-annotator agreement on this gold standard is at 94%.) From deeper linguistic analysis of similar (automatically retrieved) sentences, the task at hand is in recognising how biologists write about sequential events and then adapting existing/formulating new natural language processing techniques to extract these events and relate them to each other. These techniques can then be evaluated by way of literature describing a different part of kidney development.

Author information

Authors and Affiliations

School of Informatics, University of Edinburgh, Edinburgh, EH9 3JZ, UK
Gail Sinclair, Bonnie Webber & Duncan Davidson

Authors

Gail Sinclair
View author publications
You can also search for this author in PubMed Google Scholar
Bonnie Webber
View author publications
You can also search for this author in PubMed Google Scholar
Duncan Davidson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gail Sinclair.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Sinclair, G., Webber, B. & Davidson, D. Extracting Genetic Pathways From Text and Grounding at the Spatio-Temporal Level. BMC Bioinformatics 6 (Suppl 3), P26 (2005). https://doi.org/10.1186/1471-2105-6-S3-P26

Download citation

Published: 21 September 2005
DOI: https://doi.org/10.1186/1471-2105-6-S3-P26

BioSysBio: Bioinformatics and Systems Biology Conference

Extracting Genetic Pathways From Text and Grounding at the Spatio-Temporal Level

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

BioSysBio: Bioinformatics and Systems Biology Conference

Extracting Genetic Pathways From Text and Grounding at the Spatio-Temporal Level

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us