Giving raw data a chance to talk: A demonstration of de-identified Pediatric Research Database (PRD) and exploratory analysis techniques for possible research cohort discovery and identifiable high risk factors for readmission

Viangteeravat, Teeradache; Huang, Eunice Y; Wade, Grady

doi:10.1186/1471-2105-14-S17-A5

Volume 14 Supplement 17

Proceedings of the 12th Annual UT-ORNL-KBRIN Bioinformatics Summit 2013

Meeting abstract
Open access
Published: 22 October 2013

Giving raw data a chance to talk: A demonstration of de-identified Pediatric Research Database (PRD) and exploratory analysis techniques for possible research cohort discovery and identifiable high risk factors for readmission

Teeradache Viangteeravat^1,2,
Eunice Y Huang^1,2 &
Grady Wade³

BMC Bioinformatics volume 14, Article number: A5 (2013) Cite this article

1799 Accesses
2 Citations
Metrics details

Background

Secondary use of large and open data sets provides researchers with an opportunity to address high impact questions that would otherwise be prohibitively expensive and time consuming. In spite of the data availability, often generating hypotheses from huge data sets is challenging, and lack of complex analysis of data might lead to weak hypotheses.

Materials and methods

To overcome these issues and to assist researchers in building hypotheses from raw data, we are working on a methodology and informatics resource called the PRD, an acronym for “Pediatric Research Database.” The PRD is a de-identified database designed to make secondary use of rich data sources, i. e., electronic medical records (EMR). The development of visual analytics [1, 2] makes the process of data elaboration, information gathering, knowledge generation, and complex information exploration transparent to tool users and provides researchers with the ability to sort and filter by various criteria, which can lead to strong, novel hypotheses. This database allows researchers to query large patient populations to identify small subsets based on certain inclusion and exclusion criteria. This not only permits users to detect expected events, such as might be predicted by models, but also helps users discover the unexpected – surprising anomalies, changes, patterns and relationships that are then examined and assessed to develop new insights. Only de-identified data are available from PRD to researchers. We maintain the identified data in a separate HIPAA class server room with very limited access. Individual-level data cannot be accessible without appropriate IRB approval. All potential re-identification attempts are protected by following the best practice de-identification process. All patients’ medical record number is replaced by an arbitrarily generated sequence number in order to prevent re-identification issues. To further protect patient re-identification, all patient count that is less than 10 would return “less than 10 patients” without any information. The outcome goal of the PRD is to facilitate clinical research and improve the health of children.

References

Kamel Boulos M, Viangteeravat T, Anyanwu MN, Ra Nagisetty V, Kuscu E: Web GIS in practice IX: a demonstration of geospatial visual analytics using Microsoft Live Labs Pivot technology and WHO mortality data. Int J Health Geogr. 2011, 10: 19-10.1186/1476-072X-10-19.
Article PubMed Central PubMed Google Scholar
Flake G: Is Pivot a turning point for Web exploration?. (http://TED.comvideo-Feb/Mar2010 ), [http://www.ted.com/talks/gary_flake_is_pivot_a_turning_point_for_web_exploration.html]

Download references

Acknowledgement

The authors thank the UTHSC Department of ITS Computing Systems and Office of Biomedical Informatics for use of informatics resources and collaboration.

Author information

Authors and Affiliations

Biomedical Informatics Core, Children’s Foundation Research Institute, Memphis, TN, 38103, USA
Teeradache Viangteeravat & Eunice Y Huang
Department of Pediatrics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA
Teeradache Viangteeravat & Eunice Y Huang
Decision Support, Le Bonheur Children’s Hospital, Memphis, TN, 38103, USA
Grady Wade

Authors

Teeradache Viangteeravat
View author publications
You can also search for this author in PubMed Google Scholar
Eunice Y Huang
View author publications
You can also search for this author in PubMed Google Scholar
Grady Wade
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Teeradache Viangteeravat.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Viangteeravat, T., Huang, E.Y. & Wade, G. Giving raw data a chance to talk: A demonstration of de-identified Pediatric Research Database (PRD) and exploratory analysis techniques for possible research cohort discovery and identifiable high risk factors for readmission. BMC Bioinformatics 14 (Suppl 17), A5 (2013). https://doi.org/10.1186/1471-2105-14-S17-A5

Download citation

Published: 22 October 2013
DOI: https://doi.org/10.1186/1471-2105-14-S17-A5

Proceedings of the 12th Annual UT-ORNL-KBRIN Bioinformatics Summit 2013

Giving raw data a chance to talk: A demonstration of de-identified Pediatric Research Database (PRD) and exploratory analysis techniques for possible research cohort discovery and identifiable high risk factors for readmission

Background

Materials and methods

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

Proceedings of the 12th Annual UT-ORNL-KBRIN Bioinformatics Summit 2013

Giving raw data a chance to talk: A demonstration of de-identified Pediatric Research Database (PRD) and exploratory analysis techniques for possible research cohort discovery and identifiable high risk factors for readmission

Background

Materials and methods

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us