ontology based phenotype database and mining tool

1
Ontology Based Phenotype Database and Mining Tool Mary Shimoyama, Liz Worthey, Jennifer Smith, Rajni Nigam, Victoria Petri, Stan Laulederkind, Tim Lowry, Tom Hayman, Shur-Jen Wang, Jeff DePons, Pushkala Jayaraman, Marek Tutaj, Weisong Liu, Diane Munzenmaier, Melinda Dwinell, Simon Twigger, Howard Jacob Rat Genome Database, Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, Wisconsin Database Structure Data Retrieval and Visualization – Rat Data ABSTRACT: The strength of the rat as a model organism lies in its utility in pharmacology, biochemistry, and physiology research. Data resulting from such studies is difficult to represent in databases and the creation of user-friendly data mining tools has proven difficult. The Rat Genome Database has developed a comprehensive ontology-based data structure and annotation system to integrate phenotype data along data on samples, measurement methods and experimental conditions. RGD uses standard data formats and multiple ontologies to integrate phenotype data from large scale phenotype projects, QTL studies and other projects. The four major ontologies include the strain ontology, clinical measurement ontology, measurement method ontology and experimental conditions ontology. This has facilitated the development of a phenotype data mining tool which allows users to search for phenotype information using a series of filters based on clinical measurement type, strain, measurement method and/or experimental conditions. Users may start their query by choosing strain or clinical measurement or conditions and filter further as they move through the query. Results facilitate comparisons of phenotype readings for single or multiple strains, conditions or measurement methods allowing researchers to compare their own phenotype data with those from multiple studies. This new tool will assist researchers in choosing the best model strains for their studies. RGD’s phenotype database and data mining tool are also a further effort to directly link phenotype data with genomic and genetic variations. CLINICAL MEASUREMENT MEASUREMENT METHOD EXPERIMENTAL CONDITION STRAIN Ontologies Clinical Measurement CM Value CM Units Measurement Method Measurement Site Condition 1 Condition Value Condition 1 Duration Condition 1 Ordinality Condition 2 Condition 2 Value Condition 2 Duration Condition 2 Ordinality 1 Systolic blood pressure 160 mmHg Dinamap 1846 SX/P Blood pressure monitor, thigh cuff Right forearm sitting 2 Systolic blood pressure 145 mmHg Colin STBP-780 automated blood pressure monitor, adult cuff Right upper arm Stationary bicycle dynanometer 123 W – 60% workload 2 min 3 Systolic blood pressure 138 mmHg Colin STBP-780 automated blood pressure monitor, adult cuff Right upper arm Exercise training program – stationary bicycle 2 weeks – 3 X week for 30 min 1 Stationary bicycle dynanometer 123 W – 60% workload 2 min 2 Human Data Strain Se x No Anima ls in Sampl e Age Clinical Measurement CM Value CM Units Measurement Method Measurement Site Conditio n 1 Conditio n 1 Value Conditio n 1 Duration Condition 2 Conditio n 2 Value Condition 2 Duration SHRSP/ Ezo M 6 5- 10 wks Systolic blood pressure 222 mmHg Photoplethysmogra phy, tail cuff tail Standard conditio ns SHR/Kyo M 6 5- 10 wks Systolic blood pressure 154 mmHg Photoplethysmogra phy, tail cuff tail Standard conditio ns GH/ OmrMcwi M 4 9- 14 wks Mean arterial blood pressure 158.2 mmHg Fluid Filled Catheter Femoral artery Air oxygen content 21 % GH/ OmrMcwi F 7 9- 14 wks Mean arterial blood pressure 153.5 mmHg Fluid Filled Catheter Femoral artery Air carbon dioxide content 7 % 7 mins SS- 13BN/ Mcwi M 18 9- 14 wks Mean arterial blood pressure 143.3 mmHg Fluid Filled Catheter Femoral artery Sodium controll ed diet 0.4% Sodium controlle d diet 8% 17 days FHH/ EurMcwi F 8 9- 14 wks Mean arterial blood pressure 120 mmHg Fluid Filled Catheter Femoral artery Walking on treadmil l 0.8 m/min 3 mins Running on treadmill 1.6 m/min 3 mins The data structure provides for information on the study and experiment, reference and associated researcher as well as the necessary information to record all aspects of the experiment.

Upload: jennifer-smith

Post on 15-Jun-2015

311 views

Category:

Technology


5 download

TRANSCRIPT

Page 1: Ontology based phenotype database and mining tool

Ontology Based Phenotype Database and Mining ToolMary Shimoyama, Liz Worthey, Jennifer Smith, Rajni Nigam, Victoria Petri, Stan Laulederkind, Tim Lowry, Tom Hayman, Shur-Jen Wang,

Jeff DePons, Pushkala Jayaraman, Marek Tutaj, Weisong Liu, Diane Munzenmaier, Melinda Dwinell, Simon Twigger, Howard JacobRat Genome Database, Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, Wisconsin

Database Structure Data Retrieval and Visualization – Rat Data

ABSTRACT: The strength of the rat as a model organism lies in its utility in pharmacology, biochemistry, and physiology research. Data resulting from such studies is difficult to represent in databases and the creation of user-friendly data mining tools has proven difficult. The Rat Genome Database has developed a comprehensive ontology-based data structure and annotation system to integrate phenotype data along data on samples, measurement methods and experimental conditions. RGD uses standard data formats and multiple ontologies to integrate phenotype data from large scale phenotype projects, QTL studies and other projects. The four major ontologies include the strain ontology, clinical measurement ontology, measurement method ontology and experimental conditions ontology. This has facilitated the development of a phenotype data mining tool which allows users to search for phenotype information using a series of filters based on clinical measurement type, strain, measurement method and/or experimental conditions. Users may start their query by choosing strain or clinical measurement or conditions and filter further as they move through the query. Results facilitate comparisons of phenotype readings for single or multiple strains, conditions or measurement methods allowing researchers to compare their own phenotype data with those from multiple studies. This new tool will assist researchers in choosing the best model strains for their studies. RGD’s phenotype database and data mining tool are also a further effort to directly link phenotype data with genomic and genetic variations.

CLINICAL MEASUREMENT MEASUREMENT METHOD EXPERIMENTAL CONDITION

STRAIN

Ontologies

Clinical Measurement

CM Value

CM Units Measurement Method Measurement Site

Condition 1 Condition Value

Condition 1 Duration

Condition 1 Ordinality

Condition 2 Condition 2 Value

Condition 2 Duration

Condition 2 Ordinality

1 Systolic blood pressure

160 mmHg Dinamap 1846 SX/P Blood pressure monitor, thigh cuff

Right forearm sitting

2 Systolic blood pressure

145 mmHg Colin STBP-780 automated blood pressure monitor, adult cuff

Right upper arm Stationary bicycle dynanometer

123 W – 60% workload

2 min

3 Systolic blood pressure

138 mmHg Colin STBP-780 automated blood pressure monitor, adult cuff

Right upper arm Exercise training program – stationary bicycle

2 weeks – 3 X week for 30 min

1 Stationary bicycle dynanometer

123 W – 60% workload

2 min 2

Human Data

Strain Sex

No Animals in Sample

Age Clinical Measurement

CM Value

CM Units

Measurement Method Measurement Site

Condition 1

Condition 1 Value

Condition 1 Duration

Condition 2 Condition 2 Value

Condition 2 Duration

SHRSP/Ezo

M 6 5-10 wks

Systolic blood pressure

222 mmHg Photoplethysmography, tail cuff

tail Standard conditions

SHR/Kyo M 6 5-10 wks

Systolic blood pressure

154 mmHg Photoplethysmography, tail cuff

tail Standard conditions

GH/OmrMcwi

M 4 9-14 wks

Mean arterial blood pressure

158.2 mmHg Fluid Filled Catheter Femoral artery Air oxygen content

21 %

GH/OmrMcwi

F 7 9-14 wks

Mean arterial blood pressure

153.5 mmHg Fluid Filled Catheter Femoral artery Air carbon dioxide content

7 % 7 mins

SS-13BN/Mcwi

M 18 9-14 wks

Mean arterial blood pressure

143.3 mmHg Fluid Filled Catheter Femoral artery Sodium controlled diet

0.4% Sodium controlled diet

8% 17 days

FHH/EurMcwi

F 8 9-14 wks

Mean arterial blood pressure

120 mmHg Fluid Filled Catheter Femoral artery Walking on treadmill

0.8 m/min 3 mins Running on treadmill

1.6 m/min 3 mins

The data structure provides for information on the study and experiment, reference and associated researcher as well as the necessary information to record all aspects of the experiment.