Sai Moturu
Introduction
• Current approaches to microarray data analysis
– Analysis of experimental data, followed by an a posteriori step in which biological information is incorporated to make inferences
• Integrative analysis technique in this paper
– Integrate gene annotation with expression data to discover intrinsic associations between both data sources based on co-occurrence patterns
Methods and Data
– Association Rules Discovery
– Gene expression data
– Gene annotation: Gene ontology categories, metabolic pathways and transcriptional regulators
– Applied to two previously studied experiments
Association Rules Discovery
– Antecedent -> Consequent (X -> Y)
– Measures of Quality
• Support: P(X ∪ Y)
• Confidence: P(Y|X) = P(X ∪ Y)/P(X)
• Improvement: Confidence/P(Y) = P(X ∪ Y)/(P(X)*P(Y))
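The three quality measures can be sketched in a few lines of Python. This is a minimal illustration, assuming each gene is represented as a set of items (annotations plus expression labels); the function name `rule_measures` and the toy gene data are hypothetical and do not come from the paper.

```python
def rule_measures(transactions, antecedent, consequent):
    """Return (support, confidence, improvement) for the rule X -> Y.

    Each transaction is a set of items. P(X ∪ Y) is read in the usual
    association-rule sense: the fraction of transactions that contain
    every item of X and every item of Y.
    """
    x = frozenset(antecedent)
    y = frozenset(consequent)
    n = len(transactions)
    n_x = sum(1 for t in transactions if x <= t)
    n_y = sum(1 for t in transactions if y <= t)
    n_xy = sum(1 for t in transactions if (x | y) <= t)

    support = n_xy / n
    confidence = n_xy / n_x if n_x else 0.0               # P(Y|X)
    improvement = confidence / (n_y / n) if n_y else 0.0  # confidence / P(Y)
    return support, confidence, improvement


# Toy data (made up for illustration): one item set per gene.
genes = [
    {"glycolysis", "t1:over"},
    {"glycolysis", "t1:over"},
    {"glycolysis"},
    {"t1:over"},
    {"other"},
]
support, confidence, improvement = rule_measures(genes, {"glycolysis"}, {"t1:over"})
```

An improvement above 1 means the consequent appears more often among genes matching the antecedent than among genes overall.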
Association Rules Discovery
– Itemsets
• Genes and the set of experiments in which the gene is over- or underexpressed
• Gene characteristics
– Constraint
• Antecedent needs to be gene annotation
– Expression Thresholds
• Genes with log expression values > 1 are overexpressed and < -1 are underexpressed (two-fold)
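The thresholding step and the construction of one itemset per gene can be sketched as follows. This is only an illustration under stated assumptions: the values are taken to be base-2 log ratios (so that ±1 corresponds to a two-fold change), and the item labels, function names, and example gene are hypothetical.

```python
def expression_items(log_ratios, threshold=1.0):
    """Turn a gene's log expression values into discrete items.

    log_ratios maps an experiment name to a log ratio (assumed base 2).
    Values strictly above +threshold become "<experiment>:over", values
    strictly below -threshold become "<experiment>:under"; the rest are
    dropped.
    """
    items = set()
    for experiment, value in log_ratios.items():
        if value > threshold:
            items.add(f"{experiment}:over")
        elif value < -threshold:
            items.add(f"{experiment}:under")
    return items


def build_itemset(annotations, log_ratios, threshold=1.0):
    """Combine a gene's annotation terms with its expression items."""
    return frozenset(annotations) | expression_items(log_ratios, threshold)


# Hypothetical gene with one annotation and three measurements.
example = build_itemset({"TCA cycle"}, {"t5": 0.4, "t6": 1.8, "t7": -1.2})
```

Keeping annotation items and expression items in the same set is what lets a rule miner relate the two; the constraint that the antecedent must be annotation can then be enforced when rules are generated.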
Mining Association Rules
– The association rules of interest have low support values and high confidence values
– A variant of the Apriori algorithm is used, which has previously helped in mining low-support, high-confidence, biologically significant patterns
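For orientation, the level-wise idea behind Apriori can be sketched as below. This is the plain textbook version, not the variant used in the paper (whose details are not given on the slides); the function name and toy data are hypothetical.

```python
from itertools import combinations


def apriori(transactions, min_count):
    """Return {itemset: count} for all itemsets found in >= min_count transactions."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_count}
    frequent = dict(current)

    k = 2
    while current:
        prev = list(current)
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_count}
        frequent.update(current)
        k += 1
    return frequent


# Toy data (made up): each set is one gene's items.
toy = [
    {"GO:a", "t1:over", "t2:over"},
    {"GO:a", "t1:over"},
    {"GO:a", "t2:over"},
    {"t1:over", "t2:over"},
    {"GO:a", "t1:over", "t2:over"},
]
frequent = apriori(toy, min_count=3)
```

Because interesting rules here have low support, the minimum count has to be set low, which is exactly the regime where the number of candidate itemsets grows quickly.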
Filtering
– A major drawback of association rules is the huge number of rules generated
– There is also redundancy among the rules
– Both problems are addressed with two filters
• Redundant filter
• Single antecedent filter
Diauxic shift dataset
– Gene expression accompanying the metabolic shift from fermentation to respiration that occurs when fermenting yeast cells exhaust the glucose in their medium
– Expression levels recorded at 7 time points
– External information
• Metabolic pathways
• Transcriptional regulators
Results
– Association rules among metabolic pathways and expression patterns
• 1126 out of over 6000 genes were annotated with at least one pathway
• Association rules with minimum support of 5, minimum confidence of 40% and minimum improvement of 1
• Redundant and single antecedent filters applied
• 21 association rules
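The threshold step can be sketched as a simple filter over candidate rules. This is an illustrative sketch only: the `Rule` class and the toy rules are hypothetical, support is assumed to be an absolute gene count, and the comparisons are assumed to be inclusive (>=), which the slides do not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    antecedent: frozenset
    consequent: frozenset
    support: int        # assumed: number of genes matching the whole rule
    confidence: float   # P(Y|X), as a fraction
    improvement: float  # confidence / P(Y)


def passes_thresholds(rule, min_support=5, min_confidence=0.40, min_improvement=1.0):
    """Keep a rule only if it meets all three minimum values."""
    return (
        rule.support >= min_support
        and rule.confidence >= min_confidence
        and rule.improvement >= min_improvement
    )


# Toy rules with made-up numbers, not results from the paper.
toy_rules = [
    Rule(frozenset({"pathway A"}), frozenset({"t7:over"}), 8, 0.62, 3.1),
    Rule(frozenset({"pathway B"}), frozenset({"t7:under"}), 3, 0.90, 4.0),  # support too low
    Rule(frozenset({"pathway C"}), frozenset({"t6:over"}), 12, 0.25, 1.4),  # confidence too low
]
kept = [r for r in toy_rules if passes_thresholds(r)]
```

The redundant and single antecedent filters would be applied after this step; they are not sketched here because the slides do not define them.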
Results
– Association rules among transcriptional regulators and expression patterns
• 3490 genes were annotated with at least one regulator
• Association rules with minimum support of 5, minimum confidence of 80% and minimum improvement of 1
• Redundant filter applied
• 28 association rules
Results
– Association rules among transcriptional regulators, metabolic pathways and expression patterns
• 3882 genes
• Association rules with minimum support of 5, minimum confidence of 80% and minimum improvement of 1
• Redundant filter applied
• 37 association rules
Serum stimulation dataset
– Gene expression program of human fibroblasts after serum exposure
– External information
• Gene ontology terms
Results
– Association rules among biological process annotation and expression patterns
• 4092 genes of over 8000
• Support of 4, min confidence of 10% and min improvement of 1
• Single antecedent and redundant filters applied
• 12 associations
Results
– Association rules among terms from all GO categories
• 4630 genes of over 8000
• Support of 4, min confidence of 10% and min improvement of 1
• Redundant filter applied
• 31 associations
Conclusions
– Some of the biological implications matched the ones found experimentally
– The others could be explored further
– Integrative data analysis is very useful for meaningful discoveries using gene expression data