bioinformatics at wsu
DESCRIPTION
Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG) . Bioinformatics at WSU. Biology. Computer Science. Statistics. What is Bioinformatics. The analysis of biological information using computers and statistical techniques. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/1.jpg)
Bioinformatics at WSU
Matt SettlesBioinformatics Core
Washington State University
Wednesday, April 23, 2008WSU Linux User Group (LUG)
![Page 2: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/2.jpg)
What is Bioinformatics
Computational Biologist
BioinformaticianBiostatistician
Biology
Computer Science
Statistics
The analysis of biological information using computers and statistical techniques
![Page 3: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/3.jpg)
Subfields of Bioinformatics
Sequence analysis Main areas: Sequence alignment and Sequence
databases Genome annotation
Main areas: Gene finding, Gene predicting Computational evolutionary biology
Main areas: Systematics, Phylogenetics Analysis of High-throughput data
Main areas: RNA microarrays, aCGH, Whole genome genotyping arrays.
Analysis of Whole Genome Sequencing Data Emerging Field
![Page 4: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/4.jpg)
Subfields of Bioinformatics
Comparative genomics How are species different and how are they the
same? Systems Biology
Networks of Networks (the golden goose!!) Quantitative Genetics Measuring Biodiversity Modeling biological systems High-throughput image analysis Analysis of protein expression Prediction of protein structure Protein-protein docking
![Page 5: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/5.jpg)
What does a Bioinformatician Do?
Works in an interdisciplinary team Design of experiments Data management, databases Analysis from start to finish Data integration, annotation, visualization
Software Development Visual tools Databases
Research New techniques for the storage and analysis of
biological data, both statistical and compuational
![Page 6: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/6.jpg)
What tools do we use
Software programs developed by others, GUI and command line Open Source preferably
Statistical Programming Languages/Environments R – programming environment
www.r-project.org www.bioconductor.org C like Interpreted language that acts similar to scheme, Full graphics capabilities C/python/perl interfaces
Software programs we ourselves develop
![Page 7: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/7.jpg)
Central dogma of molecular biology
Each gene is transcribed (at the appropriate time) from DNA into mRNA, which then leaves the nucleus and is translated into the required protein.
![Page 8: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/8.jpg)
Whole Genome Association Analysis
Whole Genome Genotyping Array Bovine (COW) 58,000 SNPs Illumina Beadarray Represents all 29 chromosomes, X chromosome
and the Unknown chromosome Samples
255 dairy cattle from 4 different heards 130 Control cattle (healthy) 125 Johne's positive cattle (sick)
14.8 MILLION DATA POINTS !!! Biological Question of interest
Is there a collection of SNPs that are associated with the disease Johne's?
![Page 9: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/9.jpg)
Analysis Outline
Read in and format data into something we can work with in R and plink.
Quality Assurance Toss samples that do not meet QA (7 samples) Toss SNPs that do not meet QA (8,935 SNPs)
Treat SNPs as independent and analyze each with a statistical model.
Correct for multiple testing Visualize results
![Page 10: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/10.jpg)
Results
10 regions were identified as being potentially interesting with a p < 0.001 multiple testing correction (permutation based)
![Page 11: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/11.jpg)
Next Step
Validate in the lab, the regions of interest. Perform multi-locus analysis, computer
cluster will be necessary here. Mine the data for additional information
![Page 12: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/12.jpg)
Job Position
Position with the Bioinformatics Core ~ 20 hours per week ~ $12-$15/hour Potential internship credit Description: Aid in the analysis of microarray
data, create analysis pipeline to be used by WSU researchers.
Required Skills: know how to code Bonuses: Possibility of publications!
![Page 13: Bioinformatics at WSU](https://reader035.vdocuments.net/reader035/viewer/2022081514/5681332a550346895d9a1a30/html5/thumbnails/13.jpg)
The END
QUESTIONS??