overview. what is annotation? annotation is the process of determining the location and function of...
TRANSCRIPT
![Page 1: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/1.jpg)
Overview
![Page 2: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/2.jpg)
What is Annotation?
Annotation is the process of determining the location and function of all identifiable genes in a genome.
Annotation is an important part of bioinformatics
• whole-genome shotgun sequencing provides the raw material
• annotation provides an interpretation of the sequencing results
![Page 3: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/3.jpg)
Figure 1 from Stothard & Wishart (2006) Automated bacterial genome analysis and annotation. Current Opinion in Microbiology 9: 505-510.
1. Verify predicted function basedon amino acid sequence homology
2 Predict protein structure and localization
1. Find start and stop codons – separated by 800-900 bp?
2. Find Shine-Dalgarno sequence (RBS)– upstream of start codon?
3. Find core promoter – consensus sequences for -10 & -35?
4. Find rho-independent terminator
5. Predict whether the gene could beorganized into an operon– compare chromosomal neighborhood
![Page 4: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/4.jpg)
What will we be doing?
Verifying ORF calls Verifying function based on sequence conservation
Verifying function based on structural conservation
Verifying function based on localization data
(insert image of E. coli lac permease)
Insert Figure 8-40 fromMicrobiology – An Evolving Science
© 2009 W.W. Norton & Company, Inc.
![Page 5: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/5.jpg)
Why manually annotate?
• Automated annotations tend to over-predict….produce many false-positives
• Automated annotations also miss things….
• Accuracy of any annotation is only as good as the quality of annotated genes in reference databases
• High sequencing error rates. . .
A curated, finished genome has gene callsverified & proteins organized into pathways
![Page 6: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/6.jpg)
Undergraduates provide “human expertise”
GOAL: Demonstrate that student annotationscan be accurate, up-to-date, reliable, and useful to scientific community!
Possible solutions?
Reference paper:Genome re-annotation: a wiki solution?by Steven SaltzbergGenome Biology (2007), 8:102
![Page 7: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/7.jpg)
What is imgACT?
- Web portal to access genome database, img/edu
- Contains wiki-based Lab Notebook & Report Page for organizing annotation data
http://img-act.jgi-psf.org/user/login
![Page 8: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/8.jpg)
http://imgweb.jgi-psf.org/cgi-bin/img_edu_v260/main.cgi
What is img/edu?
- Simplified database for undergraduate genome annotation
- Features and functions similar to that found in IMG
- Directly linked to imgACT
Click!
IMG companionsystem
![Page 9: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/9.jpg)
http://img.jgi.doe.gov/cgi-bin/pub/main.cgi
What is IMG?
INTEGRATED MICROBIAL GENOMES (IMG)
- Database managed by the U.S. Department of Energy (DOE)Joint Genome Institute (JGI)
- JGI currently producing ~ 22% of the reported number of bacterial genome projects worldwide
- Key mission of IMG is to provide a data management platform that supports comprehensive analysis and annotation of all publicly available genomes in a comparative genomics context
![Page 10: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/10.jpg)
What are we annotating?
(insert information about organism including location/map of collection site,image and description of organism, etc.)
![Page 11: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/11.jpg)
Why annotate a GEBA organism?
Phylogenetic tree Phylogenetic tree of of BacteriaBacteria showing showing
established & established & candidate phylacandidate phyla
Insert Figure 1 from Handelsman (2004) Microbiol. Mol. Biol. Rev. 68: 669-685.
Note that genome sequences from members of those phyla in yellow and orange are under-represented relative to those in red
GEBA (Genomic Encyclopedia of Bacteria and Archaea) goal is to sequence genomes from under- represented phyla
![Page 12: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/12.jpg)
What is our goal?
Insert Figure 2 from Scott KM et al. (2006) The Genome of Deep-Sea Vent Chemolithoautotroph Thiomicrospira crunogena XCL-2. PLoS Biology, 4: 2196
Annotate genes in pathways & complexes
![Page 13: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/13.jpg)
Student Goals: Conceptual
• Apply basic concepts in biochemistry, microbial physiology & ecology, and evolutionary biology
• Question basic assumptions about biochemistry, physiology and evolution
• Understand the power and limitations of bioinformatics
![Page 14: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/14.jpg)
• Proficiently use multiple database analysis software packages
• Strengthen web-based library search skills (Pubmed)
• Develop skills creating hypotheses and designing experiments to test them
• Sharpen skills in analysis, synthesis and presentation of results and data interpretation
• Experience the collaborative nature of science
Student Goals: Technical
![Page 15: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/15.jpg)
• Each team will annotate genes encoding enzymes in a metabolic pathway or components of a cellular complex in [insert organism name]
• Your T.A. or instructor will tell you specific assignments
• Consult KEGG map and use orthologous gene in other related organisms to query the genome of [insert organism name] in IMG/EDU database
• For best “hit”, complete the corresponding modules of imgACT lab notebook and lab report for that gene
• Complete the module(s) presented each week. The imgACT online notebook & report for Modules #1 – 8 must be finished for all genes assigned (3 per student).
Annotation Project
![Page 16: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/16.jpg)
Assignments
• Online notebook checks end of weeks:
• Final Report due dates:
Annotation Project
![Page 17: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/17.jpg)
Click “Create an account”
http://img-act.jgi-psf.org/user/login
How do we get started?
![Page 18: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/18.jpg)
Email address
First Name
Pick something youcan remember
Specific for our class
Click “Register” once information entered
Register for an img-act account
Last Name
Noabbreviationsor nicknames
xxxxxxxxxxxxxxxxxxxxxxxxxx
![Page 19: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/19.jpg)
Once registration complete, log in to imgACT
![Page 20: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/20.jpg)
What you should see. . .
If you can’t get this far, tell your instructor immediately!
Winter 2010
![Page 21: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/21.jpg)
Next, take pre-annotation surveyCookies must be enabled for survey to work properly.
![Page 22: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/22.jpg)
What next? Practice!
Explore the Explore the imgACTimgACT web portal web portal
• All students will be assigned at least one gene, which should be used to navigate through the imgACT online lab notebook (Modules #1 – 8) and the lab report
• Note that students are not responsible for annotating this gene. It may be used to help students get used to navigating the web portal.
“Practice gene”
![Page 23: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/23.jpg)
click
![Page 24: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/24.jpg)
click
imgACT Lab Notebook
The first time you log in to Lab Notebook, you will also need to log in to the wiki.Use the same username & password as created for imgACT account.
![Page 25: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/25.jpg)
imgACT Lab Notebook
Only responsible for Modules #1 – 8 in this class
![Page 26: Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is](https://reader035.vdocuments.net/reader035/viewer/2022062802/56649e985503460f94b9b5db/html5/thumbnails/26.jpg)
imgACT Lab Report
Correspond tomodules in
Lab Notebook
To be completedat end of the quarter