general introduction to the genome

76
General Introduction to the Genome

Upload: esma

Post on 25-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

General Introduction to the Genome. An Outlines. Molecular Biology Major Events DNA, RNA Protein Synthesis(Transcription & Translation) Genome Anatomy Bioinformatics Genomics Signal Processing. Molecular Biology Major Events DNA, RNA Protein Synthesis(Transcription & Translation) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: General Introduction to the Genome

General Introduction to the Genome

Page 2: General Introduction to the Genome

An Outlines• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy• Bioinformatics• Genomics Signal Processing

2

Page 3: General Introduction to the Genome

• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy• Bioinformatics• Genomics Signal Processing

3

Page 4: General Introduction to the Genome

Molecular Biology Major Events

DNA Discovery

1865Mendel

Inheritance is controlled by unit factors

1881

Chromosomes are composed of DNA

1869Johann Friedrich

Page 5: General Introduction to the Genome

Molecular Biology Major Events

1881

Chromosomes are composed of DNA

1911

Thomas Hunt

Genes on chromosomes are the discrete units of heredity

1941George Beadle

Identify that genes make proteins

Edward Tatum

Page 6: General Introduction to the Genome

The Central Dogma1

2

3

TargetBookBook shelvesNucleus

Page 7: General Introduction to the Genome

What is Life made of?

715

Page 8: General Introduction to the Genome

Eukaryotes vs Prokaryotes

8

DNA

DNA

Page 9: General Introduction to the Genome

Prokaryotes EukaryotesSingle cell Single or multi cell

No nucleus Nucleus

No organelles Organelles

One piece of circular DNA Chromosomes

No mRNA post transcriptional modification

Exons/Introns splicing

915

Page 10: General Introduction to the Genome

The Cell: Chemical Composition

–70% Water–7% Small molecules• Salts• Amino acids (Protein)• Nucleotides (DNA, RAN)

–23% macromolecules• Proteins• Polysaccharides• Lipids

10

Page 11: General Introduction to the Genome

The Cell: The 3 Critical Molecules

DNA

Hold Genetic information Transfer Information

Synthesize Protein

PROTEIN

Form enzymesForm body’s components

RNA

m-RNA t-RNAr-RNA

Page 12: General Introduction to the Genome

• Molecular Biology Major Events

• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy• Bioinformatics• Genomics Signal Processing

12

Page 13: General Introduction to the Genome

DNA: the Nucleotide

13

Phosphate

Sugar

Nitrogenous base

A

Page 14: General Introduction to the Genome

DNA: Nitrogenous base

14

Purines Pyrimidines

A TG C

Page 15: General Introduction to the Genome

DNA: Polymerization reaction

A T G C

5 P’ 3OH’

A T G C A T G C

5 3

Page 16: General Introduction to the Genome

DNA: hydrogn bounds

A C G T A C G T

ACG

TACG

T

No of base pairs= Genome SizeHG= 3200 Mbp (Mb)

Page 17: General Introduction to the Genome

AC

GT

AC

GTA

CG

TA

CG

T

Sugar- Phosphate Back bone

DNA: Watson - Crick Model 1951

Page 18: General Introduction to the Genome

DNA: Watson - Crick Model

Sugar- Phosphate Back bone

No of base pairs= Genome SizeHG= 3200 Mbp (Mb)

Page 19: General Introduction to the Genome

RNA versus DNA

19

Phosphate

Sugar "Ribose”

Nitrogenous base

Phosphate

Sugar” deoxyRibose”

Nitrogenous base

G, A ,C,T G, A ,C,U

Page 20: General Introduction to the Genome

Protein structure

• 1902 - Emil Hermann Fischer wins Nobel prize: showed amino acids are linked and form proteins

20

A AFNG

GS T

SD

K

Page 21: General Introduction to the Genome

Amino acid: Basic unit of proteinAmino acid: Basic unit of protein

COO-NH3+ C

R

HAn amino

acid

Different side chains, R, determine the properties of 20 amino acids.

Amino group Carboxylic acid group

21

Page 22: General Introduction to the Genome

22

Page 23: General Introduction to the Genome

Protein structure

• Primary structure

• Secondary structure

• Super-secondary structure

• Tertiary structure

• Quaternary structure

Page 24: General Introduction to the Genome

Protein Structure: Predication Problem

Protein sequence

Protein 3D structure

Protein Function

A FNG S T

Page 25: General Introduction to the Genome

The Central Dogma:Genes is protein’s blueprint, Genes is protein’s blueprint,

Gene

GenomeDNA

Protein

Gene GeneGene

Gene

GeneGeneGeneGene

GeneGeneGeneGene

GeneGene

Protein Protein

ProteinProtein

Protein

ProteinProtein

Protein

Protein

Protein

Protein

ProteinProtein

Protein

Page 26: General Introduction to the Genome

• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy• Bioinformatics• Genomics Signal Processing

26

Page 27: General Introduction to the Genome

Protein Synthesis: DNA, RNA, and the Flow of Information

TranslationTranscription

Replication

27

Page 28: General Introduction to the Genome

Protein Synthesis: Gene Expression

28

Page 29: General Introduction to the Genome

Gene 2

Gen

e 1

Pre-mRNA

mRNAGen

e 3

Transcription

1

2

3

Translation

Splicing

1

2

3

Page 30: General Introduction to the Genome

Gene 2

Gen

e 1

Pre-mRNA

mRNAGen

e 3

Transcription

1

2

3

Translation

Alternative Splicing

1

3

2

Page 31: General Introduction to the Genome

Gene 2

Gen

e 1

Pre-mRNA

mRNAGen

e 3

Transcription

1

2

3

Translation

m-RNA Editing

1

2

3

Page 32: General Introduction to the Genome

32

Page 33: General Introduction to the Genome

Gene 2

Gen

e 1

Pre-mRNA

mRNAGen

e 31

2

3

Translation

AUGAUAACUAG

MS

AK

Start Codon

Stop Codon

CV

Page 34: General Introduction to the Genome

Protein Synthesis: The Genetic Code

34

Start

Stop

Page 35: General Introduction to the Genome

Gene 1

R Ge

ne 1

1

2

3

1

2

3

Gene Regulation

Regulatory protein

Page 36: General Introduction to the Genome

Gene Regulation

Regulatory protein Gene 1

Gene 1 Gene 2

Regulatory protein Gene 2

We have a little knowledge about regulatory mechanisms

Page 37: General Introduction to the Genome

What a big Genome Size?

• The 12 font size enables approximately 60 nucleotides of DNA sequence to be written in a line 10 cm in length.

• Genome size = total number of nucleotide base pairs.– typically in millions of base pairs, or megabases

[abbreviated Mb or Mbp])

37

Page 38: General Introduction to the Genome

• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)

• Genome Anatomy• Bioinformatics• Genomics Signal Processing

38

Page 39: General Introduction to the Genome

the human genome sequence would stretch for 5000 km, the distance from Montreal to London, Los Angeles to Panama, Tokyo to Calcutta, Cape Town to Addis Ababa, or Auckland to Perth

The sequence would fill about 3000 books the size of book 600 pages size.

39

Page 40: General Introduction to the Genome

Genome size of organism are different

40

Page 41: General Introduction to the Genome

41

Genome size is not good indicator for genes number

Page 42: General Introduction to the Genome

42

• Space is saved in the genomes of less complex organisms because the genes are more closely packed together.

Page 43: General Introduction to the Genome

C-value paradox

• Correlation between the complexity of an organism and the size of its genome was looked on as a bit of a puzzle.

43

Page 44: General Introduction to the Genome

Genome Anatomy

Gene 1

Gene 6Gene 5

Gene 4

Gene 2

Gene 3

Page 45: General Introduction to the Genome

Human Genome Anatomy

Human genome Nuclear genome Mitochondrial genome

45

Page 46: General Introduction to the Genome

Human Mitochondrial Genome Anatomy

46

• it is much smaller than the nuclear genome(~17 kB), and it contains just 37 genes.

• 13 code proteins and 24 specify non-coding RNA.

• do not contain intron.• is typical of the

mitochondrial genomes of other animals

Page 47: General Introduction to the Genome

47

Page 48: General Introduction to the Genome

Nuclear Human Genome Anatomy

48

62%

Page 49: General Introduction to the Genome

Nuclear Human Genome Anatomy: Protein Coding Genes

Page 50: General Introduction to the Genome

Nuclear Human Genome Anatomy: Protein Coding Genes

50

five exons, separated by four introns.

average exons= nine exons per gene

Page 51: General Introduction to the Genome

51

Two gene segments (V28 and V29-1)

Page 52: General Introduction to the Genome

Nuclear Human Genome Anatomy: pseudogene

52

Non functional genes

Page 53: General Introduction to the Genome

Nuclear Human Genome Anatomy: genome-wide repeat

Page 54: General Introduction to the Genome

Nuclear Human Genome Anatomy: genome-wide repeat

•Tandemly repeated DNA•Minisatellite DNA•Microsatellite DNA

•Interspersed genome-wide repeats•SINE•LINES•LTR•DNA transposons

54

Page 55: General Introduction to the Genome

Nuclear Human Genome Anatomy: genome-wide repeat Minisatellite DNA

• we are familiar with because of its association with structural features of chromosomes.

• Telomeric DNA, which in humans comprises hundreds of copies of the motif 5 -TTAGGG-3 .′ ′

55

TTAGGGAATCCC

TTAGGGAATCCC

TTAGGGAATCCC

………………………..………………………..

Page 56: General Introduction to the Genome

The content of the human nuclear genome: genome-wide repeat Microsatellite DNA

• microsatellites with a CA repeat, such as:

make up 0.25% of the genome, 8 Mb in all. • Single base-pair repeats such as:

make up another 0.15%.

56

Page 57: General Introduction to the Genome

Nuclear Human Genome Anatomy: genome-wide repeat Interspersed repeat

57

Page 58: General Introduction to the Genome

Gene Classification: Gene function

• This system has the advantage that the fairly broad functional categories used in can be further subdivided to produce a hierarchy of increasingly specific functional descriptions for smaller and smaller sets of genes.

• The weakness : functions have not yet been assigned to

many eukaryotic genes.

58

Page 59: General Introduction to the Genome

Gene Classification: Gene function

• The gene catalog couldn’t tell us why we are human?

• it may still not be possible simply from genome comparisons with the chimpanzee genome to determine what makes us human

59

Page 60: General Introduction to the Genome

Gene Classification: Gene function

• The major categories of protein coding genes represent the most studied areas of cell biology, which means that many of the relevant genes can be recognized because their protein products are known.

• Genes whose products have not yet been identified are more likely to be involved in the less well studied areas of cellular activity.

60

Page 61: General Introduction to the Genome

Gene classification: Protein Domain

• A more powerful method is to base the classification not on the functions of genes but on the structures of the proteins that they specify.

• A protein molecule is constructed from a series of domains, each of which has a particular biochemical function.

61

Page 62: General Introduction to the Genome

Gene classification: Protein Domain

62

Page 63: General Introduction to the Genome

• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy

• Bioinformatics• Genomics Signal Processing

63

Page 64: General Introduction to the Genome

What is Bioinformatics?

• Integration of computational and biological methodsto convert biological information into general theories.

64

aatgcatgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatcctgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcatgcgg

aatgcatgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatcctgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcatgcgg

Page 65: General Introduction to the Genome

Bioinformatics

Statistics

BiologyComputer Science

Chemistry

Data structuresSoftware engineering

(C, C++,PERL)Cell structure

Genome, genesDNA, RNA

Protein structureMolecular bounds

Markof ModelNeural Network

65

Page 66: General Introduction to the Genome

Bioinformatics Subareas

• The subareas within bioinformatics include Genomics and Proteomics.

66

Genome comparisonevolutionary tree

Microarray AnalysisGene predicationGene classificationGene regulation

Protein 3D predicationProtein protein interactionProtein alignment

Page 67: General Introduction to the Genome

• Molecular Biology Major Events• DNA, RNA • Protein Synthesis(Transcription & Translation)• Genome Anatomy• Bioinformatics

• Genomics Signal Processing

67

Page 68: General Introduction to the Genome

What is GSP?

aatgcatgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatcctgcggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgcggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgctaatgcatgcggctatgctaagctcatgcggctatgctaagctgg

Analysis Processing

Using Theory and Methods of Signal Processing

To gain global understanding of Genome.

Page 69: General Introduction to the Genome

GSP Labs

• The Genomic Signal Processing Laboratory at

Texas A&M University.• The Computational Biology

Division of the Translational Genomics

Research Institute in Phoenix, Arizona.

Edward R. DoughertyTo model Genomic Regulatory Mechanisms for the purposes of diagnosis and therapy.

Page 70: General Introduction to the Genome

GSP Labs

• Columbia's Genomic Information Systems Laboratory

at Columbia University

Dimitris Anastassiou

Page 71: General Introduction to the Genome

GSP Labs

• DSP Group, Department of Electrical Engineering, California Institute of Technology

P. P. Vaidyanathan

Page 72: General Introduction to the Genome

Mapping Character String to Numerical Sequences

AAAATTTTCCCGGGTAGCTTTCCCGGGT

0001110101010101111111111000

Page 73: General Introduction to the Genome

Research Area of GSP

• Gene Predication• Genes Predication– Hidden Markov Models (HMM)– Fourier Transform– Wavelet Transform• Resonant Recognition Model (RRM)To identify the common hot spots of many protein

molecules using Fourier transform methods.•

Page 74: General Introduction to the Genome

References

• http://biology.ucok.edu/bidlack/biology/notes.htm

• http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=genomes

• http://www.estrellamountain.edu/faculty/farabee/biobk/biobooktoc.html

• http://www.werathah.com/• http://lectures.molgen.mpg.de/

online_lectures.html

74

Page 75: General Introduction to the Genome

References

• http://www.biology.lsu.edu/webfac/jmoroney/BIOL3090/

75

Page 76: General Introduction to the Genome

THANKYOU FOR YOUR

ATTENATION