tropical geometry for biology

17
Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley

Upload: candice-emerson

Post on 30-Dec-2015

34 views

Category:

Documents


1 download

DESCRIPTION

Tropical Geometry for Biology. Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley. Tropical arithmetic Annotation is sequence labeling Annotation is important for biology Annotation is tropical arithmetic Tropical geometry Tree basics - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Tropical Geometry for Biology

Tropical Geometry for Biology

Lior Pachter and Bernd SturmfelsDepartment of Mathematics

U.C. Berkeley

Page 2: Tropical Geometry for Biology

Tropical arithmetic• Annotation is sequence labeling• Annotation is important for biology• Annotation is tropical arithmetic

Tropical geometry• Tree basics• Tree reconstruction is important for biology • Tree space is the tropical Grassmanian

Back to the data

Page 3: Tropical Geometry for Biology

What is annotation?

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.

INPUT: ..t..r…o..p..i..c..a..a..l...g..e..e..t..r..y..OUTPUT: ..t..r…o..p..i..c..a..a..l...g..e..e..t..r..y..

Annotation is the labeling of the input sequence,in this case with 3 colors:

ome

Page 4: Tropical Geometry for Biology

TAAT ATGTCCACGG TTGTACACGGCA G GTATTGAGGTATTGAG ATGTAAC TGAA

Input: TAATATGTCCACGGGTATTGAGCATTGTACACGGGGTATTGAGCATGTAATGAA

Biology example: gene annotation

Output:

Leucine

Page 5: Tropical Geometry for Biology

x

y

z

Best annotation for TAAT is obtained by evaluating

Example: assign “scores”, say x,y,z to each color regardless of letter

Finding a good annotationwith tropical arithmetic

Page 6: Tropical Geometry for Biology

Tropical arithmetic• Annotation is sequence labeling• Annotation is important for biology• Annotation is tropical arithmetic

Tropical geometry• Tree basics• Tree reconstruction is important for biology • Tree space is the tropical Grassmanian

Back to the data

Page 7: Tropical Geometry for Biology

What is a phylogenetic X-tree?

In Darwin’s exampleX = {A,B,C,D,1}

Page 8: Tropical Geometry for Biology

Tree basics1 3

2 4

1 2

3 4

1 2

4 3

In general, the number of trees is the Schröder number(2n-5)!! = (2n-5)*(2n-7)*… 3*1

12

34

0.1

0.2

0.40.2

0.3

Page 9: Tropical Geometry for Biology

Data

Page 10: Tropical Geometry for Biology

Metrics and trees

[ dij ]Distance between species i and j

Page 11: Tropical Geometry for Biology

A primate tree from genome sequences

Page 12: Tropical Geometry for Biology

Tree space is the tropical Grassmanian

Page 13: Tropical Geometry for Biology

Example: X={1,2,3,4,5}

31

2

4 5

Page 14: Tropical Geometry for Biology

Back to the data

Page 15: Tropical Geometry for Biology
Page 16: Tropical Geometry for Biology

Alignment

Phylogeny

AnnotationMulti HMM Generalized HMM

Tree Markov models

GeneralizedMulti HMM

Evol. HMM Generalized hidden MarkovPhylogeny

Graphical Models

Final message: Tropical mathematics is important for comparative genomics.

Page 17: Tropical Geometry for Biology

For more on mathematics and tropical geometry (and combinatorics and algebra and statistics…):L. Pachter and B. Sturmfels, Tropical Geometry of Statistical Models, PNAS 101, 2004L. Pachter and B. Sturmfels, Parametric Inference for Biological Sequence Analysis, PNAS 101, 2004D. Speyer and B. Sturmfels, The Tropical Grassmanian, Advances in Geometry 4, 2004.L. Pachter and B. Sturmfels, Mathematics of Phylogenomics, arxiv math.ST/0409132, 2004.

and coming soon:

Book (to be published by Cambridge University Press)

Algebraic Statistics for Computational Biologyedited by Pachter and Sturmfels