building interpreters

55
Building Interpreters Mooly Sagiv html://www.cs.tau.ac.il/~msagiv/courses/ wcc13.html Chapter 4 1

Upload: tirza

Post on 04-Jan-2016

58 views

Category:

Documents


1 download

DESCRIPTION

Building Interpreters. Mooly Sagiv html://www.cs.tau.ac.il/~msagiv/courses/wcc13.html. Chapter 4. Symbol Table. Structure of a simple compiler/interpreter. Runtime System Design. Lexical analysis. Code generation. Intermediate code (AST). Syntax analysis. Machine dependent. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Building Interpreters

Building Interpreters

Mooly Sagiv

html://www.cs.tau.ac.il/~msagiv/courses/wcc13.html

Chapter 4

1

Page 2: Building Interpreters

Structure of a simple compiler/interpreter

Lexical

analysis

Syntax

analysis

Context

analysis

Intermediate code

(AST)

Code

generation

Interpretation

Symbol Table

Runtime System

Design

PL dependent PL+pardigm dependent

Machine dependent

2

Page 3: Building Interpreters

Types of Interpreters

• Recursive– Recursively traverse the tree– Uniform data representation– Conceptually clean– Excellent error detection– 1000x slower than compiler

• Iterative– Closer to CPU– One flat loop– Explicit stack– Good error detection– 30x slower than compiler– Can invoke compiler on code fragments

3

Page 4: Building Interpreters

Input language (Overview)

• Fully parameterized expressions

• Arguments can be a single digit

expression digit | ‘(‘ expression operator expression ‘)’

operator ‘+’ | ‘*’

digit ‘0’ | ‘1’ | ‘2’ | ‘3’ | ‘4’ | ‘5’ | ‘6’ | ‘7’ | ‘8’ | ‘9’

4

Page 5: Building Interpreters

#include "parser.h"

#include "backend.h"static int Interpret_expression(Expression *expr) { switch (expr->type) { case 'D': return expr->value; break; case 'P': { int e_left = Interpret_expression(expr->left); int e_right = Interpret_expression(expr->right); switch (expr->oper) { case '+': return e_left + e_right; case '*': return e_left * e_right; }} break; }}void Process(AST_node *icode) { printf("%d\n", Interpret_expression(icode));}

5

Page 6: Building Interpreters

AST for (2 * ((3*4)+9))

P

*

oper

typeleft right

P

+

P

*

D

2

D

9

D

4

D

3 6

Page 7: Building Interpreters

Uniform self-identifying data representation

• The types of the sizes of program data values are not known when the interpreter is written

• Uniform representation of data types– Type– Size

• The value is a pointer

7

Page 8: Building Interpreters

Example: Complex Number

3.0

4.0

re:

im:

8

Page 9: Building Interpreters

9

Page 10: Building Interpreters

Status Indicator

• Direct control flow of the interpreter

• Possible values– Normal mode– Errors– Jumps– Exceptions– Return

10

Page 11: Building Interpreters

Example: Interpreting C Return

PROCEDURE Elaborate return with expression statement (RWE node):

SET Result To Evaluate expression (RWE node . expression);

IF Status . mode /= Normal mode: Return mode;

SET Status . mode To Return mode;

SET Status . value TO Result;

11

Page 12: Building Interpreters

Interpreting If-Statement

12

Page 13: Building Interpreters

Symbol table

• Stores content of variables, named constants, …• For every variable V of type T

– A pointer to the name of V

– The file name and the line it is declared

– Kind of declaration

– A pointer to T

– A pointer to newly allocated space

– Initialization bit

– Language dependent information (e.g. scope)

13

Page 14: Building Interpreters

Summary Recursive Interpreters

• Can be implemented quickly– Debug the programming language

• Not good for heavy-duty interpreter– Slow– Can employ general techniques to speed the

recursive interpreter• Memoization• Tail call elimination• Partial evaluation

14

Page 15: Building Interpreters

Memoization

int fib(int n) { if (n == 0) return 0 ; if (n==1) return 1; return fib(n-1) + fib(n-2) ; }

int sfib[100] = {-1, -1, …, -1}int fib(int n) { if (sfib[n] > 0) return sfib[n]; if (n == 0) return 0 ; if (n==1) return 1; sfib[n] = fib(n-1) + fib(n-2) ; return sfib[n]; }

15

Page 16: Building Interpreters

Tail Call Elimination

void a(…) { … b(); }void b(){code;}

void a(…) { … code; }void b(){code;}

16

Page 17: Building Interpreters

Tail Call Elimination

void a(int n) { code if (n > 0) a(n-1); }

void a(int n) { loop: code if (n > 0) { n = n -1 ; goto loop }

17

Page 18: Building Interpreters

Partial Evaluation

• Partially interpret static parts in a program

• Generates an equivalent program

Partial EvaluatorProgram Program’

Input 1Input 2

18

Page 19: Building Interpreters

Example

int pow(int n, int e)

{

if (e==0)

return 1;

else return n * pow(n, e-1);

}

e=4

int pow4(int n)

{

return n * n * n *n;

}

19

Page 20: Building Interpreters

Example2

Bool match(string, regexp)

{

switch(regexp) {

….

}

}

regexp=a b* 20

Page 21: Building Interpreters

Partial Evaluation Generalizes Compilation

Partial EvaluatorInterpreter Program

AST ProgramInput

21

Page 22: Building Interpreters

But ….

22

Page 23: Building Interpreters

Iterative Interpretation

• Closed to CPU

• One flat loop with one big case statement

• Use explicit stack– Intermediate results– Local variables

• Requires fully annotated threaded AST– Active-node-pointer (interpreted node)

23

Page 24: Building Interpreters

Demo Compiler

24

Page 25: Building Interpreters

Threaded AST• Annotated AST• Every node is connected to the immediate

successor in the execution• Control flow graph

– Nodes• Basic execution units

– expressions– assignments

– Edges• Transfer of control

– sequential– while– …

25

Page 26: Building Interpreters

Threaded AST for (2 * ((3*4)+9))

P

*

oper

typeleft right

P

+

P

*

D

2

D

9

D

4

D

3

Dummy_node Start

26

Page 27: Building Interpreters

Demo Compiler

27

Page 28: Building Interpreters

C Examplewhile ((x > 0) && (x < 10))

{

x = x + y ;

y = y – 1 ;

}

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

T

exitF

28

Page 29: Building Interpreters

Threading the AST(3.2.1)

• One preorder AST pass

• Every type of AST has its threading routine

• Maintains Last node pointer – Global variable

• Set successor of Last pointer when node is visited

29

Page 30: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

30

Page 31: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

31

Page 32: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

32

Page 33: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

33

Page 34: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

34

Page 35: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

35

Page 36: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

36

Page 37: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

37

Page 38: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

T

38

Page 39: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

T

39

Page 40: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

T

40

Page 41: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

T

41

Page 42: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointer

main

T

42

Page 43: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointermain

T

43

Page 44: Building Interpreters

while

andseq

ass

ass

id+

x

idx

id

y

>

idx

const0

<

idx

const

10

+

idy

const

1

idy

Last node pointermain

First node pointer

T

44

Page 45: Building Interpreters

Demo Compiler

45

Page 46: Building Interpreters

Conditional Statement

if

condthen_part else_part

Last node pointer

46

Page 47: Building Interpreters

Conditional Statement

if

condthen_part else_part

Last node pointer

End_If

T F

47

Page 48: Building Interpreters

Iterative Interpretation

• Closed to CPU

• One flat loop with one big case statement

• Use explicit stack– Intermediate results– Local variables

• Requires fully annotated threaded AST– Active-node-pointer (interpreted node)

48

Page 49: Building Interpreters

Demo Compiler

49

Page 50: Building Interpreters

Conditional Statements

50

Page 51: Building Interpreters

Storing Threaded AST

• General Graph

• Array

• Pseudo Instructions

51

Page 52: Building Interpreters

Threaded AST as General Graph

condition

statement 1

IF

statement 2

statement 3

statement 4END

If 52

Page 53: Building Interpreters

Threaded AST as Array

condition

IF

statement 1

statement 2

statement 3

statement 4

53

Page 54: Building Interpreters

Threaded AST as Pseudo Instructions

condition

IFFALSE

statement 1

statement 2statement 3

statement 4

JUMP

54

Page 55: Building Interpreters

Iterative Interpreters (Summary)

• Different AST representations

• Faster than recursive interpreters– Some interpretative overhead is eliminated

• Portable

• Secure

• Similarities with the compiler

55