Lecture 8

  1. About the Java PL0 Interpreter
  2. Interpreter Stack Frames
  3. PL0 Interpreter Implementation in Java
  4. Rewrite Rules for Grammars

1.0 - Implementation of PL0 Interpreter in Java

The Tutorial 4 Compiler isn’t actually a compiler - it doesn’t have the code generation phase. In place of code generation phase, we have implemented an interpreter that executes the nodes in the Abstract Syntax Tree (AST)

1.1 - The PL0 Interpreter

2.0 - Interpreter Stack Frames

2.1 - Nested Procedures in PL0

// main program defined at static level 1
var x : int;
    y : int; // Main program variables, static level 1

procedure p() = // defined at static level 2
  var y : int; // Procedure p variable, static level 2
  
  procedure q() = 
    var z : int; // Procedure q variables, static level 3
    begin z := y; x := z end;
  
  begin ... end // procedure p's block
begin ... end // main program block

2.2 - Interpreter Stack Frames

2.2.1 - Example of Interpreter Stack Frames

  1. To start running this code, we first create an activation record for the main procedure:

    • Static Link = null Since the main procedure is at the highest static level
    • Dynamic Link = null Since the main procedure activation record is the first activation record in the stack
    • We note that our entries array is [,] - in our symbol table we designate that each variable will be stored in the entries array with a given offset.
      • When implementing the entries array, we initialise it to an array of NULLs.
      • We can implement in the Dynamic Semantics that if we try to access a variable that hasn’t been assigned a value (i.e. is NULL) we get an error.

    ▶️ main

    Static Link = null Dynamic Link = null level = 1 entries = [_, _]
  2. We then start by running the statement list in the main procedure’s body. The first statement to execute is an assignment statement

    x := 2
    
    • The expression 22 is a constant expression, with no additional complications
    • We now want to evaluate the left value, xx.
      • It is a variable, at static level of 1, with an offset of 0.
      • The current frame (i.e. the one created in the previous step) has a level of 1 so therefore, the value of the expression will be stored in entries[0]
      • And we then update our stack frame:

    ▶️ main

    Static Link = null Dynamic Link = null level = 1 entries = [2, _]
  3. We then execute the second statement in the statement list:

    call f()
    
    • When we call the procedure f(), we need to create an activation record for it.
    • Note that for the activation record for the procedure f(), both the Static Link and Dynamic Link point to the same activation record (the activation record for the main procedure)
      • The Dynamic Link points to the activation record that called this procedure.
      • The Static Link points to the most recent activation record
    • Also note that the entries array only has one element - this is for the re-declared integer variable y which has an offset of 0 (stored in entries[0])
    • Additionally, note that the static level has been incremented by 1
    • Now we, we create the new activation record, and grow the stack downward.

    ▶️ main

    Static Link = null Dynamic Link = null level = 1 entries = [2, _]

    procedure f

    Static Link = main Dynamic Link = main level = 2 entries = [_]
  4. We have now entered the procedure, and now evaluate the first statement in the assignment:

    y := x;
    
    • First, we evaluate the expression xx - we need to know what static level and offset it has.

      • We know that xx is defined at a static level of 1, so we follow the dynamic links until we reach an activation record with static level of 1.
      • In this case, this is the previous frame.
      • From the previous frame’s metadata, we know that xx is defined at a level of 1, with offset 0, i.e. xx is stored at entries[0]
      • From there, we obtain the current value of xx, x=2\color{lightblue}x=2 and now we work on assigning it to yy

      • We know that yy is at a static level of 2 and an offset of 0 - stored at entries[0] at level 2
      • We then know that we want to assign y, entries[0] the value of 2.

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [21, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]
    1. We then evaluate the following conditional:

      if x > 0 then
          begin
              x := x - 1;
              call f()
          end
      else
      
      • Using the process outlined before in step 4, we determine that x=2x=2
        • Since x>0x>0, we execute the code in the then part of the conditional (as opposed to the else part of the conditional)
    2. In evaluating the conditional, we want to execute the following statement:

      x := x - 1;
      
      • We know from before that x=2x=2 and the activation record that contains xx is the activation record for the main procedure.
      • We want to decrement the value of x, so we update that activation record

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [1, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]
    3. We then execute the other statement in the then branch of the conditional - calling procedure f() again

      • Since we have called a function, we create a new activation record.

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [1, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [_]
      • The static link is procedure f as that is the activation record that called the function
      • The dynamic link is main as that is the most recent invocation (occurrence??) of the procedure that defines the function
      • The level is 2, as that is the depth of scope of the procedure (note that it doesn’t increment).
      • So, variable yy will be stored at a static level of 2, and offset of 0, i.e. entries[0]
    4. We then execute the first statement in the body of f()

      y := x;
      
      • From before, we know that xx is stored at a static level of 1, offset of 0 and x=2x=2
      • We look at our current activation record, and see that it’s at a static level of 1
        • Therefore, xx is not stored here
        • Need to follow the chain of static links back until we get to the correct static level
        • Following the static links, we see that x=1x=1
      • Looking at the activation record, we see that y is at a static level of 2, with offset of 0
        • The current activation record is at a static level of 2 and therefore we’re looking at the right activation record
        • We assign y the value of x=1x=1 by setting entries[0] to 1

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [1, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [1]
    5. We then evaluate the conditional statement - since x>0x > 0, we execute the then branch of the conditional.

      • We evaluate the first statement in the branch:
      x := x - 1;
      
      • From before, we know that xx is stored in the activation record for the main procedure, at an offset of 0.
        • We decrement the entry by 1

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [10, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [1]
    6. We then evaluate the second statement in the branch:

      • Another procedure invocation of f()
      call f()
      
      • To represent this procedure invocation, we create a new activation record

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [0, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [1]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [_]
    7. We then evaluate the first statement in the procedure f:

      y := x;
      
      • We know that xx is stored at a static level of 1, and offset of 0 (in the activation record for the main procedure)
        • From this, we know that x=0x=0
      • We know that yy is stored at a static level of 2, and offset of 0 (in the current activation record) and from this we set the value of y

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [0, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [1]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [0]
    8. We then evaluate the conditional

      • Since
      • Since x0x \ngtr 0, we evaluate the else branch of the conditional
      • We’re essentially done with this procedure invocation, and can remove the activation record
      • We then pop that activation record, and return to the previous activation record (using the static link)

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [0, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]

      procedure f

      Static Link = procedure f Dynamic Link = main level = 2 entries = [1]
    9. In the second invocation of the procedure f, we continue executing the next statement after the procedure call (of the activation record that we have just popped from the stack).

      write y
      
      **// Output (To STDOUT):
      0**
      
    10. Following this same pattern, we finish the procedure execution of the second invocation of procedure f and pop from the stack once again after evaluating the write y statement.

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [0, _]

      procedure f

      Static Link = main Dynamic Link = main level = 2 entries = [2]
      **// Output (To STDOUT):
      01**
      
    11. For the last time, we evaluate the write y statement and pop the activation record corresponding to the first invocation of the procedure f,

      • At this point, the static level is 1, and we have returned to the activation record corresponding to the main

      ▶️ main

      Static Link = null Dynamic Link = null level = 1 entries = [0, _]
      **// Output (To STDOUT):
      012**
      

3.0 - PL0 Interpreter Implementation in Java

The Interpreter class is contained in the interpreter package.

3.1 - The Value Abstract Class & Subclasses

3.2 - The Interpreter Class

3.3 - Evaluating Expressions in the Java PL0 Interpreter

3.3.1 - Evaluating an Error Expression Node

What happens when we evaluate an ErrorExpNode?

3.3.2 - Evaluating a Constant Node

⚠️ Note here that we have the beginExec(...) and endExec(...) function calls - these just print out a message when we run the interpreter in debug mode.

3.3.3 - Evaluating an Identifier Node

⚠️ Identifier nodes are introduced into our AST during the parsing phase, but were all eliminated during the static analysis phase. Therefore, we program defensively again:

/**
 * Expression evaluation for an identifier node - should never be reached
 */
public Value visitIdentifierNode(ExpNode.IdentifierNode node) {
    /* Error when identifier node is evaluated, identifier nodes should
     * be eliminated by the semantic syntax process
     */
    errors.fatal("PL0 Internal error: attempt to evaluate IdentifierNode",
            node.getLocation());
    return null; // Never reached
}

3.3.4 - Evaluating a Variable Node

3.3.5 - Evaluating a Binary Node

/**
 * Expression evaluation for a binary operator expression
 **/
public Value visitBinaryNode(ExpNode.BinaryNode node) {
    beginExec("Binary");
    int result = -1;
    /* Evaluate the left and right sides of the operator expression */
    int left = node.getLeft().evaluate(this).getInteger();
    int right = node.getRight().evaluate(this).getInteger();
    /* Perform the operation on the left and right side of the expression */
    switch (node.getOp()) {
        /* Mathematical operations */
        case ADD_OP:
            result = left + right;
            break;
        case SUB_OP:
            result = left - right;
            break;
        case MUL_OP:
            result = left * right;
            break;
        case DIV_OP:
            /* Error when division by zero occurs */
            if (right == 0) {
                runtime("Division by zero", node.getRight().getLocation(),
                        currentFrame);
            }
            result = left / right;
            break;
        /* Logical operations - resulting in 1 for true and 0 for false */
        case EQUALS_OP:
            result = (left == right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case NEQUALS_OP:
            result = (left != right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case GREATER_OP:
            result = (left > right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case LESS_OP:
            result = (left < right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case LEQUALS_OP:
            result = (left <= right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case GEQUALS_OP:
            result = (left >= right ? Type.TRUE_VALUE : Type.FALSE_VALUE);
            break;
        case INVALID_OP:
        default:
            errors.fatal("PL0 Internal error: Unknown operator",
                    node.getLocation());
    }
    endExec("Binary");
    return new IntegerValue(result);
}


3.3.6 - Evaluating a Unary node

The concept of evaluating a Unary Node is the same as evaluating a Binary Node (except, simpler)

  1. Evaluate the argument
  2. Use the switch case to perform operator-dependent things
/**
 * Expression evaluation for a unary operator expression
 **/
public Value visitUnaryNode(ExpNode.UnaryNode node) {
    beginExec("Unary");
    /* Handle unary operators */
    int result = node.getArg().evaluate(this).getInteger();
    //noinspection SwitchStatementWithTooFewBranches
    switch (node.getOp()) {
        case NEG_OP:
            result = -result;
            break;
        default:
            // Never reached
            errors.fatal("PL0 Internal error: Unknown operator",
                    node.getLocation());
    }
    endExec("Unary");
    return new IntegerValue(result);
}

3.3.7 - Evaluating a Dereference Node

We use Dereference nodes to get access to the value of a variable.

/**
 * Expression evaluation for dereference - evaluate sub-expression
 */
public Value visitDereferenceNode(ExpNode.DereferenceNode node) {
    beginExec("Dereference");
    Value lValue = node.getLeftValue().evaluate(this);
    /* Resolve the frame containing the variable node */
    Frame frame = currentFrame.lookupFrame(lValue.getAddressLevel());
    /* Retrieve the variables value from the frame */
    Value result = frame.lookup(lValue.getAddressOffset());
    if (result == null) {
        runtime("variable accessed before assignment", node.getLocation(),
                currentFrame);
        return null; // Never reached
    }
    endExec("Dereference");
    return result;
}
  1. We evaluate the left value of the dereference node, which will return an address.

    Value lValue = node.getLeftValue().evaluate(this);
    
  2. We then need to determine the value of the variable stored at this address.

    • We start by looking up the activation record in which is at the static level of the lValue’s address.

    • The currentFrame.lookupFrame method follows the link in the activation record until it reaches a frame with the correct static level.

    • Once we have obtained the correct frame, we look up the value in the array (using the address offset).

      Value lValue = node.getLeftValue().evaluate(this);
      /* Resolve the frame containing the variable node */
      Frame frame = currentFrame.lookupFrame(lValue.getAddressLevel());
      /* Retrieve the variables value from the frame */
      Value result = frame.lookup(lValue.getAddressOffset());
      
  3. We then perform some validation, alerting the user if a variable has been accessed before it has been assigned.

    • Note that this is an implementation decision or runtime semantics that we have decided upon for this particular compiler implementation.

      if (result == null) {
          runtime("variable accessed before assignment", node.getLocation(),
                  currentFrame);
          return null; // Never reached
      }
      
  4. We then return the result

    endExec("Dereference");
    return result;
    

3.3.8 - Evaluating a Narrow Subrange Node

3.4 - Executing Statements

3.4.1 - Executing a StatementList

public void visitStatementListNode(StatementNode.ListNode node) {
    beginExec("StatementList");
    for (StatementNode statement : node.getStatements()) {
        statement.accept(this);
    }
    endExec("StatementList");
}

3.4.2 - Executing an Assignment Node

public void visitAssignmentNode(StatementNode.AssignmentNode node) {
    beginExec("Assignment");
    /* Evaluate the code to be assigned */
    Value value = node.getExp().evaluate(this);
    /* Assign the value to the variables offset */
    Value lValue = node.getVariable().evaluate(this);
    assignValue(lValue, value);
    endExec("Assignment");
}

3.4.3 - Visiting a Read Node

public void visitReadNode(StatementNode.ReadNode node) {
    beginExec("Read");
    /* Read next int from standard input */
    IntegerValue result = null;
    try {
        result = new IntegerValue(Integer.parseInt(in.readLine()));
    } catch (Exception e) {
        runtime("invalid value read - must be an integer",
                node.getLocation(), currentFrame);
        // Never reached
    }
    Value lValue = node.getLValue().evaluate(this);
    assignValue(lValue, result); // Assign the result to the address of the left value.
    endExec("Read");
}

3.4.4 - Visiting a Write Node

public void visitWriteNode(StatementNode.WriteNode node) {
    beginExec("Write");
    /* Evaluate the write expression */
    int result = node.getExp().evaluate(this).getInteger();
    /* Print the result to the outStream */
    outStream.println(result);
    endExec("Write");
}

3.4.5 - Visiting a Procedure Call

A procedure call is a little more complicated - we have to add another activation record to our stack for the procedure that we’re calling.

public void visitCallNode(StatementNode.CallNode node) {
    beginExec("Call");
    /* Decent to the executing procedures frame */
    currentFrame = currentFrame.enterFrame(node.getEntry());
    /* Resolve the code block to call and execute the block */
    node.getEntry().getBlock().accept(this);
    /* Return to the parent frame */
    currentFrame = currentFrame.exitFrame();
    endExec("Call");
}

  1. First we initialise the call, and we enter in to the SymEntry.ProcedureEntry.

    • Note that node.getEntry() returns the symbol table procedure entry of the procedure call.
    • Once we get the symbol table procedure entry corresponding to the new procedure call, we set it to the top of the stack.
    currentFrame = currentFrame.enterFrame(node.getEntry());
    
  2. We then execute the block:

    • node.getEntry() returns the symbol table procedure entry
    • node.getEntry().getBlock() gets the block of code to be executed
    • node.getEntry().getBlock().accept(this) executes the block of code.
    node.getEntry().getBlock().accept(this);
    
  3. After we complete the execution of the code, we want to return to the previous stack frame (using the dynamic link in the stack frame).

    currentFrame = currentFrame.exitFrame();
    
  4. Run endExec for debugging purposes.

    endExec("Call");
    

3.4.6 - Executing an If Statement / If Node

public void visitIfNode(StatementNode.IfNode node) {
    beginExec("If");
    ExpNode condition = node.getCondition();
    if (condition.evaluate(this).getInteger() == Type.TRUE_VALUE) {
        /* Execute then statement if condition evaluates to true */
        node.getThenStmt().accept(this);
    } else {
        /* Execute else statement if condition evaluates to false */
        node.getElseStmt().accept(this);
    }
    endExec("If");
}

3.4.7 - Executing a While Statement / While Node

public void visitWhileNode(StatementNode.WhileNode node) {
    beginExec("While");
    /* Execute loop statement while the condition is true */
    ExpNode condition = node.getCondition();
    while (condition.evaluate(this).getInteger() == Type.TRUE_VALUE) {
        node.getLoopStmt().accept(this);
    }
    endExec("While");
}

4.0 - Rewrite Rules for Grammars

4.1 - Left Factoring Grammar Productions

4.1.1 - Formal Definition of Left Factor Rewriting Rule

4.2 - Removing Left Recursion from Grammars


Let’s try to re-write this production


4.2.1 - Formal Definition of Immediate Left Recursion Rule (Simple Case)

4.2.2 - Formal Definition of Immediate Left Recursion Rule (General Case)

4.2.3 - Immediate Left Recursion Example

EE+TETTEE+TETTEEE\rightarrow E+T | E-T|T\\ E\rightarrow E{\color{pink}+T} | E{\color{pink}-T}|{\color{lightblue}T}\\ E\rightarrow E
Aβ1 AAϵα1Aα2AA\rightarrow{\color{lightblue}\beta_1}\ A'\\ A'\rightarrow\epsilon|{\color{pink}\alpha_1}A'|{\color{pink}\alpha_2}A'
ET EEϵ+TATAE\rightarrow{\color{lightblue}T}\ E'\\ E'\rightarrow\epsilon|{\color{pink}+T}A'|{\color{pink}-T}A'

This is of the same form as AAα1Aα2β1A\rightarrow A{\color{pink}\alpha_1}|A{\color{pink}\alpha_2}|{\color{lightblue}\beta_1}

Using the Immediate Left Recursion rule, we have:

α1=+Tα2=T{\color{pink}\alpha_1}=+T\\ {\color{pink}\alpha_2}=-T

Which instantiated for the example gives:

Note: In EBNF, this is equivalent to ET{+TT}E\rightarrow T \{+T|-T\}

4.2.4 - Indirect Left-Recursion Rewriting Rule