Lecture 21
1.0 - Static Semantics of PL0 Declarations
1.1 - PL0 Abstract Syntax
1.1.1 - Abstract Syntax for Declarations
- The notation
stands for a mapping from identifiers to declarations
1.1.2 - Abstract Syntax for Statements
1.1.3 - Example PL0 Program
// Declarations for main procedure
const K = 10;
type S = [-K..K];
W = S;
var x : S;
y : W;
procedure q() =
var x : boolean;
begin
x := (y > 0);
if x then write 1 else write 0
end; // q
begin // main
x := 2;
y := -x;
call q();
write x
end // main
- The code in the main method (enclosed between begin and end) are interpreted with respect to the variables defined at the top of the file
- The code in the procedure q is interpreted with respect to the symbol table of the main method extended with the variables defined in the procedure itself.
- The definition of the identifier
in procedure as var x : boolean;
masks the definition ofin the scope of the main method
- The definition of the identifier
Abstract Syntax Tree
-
Consider the declarations in the program above:
const K = 10; type S = [-K..K]; W = S; var x : S; y : W; procedure q() = ...
-
We have the following declarations:
-
In which, we also have the procedure block defined as follows, in the AST
1.1.4 - Reminder → PL0 Types
-
Scalar Types
int
boolean
subrange(T, lower, upper)
, where T is either int or boolean
-
Reference types,
ref(T)
for variables store an address. -
Product Types
is type type of pairs where and , -
Function Types
is the type of functions with arguments of type and results of type , known as the function type -
For example, the following are the types of some operators
1.1.5 - Symbol Table
-
The symbol table stores all of the declarations from the PL0 programs.
-
The symbol is a mapping from symbol identifiers to entries:
-
The symbol table entries contain constants, types, variables and procedures:
- Observe that the constructor for a constant entry takes a PL0 type and an integer
- The constructor for a type entry takes a PL0 type.
- The constructor for a variable entry takes a PL0 type.
- The constructor for a PL0 entry takes a block (which contains declarations and the procedure code itself)
1.2 - Static Semantics of Constants
-
From before, we had that constants and types can appear in declarations:
-
Since we can determine the value of a constant at compilation time, we can enforce additional constraints on them.
-
We determine the value of constants in the static semantics phase of a compiler.
-
The following rules outline how we evaluate the value of constant expressions during the type checking phase of the compiler.
- Integer Constant Rule
- Constant Identifier Rule
- Negated Constant Rule
- Type Identifier Rule
- Subrange Type Rule
- Constant Declaration Rule
- Constant Entry Rule
- Type Declaration Rule
- Type Entry Rule
Integer Constant Rule
- This means that in the context of (any) symbol table, an integer constant evaluates to its value if it is between 0 and
(inclusive).
Constant Identifier Rule
- An identifier evaluates to a value
if it is in the domain of the symbol table ( ) and the entry associated with that identifier in the symbol table is a constant entry (with type and value )
Negated Constant Rule
- A constant identifier evaluates to
if the constant expression is well typed, and of an integer type as well as evaluates to some value
Type Identifier Rule
- The type of an identifier is defined if the identifier is in the domain of the symbol table
and the symbol table has a type entry corresponding to the identifier of type , then the type of the identifier is
Subrange Type Rule
-
If the constant expression
is well typed and has type and evaluates to the value in the context of the symbol table, and -
If the constant expression
is well typed and has type and evaluates to the value in the context of the symbol table, and -
We also require that
and that the types of the two constant expressions are the same, and are either an integer or Boolean -
For example, let’s consider the symbol table entry for the subrange type
in our PL0 code above: const K = 10; type S = [-K..K];
-
Consider its well-formedness:
Constant Declaration Rule
- A constant declaration is well formed in the context of the symbol table iff:
- The constant expression
evaluates to some value , and - The constant expression has some type
, which is either an integer or Boolean
- The constant expression
Constant Entry Rule
- If the above premises hold (same as for the Constant Declaration Rule), we can find out what the constant entry is in the context of the symbol table.
- We can find the constant entry, with type
and value
Type Declaration
- Given a declaration of type t,
, we know it is well formed in the context of a symbol table, if the type of t in the context of the symbol table is of some PL0 type T - We have additional rules which describe how to evaluate what the type
is.
Type Entry
- If in the context of a symbol table, the type of
is type , then we know that the symbol table entry for the type is a
1.3 - Static Semantics Rules for Variable Entries
Variable Declaration
- A variable declaration is well formed, if the type
exists in the context of the symbol table, and its type evaluates to some PL0 type
Variable Entry
- If the premise from the rule above holds, we can also find the variable entry in the symbol table.
- The type is
as it is a variable type.
- The type is
1.4 - Static Semantics Rules for Procedures
Procedure Declaration
- This is quite simple at the moment, given that the PL0 compiler doesn’t accept parameters.
- A procedure declaration is well formed in the context of the symbol table if and only if the block is well formed.
Procedure Entry
- Given that the above premise holds, we can once again determine the symbol table entry of the procedure.
2.0 - Cycles in Declarations
- The syntax of PL0 allows for cyclic definitions, as shown on the slide.
- In the static semantics of PL0, we enforce that cyclic definitions are not allowed.
2.1 - Uses Operator
We define this operator here, so that we can use it in the definition below.
-
We create the
operator, which specifies the identifiers used in the declaration: -
The identifiers used within a constant expression or type are also given by the function
-
Note here that
denotes a number
2.2 - Static Semantics Rules for Procedure Blocks
Well Formed Block
-
We first know that we want the statements in a procedure block to be well formed.
-
However, we want it to be extended in the context of the symbol table augmented with the definitions defined in
as well
-
-
We also want the declarations in the procedure definition block to be well formed in the context of the (augmented) symbol table
-
The extended symbol table is defined by the following formal notation:
- That is, the augmented symbol table
is given by adding the existing symbol table to the mapping of every identifier in the declarations to each entry declaration - this is formally denoted as
- That is, the augmented symbol table
-
We also add a rule which describes that we aren’t allowed to have cycles in our declarations
. -
We first define a “directly uses” relationship:
-
For any identifier in our declarations,
, if there is another identifier that directly uses this (original identifier) then there exists a mapping between these two identifiers, denoted -
However, our constraint shouldn’t just be whether a mapping - we need to consider the case where we have transitive dependencies.
-
Therefore, our constraint is:
-
Where
is the transitive closure of this mapping relationship: - Transitive Closure
-
That is, an identifier cannot transitively depend on itself.
-
EntryDecl
We said before, that each identifier gets an entry in the augmented symbol table. What does this mean, and how is it done?
-
The symbol table entry for
, where in the context of the symbol table , denoted is given as the entry of in the context of the extended symbol table . -
Note here that the definition of the augmented symbol table
has changed - it is only augmented by the identifiers in that directly uses, denoted -
Additionally, if this condition is met, the identifier will have an entry declaration (which is defined recursively using this rule)
-
This recursion is acceptable (and will terminates) given that the rule above specifies that
2.3.1 - Example of Using Static Semantics Rules
-
Given the following PL0 code:
// Declarations for main procedure const K = 10; type S = [-K..K]; W = S; var x : S; y : W; procedure q() = var x : boolean; begin x := (y > 0); if x then write 1 else write 0 end; // q begin // main x := 2; y := -x; call q(); write x end // main
-
And its corresponding abstract syntax tree,
We want to static check it:
-
We begin by going through
and constructing the symbol table entries -
- We initially have the declaration
in which we note that doesn’t depend on any identifiers - By the Constant Entry rule using Rule 5.1 and Rule 3.1, we have
- We initially have the declaration
-
-
We initially have the declaration
-
We note that this declaration depends on the identifier
twice -
We therefore must compute the entry of
, and augment the symbol table with this entry (by the EntryDecl Rule) -
By our previous calculation, we know that the entry for
is -
We now determine what the entry of the subrange type
, in the context of the augmented symbol table . We apply the Type Entry rule, as well as Rule 5.5 and 3.2 to get the TypeEntry below:
-
-
-
We initially have the declaration
. -
Since this declaration depends on the identifier
, we need to compute the entry of this symbol table with respect to the augmented symbol table. (By the EntryDecl rule) -
Since the entry depends on
, we augment the symbol table with the type entry declaration for it. -
By the previous computation, we know that
evaluates to -
By applying the Type Entry rule and Type Identifier rule, we derive that the type is given by
-
-
-
We initially have the declaration
-
Since this declaration depends on the identifier
, we need to compute the entry of this symbol table with respect to the augmented symbol table. (By the EntryDecl rule) -
Since the entry depends on
, we augment the symbol table with the type entry declaration for it. -
By the previous computation, we know that
evaluates to -
We then apply the Variable Entry and Type Identifier rule to get the following symbol table entry
-
-
-
We initially have the declaration
-
Since this declaration depends on the identifier
, we need to compute the entry of this symbol table with respect to the augmented symbol table. -
We augment the symbol table with the type entry declaration for it:
-
From the previous computation, we know that this can be expressed as
-
By the Variable Entry rule and Type Identifier rule, we have:
-
-
-
We initially have the declaration
. -
By the EntryDecl rule, the declaration is transformed to the following, noting that procedures have no parameters, and therefore only depend on the identifiers used in the block.
-
Therefore, by the EntryDecl rule, we have:
-
Additionally, by the Procedure Entry rule, we simplify this to:
-
-
We now concatenate the above derivations together, which will be later used to form the augmented symbol table
-
Given that we have worked out the extended symbol table, we can now finish the computation of the rule, which requires the checking of the procedure block. The example specific additions are denoted in
-
Observe here that the original rule form for computing the extended symbol table was
Here, we have inserted all of the symbol table entries that we have previously computed.
-
Now, we need to check that in the context of the new symbol table, the statements in the procedure block are well formed:
-
We also need to check that all of the identifiers in the declarations
are well formed in the context of the symbol table: -
We also need to check that we don’t have any circular definitions of identifiers:
The entries in
are derived from the declarations from before: - We have
from the declaration - We have
from the declaration - We have
from the declaration - We have
from the declaration
For each of these, we need to compute their transitive closure. It is given as:
- Observe that the transitive closure listed contains all of the items in
as well as all of the transitive items that we can derive (e.g. - Given this transitive closure, we then have to compute that no identifier depends on itself (in this example, the check passes)
- We have
-
2.3 - Static Semantics Rules for the Main Program
The main program is well formed if:
Where the symbol table for the predefined identifiers is