Tutorial 5
For a production of the form A→αNβ
Add First(β)−{ϵ}∈Follow(N)
Question 1
Consider the following grammar for possibly nested lists of numbers and identifiers enclosed in parentheses. The symbols NUM and ID are terminal symbols (lexical tokens) that matches numbers and identifiers, respectively.
lexpatomlistlexpseq→atom | list→NUM ∣ ID→’(’ lexpSeq ’)’→lexpSeq lexp | lexp
-
Remove any left recursion in the grammar
To remove the left recursion from:
A→Aα∣β
Which matches a β followed by one or more occurrences of α, one can re-write the production as follows: A→βA′A′→ϵ∣αA′
-
The left recursion in this grammar occurs in the production
lexpseq→lexpSeq lexp | lexp
- We resolve this by introducing a new non-terminal symbol, say lexpNext, and a new production
- Using the formula defined above, A = lexpseq, alpha=lexp, beta=lexp
lexpseq→lexp lexpNextlexpNext→ϵ∣α lexpNext’
-
Finally, the productions are:
lexpatomlistlexpseqlexpNext→atom | list→NUM ∣ ID→’(’ lexpSeq ’)’→lexp lexpNext→ϵ∣lexp lexpNext’
-
Construct First and Follow sets for the non-terminals of the resulting grammar for the question above
- The non-terminal symbols in the productions above are:
{lexp, atom, list,lexpSeq, lexpNext}
First(NUM)=
-
d
lexpatomlistlexpseqlexpNext→atom | list→NUM ∣ ID→’(’ lexpSeq ’)’→lexp lexpNext→ϵ∣lexp lexpNext
- FIrst column - add all of the terminal symbols in the production
- lexp can start with an atom, so we add the first set of an atom to lexp
lexp |
|
NUM, ID |
NUM, ID |
NUM, ID |
|
|
atom |
{NUM, ID} |
{NUM, ID} |
{NUM, ID} |
{NUM, ID} |
|
|
list |
{’(’} |
{’(’} |
{’(’} |
{’(’} |
|
|
lexpSeq |
|
|
NUM, ID |
NUM, ID, “(” |
|
|
lexpNext |
ϵ |
ϵ |
ϵ |
ϵ |
|
|
d.
The following productions are given from before
lexpseq→lexp lexpNextlexpNext→ϵ∣lexp lexpNextWe can convert to extended bnf form or some shit
lexpSeq → lexp {lexp}
Q2
Type → INT | Type’
Type’ → \epsilon | LBRACKET RBRACKET Type’
Convert to EBNF
Type → INT {LB RB}