Aho - Chapter 1.1 to 1.2 Summary

Sections

1.1 - Language Processors 1.2 - The Structure of a Compiler

1.1 - Language Processors

What is the difference between a compiler and an interpreter?

Figure 1: A compiler is a program that translates a source program into a target language

Figure 2: An interpreter is a program which directly executes operations specified in the source program

1.1.1 - The Java Compiler (Hybrid Compiler)

Figure 3: A hybrid compiler combines both translation and sequential execution

1.1.3 - The Preprocessor

Figure 4: Language Processing System

1.1.4 - Exercises for Chapter 1.1

  1. What is the difference between a compiler and an interpreter?
    • A compiler is a program that translates an entire program (from a source language to a target language). An interpreter is a program that translates a program (from a source language to a target language) statement by statement.
  2. What are the advantages of a compiler over an interpreter?
    • A program produced by a compiler is typically faster to execute (i.e. faster at mapping inputs to outputs)
  3. What are the advantages of an interpreter over an interpreter?
    • A program produced by an interpreter can produce better error diagnostics as statements are translated (and ultimately executed) sequentially.
  4. A compiler that translates a high-level language into another high-level language is called a source-to-source translator. What advantages are there to using C as a target language for a compiler?
    • Compiled into a lower level language - compiler from C → target machines
      • Don’t have to write compiler for language, can leverage compilers for C
      • Just have to focus on compiler from language to C.
  5. Describe some of the tasks that an assembler needs to perform
    • Translate from assembly language to machine code (Fairhurst, G)

1.2 - The Structure of a Compiler

Figure 5: Phases of a Compiler

1.2.1 - Lexical Analysis

The first phase of a compiler is called lexical analysis or scanning.

Lexical Analysis Example

Suppose the source program contains the following assignment statement

position = initial + rate * 60
Figure 6: Lexical Analysis Example

1.2.2 - Syntax Analysis

The second phase of a compiler is syntax analysis / parsing and uses the first component of the tokens generated by Lexical Analysis to generate an intermediate representation (syntax tree)

Figure 6: Syntax Analysis Example
  1. The node * indicates that we must multiply the value of rate by the integer 60.
  2. The node + indicates that that we must add the result of the multiplication to the value of initial
  3. The root node of the tree labelled = indicates that we must store the result of this addition into the location for the identifier positionposition

1.2.3 - Semantic Analysis

The Semantic Analyser checks the source program for semantic consistency and gathers type information for subsequent use (& performs type checking).

A binary arithmetic operator may be applied to either a pair of integers or to a pair of floating-point numbers. If the operator is applied to a floating-point number and an integer, the compiler may convert / coerce the integer into a floating-point number.

Figure 6: Syntax Tree

1.2.4 - Intermediate Code Generation

When translating source code into target code, a compiler may construct one or more intermediate representations (which can have a variety of forms) e.g. syntax trees

t1 = inttofloat(60)
t2 = id3 * t1
t3 = id2 + t2
id1 = t3

1.2.5 - Code Optimization

Goal: Improve the intermediate code so that better target code is generated. Typically “better” means faster, but other objectives may be desired, such as shorter code, or target code that consumes less power.

1.2.6 - Code Generation

Generate code in the target language based on the intermediate representation

{ optimised three-address code}
t1 = id3 * 60.0
id1 = id2 + t1 

{ the corresponding machine code}
LDF R2, id3 
MULF R2, R2, #60.0
LDF R1, id2
ADDF R1, R1, R2
STF id1, R1

1.2.7 - Symbol Table

The symbol table is a data structure containing a record for each variable name, with fields for the attributes of the name. The data structure should be designed to allow the compiler to find the record for each name quickly and to store or retrieve data from that record quickly.

1.2.8 - Compiler Phases into Passes

Can group several compiler phases into a single pass.

1.2.9 - Compiler Construction Tools

Tools (and their respective descriptions) that can be used to aide the construction of a compiler.