BSc CSIT (TU) Science Compiler Design and Construction (BSc CSIT, CSC365) Question Paper 2074 Nepal

Q: Where can I find the BSc CSIT (TU) Compiler Design and Construction (BSc CSIT, CSC365) question paper 2074?

The full BSc CSIT (TU) Compiler Design and Construction (BSc CSIT, CSC365) 2074 (Regular (annual)) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Compiler Design and Construction (BSc CSIT, CSC365) 2074 paper come with solutions?

Yes. Every question on this Compiler Design and Construction (BSc CSIT, CSC365) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Compiler Design and Construction (BSc CSIT, CSC365) 2074 paper?

The BSc CSIT (TU) Compiler Design and Construction (BSc CSIT, CSC365) 2074 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.

Q: Is practising this Compiler Design and Construction (BSc CSIT, CSC365) past paper free?

Yes — reading and attempting this Compiler Design and Construction (BSc CSIT, CSC365) past paper on Kekkei is completely free.

Question

1Long answer10 marks

Explain the different phases of a compiler with a suitable diagram, showing the output of each phase for the statement position = initial + rate * 60.

phasescompiler

Answer 1

Phases of a Compiler

A compiler translates a source program into an equivalent target program through a sequence of phases, grouped into the analysis (front end) and synthesis (back end). Each phase consults the symbol table and an error handler.

Source program
     |
[1] Lexical Analyzer (Scanner)
     |   -> tokens
[2] Syntax Analyzer (Parser)
     |   -> syntax (parse) tree
[3] Semantic Analyzer
     |   -> annotated tree
[4] Intermediate Code Generator
     |   -> three-address code
[5] Code Optimizer
     |   -> optimized IR
[6] Target Code Generator
     |
Target program

(Symbol Table and Error Handler interact with all phases.)

Description of each phase

Lexical analysis – groups characters into tokens (identifiers, operators, constants) and removes whitespace/comments.
Syntax analysis – checks that tokens form valid sentences per the grammar and builds a parse/syntax tree.
Semantic analysis – performs type checking, scope resolution and inserts coercions.
Intermediate code generation – produces a machine-independent IR such as three-address code.
Code optimization – improves the IR (removes redundancy, folds constants) without changing meaning.
Code generation – produces target/assembly code, assigns registers.

Output of each phase for `position = initial + rate * 60`

Lexical analysis (tokens):

id1 = id2 + id3 * 60

where id1=position, id2=initial, id3=rate, and =, +, * are operators, 60 is a number.

Syntax analysis (syntax tree):

Semantic analysis – 60 (integer) is converted to real to match the real operands:

        =
       / \
     id1   +
          /  \
        id2    *
              /  \
            id3   inttoreal
                     |
                     60

Intermediate code (three-address code):

t1 = inttoreal(60)
t2 = id3 * t1
t3 = id2 + t2
id1 = t3

Code optimization:

t1 = id3 * 60.0
id1 = id2 + t1

Code generation (assembly):

MOVF  id3, R2
MULF  #60.0, R2
MOVF  id2, R1
ADDF  R2, R1
MOVF  R1, id1

Answer 2

LR(1) / Canonical LR Parsing Table

Augmented grammar:

0. S' -> S
1. S  -> C C
2. C  -> c C
3. C  -> d

FIRST sets

$FIRST(S)=FIRST(C)=\{c,d\}$ .

Canonical collection of LR(1) item sets

I0: S' -> .S, $ ; S -> .CC, $ ; C -> .cC, c/d ; C -> .d, c/d

I1 = goto(I0,S): S' -> S., $ (accept)

I2 = goto(I0,C): S -> C.C, $ ; C -> .cC, $ ; C -> .d, $

I3 = goto(I0,c): C -> c.C, c/d ; C -> .cC, c/d ; C -> .d, c/d

I4 = goto(I0,d): C -> d., c/d

I5 = goto(I2,C): S -> CC., $

I6 = goto(I2,c): C -> c.C, $ ; C -> .cC, $ ; C -> .d, $

I7 = goto(I2,d): C -> d., $

I8 = goto(I3,C): C -> cC., c/d

I9 = goto(I6,C): C -> cC., $

Gotos: goto(I3,c)=I3, goto(I3,d)=I4; goto(I6,c)=I6, goto(I6,d)=I7.

Parsing table

(sN=shift to N, rN=reduce by rule N, acc=accept)

State	c	d	$	S	C
0	s3	s4		1	2
1			acc
2	s6	s7			5
3	s3	s4			8
4	r3	r3
5			r1
6	s6	s7			9
7			r3
8	r2	r2
9			r2

Parsing the string `cdd$`

Stack	Input	Action
0	c d d $	shift s3
0 c 3	d d $	shift s4
0 c 3 d 4	d $	reduce C->d (r3), goto(3,C)=8
0 c 3 C 8	d $	reduce C->cC (r2), goto(0,C)=2
0 C 2	d $	shift s7
0 C 2 d 7	$	reduce C->d (r3), goto(2,C)=5
0 C 2 C 5	$	reduce S->CC (r1), goto(0,S)=1
0 S 1	$	accept

The string cdd is accepted (valid).

Answer 3

Directed Acyclic Graph (DAG)

A DAG is a directed graph with no directed cycles, used to represent the structure of basic blocks/expressions. Unlike a syntax tree, common sub-expressions are shared (a node is created only once and reused), so the DAG exposes redundancy and helps in optimization. Interior nodes are operators; leaves are identifiers/constants.

Expression: `a + a * (b - c) + (b - c) * d`

The common sub-expression (b - c) is computed once and shared.

DAG (described): a leaf b and c feed a - node giving t1. This t1 is used by both a * with a (t2 = a * t1) and a * with d (t4 = t1 * d). Then t3 = a + t2, and finally t5 = t3 + t4.

Three-Address Code

t1 = b - c
t2 = a * t1
t3 = a + t2
t4 = t1 * d
t5 = t3 + t4

Quadruples

(op, arg1, arg2, result)

#	op	arg1	arg2	result
0	-	b	c	t1
1	*	a	t1	t2
2	+	a	t2	t3
3	*	t1	d	t4
4	+	t3	t4	t5

Triples

(temporaries referenced by instruction index (n))

#	op	arg1	arg2
(0)	-	b	c
(1)	*	a	(0)
(2)	+	a	(1)
(3)	*	(0)	d
(4)	+	(2)	(3)

Answer 4

Compiler vs Interpreter

Compiler	Interpreter
Translates the whole source program into machine/target code at once.	Translates and executes the program line by line / statement by statement.
Produces a separate object/target program.	Does not produce object code; executes directly.
Execution is faster (translation done once).	Execution is slower (re-translates each run).
Reports all errors after scanning the entire program.	Reports the first error and stops, easing debugging.
Needs more memory for object code.	Generally needs less memory.
Examples: C, C++ compilers.	Examples: Python, BASIC interpreters.

Answer 5

Grammar: `A -> Aab | Ac | bd | f`

Step 1: Eliminate left recursion

The rule has the form A -> A\alpha | \beta with $\alpha \in \{ab, c\}$ and $\beta \in \{bd, f\}$ .

Using the standard transformation A -> \beta A', A' -> \alpha A' | \epsilon:

A  -> bd A' | f A'
A' -> ab A' | c A' | ε

Step 2: Left factoring

In the A productions, both alternatives bd A' and f A' begin with different symbols (b vs f), so there is no common prefix — no left factoring is needed for A.

For A', the alternatives ab A', c A', ε also have distinct first symbols (a, c), so no common prefix exists.

Final grammar (left-recursion-free, already left-factored)

A  -> bd A' | f A'
A' -> ab A' | c A' | ε

Answer 6

Symbol Table

A symbol table is a data structure maintained by the compiler that stores information about each identifier (variables, constants, functions, types) appearing in the source program. Typical attributes stored are the name, type, scope, storage location/offset, size, and parameter details.

Role in a compiler

Lexical analysis – enters identifiers into the table as tokens are recognized.
Syntax & semantic analysis – used for type checking, scope resolution and detecting undeclared/duplicate identifiers.
Intermediate & code generation – provides addresses/offsets and type/size information for generating correct code.
Optimization – supplies attribute information used during analysis.

It supports two main operations: insert (add a new entry) and lookup (retrieve an entry). It is commonly implemented using a hash table for fast access, or lists/trees. Thus the symbol table acts as a central repository accessed by almost all phases of the compiler.

Answer 7

Parse Tree vs Syntax Tree

Parse Tree (Concrete Syntax Tree)	Syntax Tree (Abstract Syntax Tree)
Shows the complete derivation according to the grammar.	A condensed form of the parse tree.
Every grammar symbol, including non-terminals and intermediate productions, appears as a node.	Only operators and operands are kept; redundant nodes are removed.
Leaves are terminals; interior nodes are non-terminals.	Interior nodes are operators; leaves are operands.
Larger, contains chains for single productions.	Smaller, more efficient for later phases.

Example: for a + b * c

Parse tree expands through E -> E + T, T -> T * F, F -> id, etc., showing all E, T, F nodes.

Syntax tree:

Answer 8

Ambiguous Grammar

A grammar is ambiguous if there exists at least one string in its language that has more than one distinct parse tree (or equivalently, more than one leftmost/rightmost derivation).

Example

Grammar: E -> E + E | E * E | id

For the string id + id * id, two different parse trees exist:

Parse tree 1 (multiplication first — correct precedence):

      +
     / \
   id   *
       / \
     id   id

Parse tree 2 (addition first):

        *
       / \
      +   id
     / \
   id   id

Since the same string yields two parse trees, the grammar is ambiguous. Ambiguity is undesirable because it makes parsing non-deterministic; it can be removed by introducing precedence/associativity (e.g. using E -> E + T | T, T -> T * F | F, F -> id).

Answer 9

Role of the Lexical Analyzer (Scanner)

The lexical analyzer is the first phase of a compiler. It reads the source program as a stream of characters and groups them into meaningful units called tokens (e.g. identifiers, keywords, constants, operators, punctuation).

Main roles / functions

Tokenization – scans input and produces <token-name, attribute-value> pairs for the parser.
Removing whitespace and comments – discards blanks, tabs, newlines and comments.
Symbol table interaction – enters identifiers/constants into the symbol table.
Error reporting – detects illegal characters/tokens and reports their line number.
Macro/preprocessing & line counting in some implementations.

It works on demand: the parser calls getNextToken(), and the scanner returns the next token. It typically uses regular expressions specified by the language and is implemented as a finite automaton (DFA), often generated by tools like LEX/Flex.

Answer 10

LL(1) vs LR Parsers

LL(1) Parser	LR Parser
Top-down parsing.	Bottom-up parsing.
First L=left-to-right scan, second L=Leftmost derivation.	L=left-to-right scan, R=Rightmost derivation (in reverse).
Builds the parse tree from root to leaves.	Builds the parse tree from leaves to root.
`1` means one lookahead symbol.	Uses item sets/states; LR(0), SLR(1), LALR(1), CLR(1).
Grammar must have no left recursion and must be left-factored.	Handles a larger class of grammars (incl. left recursion).
Uses predictive/recursive-descent parsing with a stack.	Uses a shift-reduce parser driven by a parsing table.
Less powerful, simpler to implement by hand.	More powerful; usually generated by tools (YACC).

Answer 11

Regular Expressions

A regular expression (RE) is a notation for describing regular languages — the set of strings recognized by finite automata. They are used in lexical analysis to specify the patterns of tokens.

Operations

Given REs r and s:

Union: r | s
Concatenation: r s
Kleene closure (zero or more): r*
Positive closure (one or more): r+

Regular expression for identifiers

An identifier begins with a letter or underscore, followed by zero or more letters, digits, or underscores:

letter -> A | B | ... | Z | a | b | ... | z
digit  -> 0 | 1 | ... | 9

identifier -> ( letter | _ ) ( letter | digit | _ )*

Compactly: [A-Za-z_][A-Za-z0-9_]*

Answer 12

Handle and Handle Pruning in Shift-Reduce Parsing

Handle

A handle of a right-sentential form $\gamma$ is a substring that matches the right side of a production and whose reduction to the left-side non-terminal represents one step along the reverse of a rightmost derivation. Formally, if $S \Rightarrow_{rm}^{*} \alpha A w \Rightarrow_{rm} \alpha \beta w$ , then $A \rightarrow \beta$ in the position following $\alpha$ is a handle.

Handle Pruning

Handle pruning is the process of repeatedly finding a handle and reducing it (replacing it by the corresponding non-terminal) until the start symbol is reached. It produces a rightmost derivation in reverse and is the basis of bottom-up (shift-reduce) parsing.

Example

Grammar: E -> E + E | E * E | id, input id + id * id:

Right-sentential form	Handle	Reducing production
id + id * id	id (first)	E -> id
E + id * id	id	E -> id
E + E * id	id	E -> id
E + E * E	E * E	E -> E * E
E + E	E + E	E -> E + E
E	—	(start symbol reached)

In a shift-reduce parser, symbols are shifted onto the stack until a handle appears on top, which is then reduced (pruned).

Level	BSc CSIT (TU)
Stream	Science
Subject	Compiler Design and Construction (BSc CSIT, CSC365)
Year	2074 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions

BSc CSIT (TU) Science Compiler Design and Construction (BSc CSIT, CSC365) Question Paper 2074 Nepal

Section A: Long Answer Questions

Phases of a Compiler

Description of each phase

Output of each phase for `position = initial + rate * 60`

LR(1) / Canonical LR Parsing Table

FIRST sets

Canonical collection of LR(1) item sets

Parsing table

Parsing the string `cdd$`

Directed Acyclic Graph (DAG)

Expression: `a + a * (b - c) + (b - c) * d`

Three-Address Code

Quadruples

Triples

Section B: Short Answer Questions

Compiler vs Interpreter

Grammar: `A -> Aab | Ac | bd | f`

Step 1: Eliminate left recursion

Step 2: Left factoring

Final grammar (left-recursion-free, already left-factored)

Symbol Table

Role in a compiler

Parse Tree vs Syntax Tree

Ambiguous Grammar

Example

Role of the Lexical Analyzer (Scanner)

Main roles / functions

LL(1) vs LR Parsers

Regular Expressions

Operations

Regular expression for identifiers

Handle and Handle Pruning in Shift-Reduce Parsing

Handle

Handle Pruning

Example

Frequently asked questions

Section A: Long Answer Questions

Phases of a Compiler

Description of each phase

Output of each phase for position = initial + rate * 60

LR(1) / Canonical LR Parsing Table

FIRST sets

Canonical collection of LR(1) item sets

Parsing table

Parsing the string cdd$

Directed Acyclic Graph (DAG)

Expression: a + a * (b - c) + (b - c) * d

Three-Address Code

Quadruples

Triples

Section B: Short Answer Questions

Compiler vs Interpreter

Grammar: A -> Aab | Ac | bd | f

Step 1: Eliminate left recursion

Step 2: Left factoring

Final grammar (left-recursion-free, already left-factored)

Symbol Table

Role in a compiler

Parse Tree vs Syntax Tree

Ambiguous Grammar

Example

Role of the Lexical Analyzer (Scanner)

Main roles / functions

LL(1) vs LR Parsers

Regular Expressions

Operations

Regular expression for identifiers

Handle and Handle Pruning in Shift-Reduce Parsing

Handle

Handle Pruning

Example

Frequently asked questions

Output of each phase for `position = initial + rate * 60`

Parsing the string `cdd$`

Expression: `a + a * (b - c) + (b - c) * d`

Grammar: `A -> Aab | Ac | bd | f`