BSc CSIT (TU) Science Theory of Computation (BSc CSIT, CSC257) Question Paper 2075 Nepal

Q: Where can I find the BSc CSIT (TU) Theory of Computation (BSc CSIT, CSC257) question paper 2075?

The full BSc CSIT (TU) Theory of Computation (CSC257) 2075 (regular, annual) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Theory of Computation (BSc CSIT, CSC257) 2075 paper come with solutions?

Yes. Every question on this Theory of Computation (BSc CSIT, CSC257) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Theory of Computation (BSc CSIT, CSC257) 2075 paper?

The BSc CSIT (TU) Theory of Computation (CSC257) 2075 paper has 12 questions, carries 60 full marks, and is meant to be completed in 180 minutes.

Q: Is practising this Theory of Computation (BSc CSIT, CSC257) past paper free?

Yes — reading and attempting this Theory of Computation (BSc CSIT, CSC257) past paper on Kekkei is completely free.

Question

1. Long answer, 10 marks

Define context-free grammar (CFG). Convert the following CFG into an equivalent grammar in Chomsky Normal Form (CNF): S -> ASA | aB, A -> B | S, B -> b | epsilon.

Answer 1

Context-Free Grammar (CFG)

A context-free grammar is a 4-tuple $G = (V, T, P, S)$ where:

$V$ = finite set of variables (non-terminals)
$T$ = finite set of terminals, with $V \cap T = \varnothing$
$P$ = finite set of productions of the form $A \to \alpha$ , where $A \in V$ and $\alpha \in (V \cup T)^*$
$S \in V$ = the start symbol

It is context-free because the left side of every production is a single variable, so a variable can be replaced regardless of context.

Converting to Chomsky Normal Form (CNF)

A grammar is in CNF if every production has the form $A \to BC$ (two variables) or $A \to a$ (single terminal). Given grammar:

$$S \to ASA \mid aB, \quad A \to B \mid S, \quad B \to b \mid \varepsilon$$

Step 1 — Add a new start symbol. Introduce $S_0 \to S$ so the start symbol never appears on a right side.

$$S_0 \to S,\quad S \to ASA \mid aB,\quad A \to B \mid S,\quad B \to b \mid \varepsilon$$

Step 2 — Remove $\varepsilon$ -productions. Nullable variables: $B$ (from $B \to \varepsilon$ ), and $A$ (since $A \to B$ ). Remove $B \to \varepsilon$ and add all combinations omitting nullable symbols:

$S \to ASA$ gives $S \to ASA \mid SA \mid AS \mid S$ (dropping one or both $A$ 's).
$S \to aB$ gives $S \to aB \mid a$ .
$A \to B$ stays; omitting the nullable $B$ would only give $A \to \varepsilon$, which is discarded because $A$ is not the start symbol.

Resulting set:

$$S_0 \to S,\; S \to ASA \mid SA \mid AS \mid S \mid aB \mid a,\; A \to B \mid S,\; B \to b$$

Step 3 — Remove unit productions. Units are $S_0 \to S,\ S \to S,\ A \to B,\ A \to S$ . Drop $S \to S$ (trivial). Substitute the bodies:

$S_0 \to ASA \mid SA \mid AS \mid aB \mid a$
$S \to ASA \mid SA \mid AS \mid aB \mid a$
$A \to b$ (from $A \to B \to b$ ) and $A \to ASA \mid SA \mid AS \mid aB \mid a$ (from $A \to S$ )
$B \to b$

Step 4 — Replace terminals in long bodies. Introduce $T_a \to a$ and replace $a$ in $aB$ by $T_a$ , giving bodies $T_aB$ .

Step 5 — Break bodies longer than 2. For $ASA$ , introduce $A_1 \to SA$ , so $ASA \to A\,A_1$ .

Final CNF Grammar

$$\begin{aligned} S_0 &\to AA_1 \mid SA \mid AS \mid T_aB \mid a \\ S &\to AA_1 \mid SA \mid AS \mid T_aB \mid a \\ A &\to AA_1 \mid SA \mid AS \mid T_aB \mid a \mid b \\ A_1 &\to SA \\ B &\to b \\ T_a &\to a \end{aligned}$$

Every production now has the form $A \to BC$ or $A \to a$ , so the grammar is in Chomsky Normal Form and generates the same language as the original.
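The final grammar can be sanity-checked mechanically. The following is a minimal Python sketch, not part of the original solution: it encodes the final CNF grammar as a dictionary (the names `CNF`, `is_cnf`, and `cyk` are illustrative, and `A1` and `Ta` stand for $A_1$ and $T_a$), checks the shape of every production, and uses the standard CYK algorithm to test membership of a few short strings.

```python
# Final CNF grammar from the answer above; "A1" and "Ta" stand for A_1 and T_a.
CNF = {
    "S0": [("A", "A1"), ("S", "A"), ("A", "S"), ("Ta", "B"), ("a",)],
    "S":  [("A", "A1"), ("S", "A"), ("A", "S"), ("Ta", "B"), ("a",)],
    "A":  [("A", "A1"), ("S", "A"), ("A", "S"), ("Ta", "B"), ("a",), ("b",)],
    "A1": [("S", "A")],
    "B":  [("b",)],
    "Ta": [("a",)],
}


def is_cnf(grammar):
    """True if every body is either two variables or a single terminal."""
    for bodies in grammar.values():
        for body in bodies:
            two_vars = len(body) == 2 and all(s in grammar for s in body)
            one_term = len(body) == 1 and body[0] not in grammar
            if not (two_vars or one_term):
                return False
    return True


def cyk(grammar, start, word):
    """CYK membership test for a grammar in CNF without epsilon-productions."""
    n = len(word)
    if n == 0:
        return False
    # table[i][l] holds the variables that derive word[i:i+l]
    table = [[set() for _ in range(n + 1)] for _ in range(n)]
    for i, ch in enumerate(word):
        for head, bodies in grammar.items():
            if (ch,) in bodies:
                table[i][1].add(head)
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            for split in range(1, length):
                left = table[i][split]
                right = table[i + split][length - split]
                for head, bodies in grammar.items():
                    if any(len(b) == 2 and b[0] in left and b[1] in right
                           for b in bodies):
                        table[i][length].add(head)
    return start in table[0][n]
```

For instance, `cyk(CNF, "S0", "bab")` corresponds to the original derivation $S \Rightarrow ASA$ with both $A$'s producing $b$ and the inner $S$ producing $a$, while a string with no $a$ such as `"bb"` cannot be derived.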

Answer 2

Pushdown Automaton (PDA)

A Pushdown Automaton is a finite automaton augmented with an auxiliary stack (LIFO) memory, which gives it more power than a finite automaton and lets it recognize context-free languages. Formally it is a 7-tuple:

$$M = (Q, \Sigma, \Gamma, \delta, q_0, Z_0, F)$$

$Q$ = finite set of states
$\Sigma$ = input alphabet
$\Gamma$ = stack alphabet
$\delta: Q \times (\Sigma \cup \{\varepsilon\}) \times \Gamma \to$ finite subsets of $Q \times \Gamma^*$ = transition function
$q_0$ = start state, $Z_0$ = initial stack symbol, $F \subseteq Q$ = accepting states

PDA for $L = \{a^n b^n \mid n \ge 1\}$

Idea: push a marker for each $a$ , then pop one for each $b$ ; accept when input ends and the stack is back to its bottom.

Let $M = (\{q_0, q_1, q_2\}, \{a,b\}, \{Z_0, A\}, \delta, q_0, Z_0, \{q_2\})$ (acceptance by final state). Transitions:

#	$\delta$	Meaning
1	$\delta(q_0, a, Z_0) = \{(q_0, AZ_0)\}$	push first $A$
2	$\delta(q_0, a, A) = \{(q_0, AA)\}$	push more $A$'s
3	$\delta(q_0, b, A) = \{(q_1, \varepsilon)\}$	first $b$: pop $A$
4	$\delta(q_1, b, A) = \{(q_1, \varepsilon)\}$	pop $A$ per $b$
5	$\delta(q_1, \varepsilon, Z_0) = \{(q_2, Z_0)\}$	stack empty of $A$'s, accept

Trace for the string `aabb`

ID notation: $(\text{state}, \text{remaining input}, \text{stack})$ , stack top on the left.

$$\begin{aligned} (q_0,\ aabb,\ Z_0) &\vdash (q_0,\ abb,\ AZ_0) && \text{rule 1 (read } a)\\ &\vdash (q_0,\ bb,\ AAZ_0) && \text{rule 2 (read } a)\\ &\vdash (q_1,\ b,\ AZ_0) && \text{rule 3 (read } b\text{, pop)}\\ &\vdash (q_1,\ \varepsilon,\ Z_0) && \text{rule 4 (read } b\text{, pop)}\\ &\vdash (q_2,\ \varepsilon,\ Z_0) && \text{rule 5 (}\varepsilon\text{-move)} \end{aligned}$$

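The input is fully consumed and the machine is in accepting state $q_2$, so aabb is accepted. In general, $M$ pushes one $A$ for each $a$ and pops one $A$ for each $b$, no $a$ can be read after the first $b$, and rule 5 applies only once every $A$ has been popped. Hence $M$ accepts exactly $L = \{a^n b^n \mid n \ge 1\}$.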
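The five transitions can also be checked by simulation. This is a minimal Python sketch under a few assumptions that are not in the original answer: the bottom marker $Z_0$ is written as the single character `Z`, the empty string `""` stands for an $\varepsilon$-move, and the names `DELTA`, `FINAL`, and `accepts` are illustrative. It explores instantaneous descriptions breadth-first and accepts by final state.

```python
from collections import deque

# Transition table of M. Stack strings have the top of the stack on the left.
DELTA = {
    ("q0", "a", "Z"): [("q0", "AZ")],   # rule 1
    ("q0", "a", "A"): [("q0", "AA")],   # rule 2
    ("q0", "b", "A"): [("q1", "")],     # rule 3
    ("q1", "b", "A"): [("q1", "")],     # rule 4
    ("q1", "", "Z"): [("q2", "Z")],     # rule 5 (epsilon-move)
}
FINAL = {"q2"}


def accepts(word):
    """Acceptance by final state: input consumed and state in FINAL."""
    start = ("q0", word, "Z")
    seen = {start}
    queue = deque([start])
    while queue:
        state, rest, stack = queue.popleft()
        if not rest and state in FINAL:
            return True
        if not stack:
            continue  # no move is possible on an empty stack
        top, below = stack[0], stack[1:]
        successors = []
        if rest:  # moves that read one input symbol
            for nxt, push in DELTA.get((state, rest[0], top), []):
                successors.append((nxt, rest[1:], push + below))
        for nxt, push in DELTA.get((state, "", top), []):  # epsilon-moves
            successors.append((nxt, rest, push + below))
        for config in successors:
            if config not in seen:
                seen.add(config)
                queue.append(config)
    return False
```

Calling `accepts("aabb")` follows the same five configurations as the trace above; strings such as `"aab"` or `"abab"` get stuck with unread input or leftover $A$'s and are rejected.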

Answer 3

Formal Definition of a Turing Machine

A Turing Machine (TM) is a 7-tuple:

$$M = (Q, \Sigma, \Gamma, \delta, q_0, B, F)$$

$Q$ = finite set of states
$\Sigma$ = input alphabet (does not contain the blank)
$\Gamma$ = tape alphabet, $\Sigma \subseteq \Gamma$ , with blank $B \in \Gamma$
$\delta: Q \times \Gamma \to Q \times \Gamma \times \{L, R\}$ = transition function (write a symbol, move head Left or Right); $\delta$ may be partial, and the machine halts when no transition is defined
$q_0 \in Q$ = start state, $B$ = blank symbol, $F \subseteq Q$ = accepting (halting) states

TM for $L = \{a^n b^n c^n \mid n \ge 1\}$

Strategy: repeatedly mark the leftmost unmarked $a$ as $X$ , the leftmost $b$ as $Y$ , the leftmost $c$ as $Z$ , then return to the left and repeat. This matches one $a$ , one $b$ , one $c$ per pass, guaranteeing equal counts in correct order. Accept when only $X, Y, Z$ remain.

States: $q_0$ (find $a$ ), $q_1$ (scan right for $b$ ), $q_2$ (scan right for $c$ ), $q_3$ (return left), $q_4$ (verify all marked), $q_{acc}$ .

State	Read	Write	Move	Next	Action
$q_0$	$a$	$X$	R	$q_1$	mark an $a$
$q_0$	$Y$	$Y$	R	$q_4$	no $a$ left, verify
$q_1$	$a,Y$	same	R	$q_1$	skip to a $b$
$q_1$	$b$	$Y$	R	$q_2$	mark a $b$
$q_2$	$b,Z$	same	R	$q_2$	skip to a $c$
$q_2$	$c$	$Z$	L	$q_3$	mark a $c$
$q_3$	$a,b,Y,Z$	same	L	$q_3$	move left
$q_3$	$X$	$X$	R	$q_0$	back at start, repeat
$q_4$	$Y,Z$	same	R	$q_4$	ensure no $a,b,c$ remain
$q_4$	$B$	$B$	R	$q_{acc}$	accept

If at any point the expected symbol is missing (e.g. a $b$ before all $a$ 's are matched, or leftover $a/b/c$ ), no transition applies and the machine halts and rejects.

Transition Diagram (described)

Nodes $q_0 \to q_1 \to q_2 \to q_3 \to q_0$ form a loop: each cycle converts one $a\to X$ , one $b\to Y$ , one $c\to Z$ . The arc $q_0 \to q_4 \to q_{acc}$ (taken when the first remaining symbol is $Y$ and the rest are only $Y,Z,B$ ) leads to the accepting state. Self-loops on $q_1, q_2, q_3$ skip over already-marked or not-yet-needed symbols.

Working on `aabbcc`

Pass 1: $\underline{a}abbcc \Rightarrow XaYbZc \dots$ — first $a,b,c$ marked. Pass 2: remaining $a,b,c$ marked as $X,Y,Z$ . Tape becomes $XXYYZZ$ ; in $q_4$ the head scans only $Y,Z$ then a blank, reaching $q_{acc}$ . Thus the string is accepted.
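The transition table can be exercised with a small simulator. This is a minimal Python sketch, assuming the table exactly as given above; the names `build_delta` and `run`, the encoding of moves as `+1`/`-1`, and the step limit are illustrative choices, not part of the original answer. A missing transition is treated as halt-and-reject.

```python
BLANK = "B"


def build_delta():
    """Transition table: (state, read) -> (next state, write, head move)."""
    d = {}
    d[("q0", "a")] = ("q1", "X", +1)          # mark an a
    d[("q0", "Y")] = ("q4", "Y", +1)          # no a left, verify
    for s in "aY":
        d[("q1", s)] = ("q1", s, +1)          # skip to a b
    d[("q1", "b")] = ("q2", "Y", +1)          # mark a b
    for s in "bZ":
        d[("q2", s)] = ("q2", s, +1)          # skip to a c
    d[("q2", "c")] = ("q3", "Z", -1)          # mark a c
    for s in "abYZ":
        d[("q3", s)] = ("q3", s, -1)          # move left
    d[("q3", "X")] = ("q0", "X", +1)          # back at start, repeat
    for s in "YZ":
        d[("q4", s)] = ("q4", s, +1)          # only Y and Z may remain
    d[("q4", BLANK)] = ("q_acc", BLANK, +1)   # accept
    return d


def run(word, max_steps=10_000):
    """Return True if the machine reaches q_acc, False if it gets stuck."""
    delta = build_delta()
    tape = dict(enumerate(word))  # unwritten cells read as BLANK
    state, head = "q0", 0
    for _ in range(max_steps):
        if state == "q_acc":
            return True
        key = (state, tape.get(head, BLANK))
        if key not in delta:
            return False  # no applicable transition: halt and reject
        state, tape[head], move = delta[key]
        head += move
    raise RuntimeError("step limit reached")
```

Running `run("aabbcc")` performs the two passes described above; inputs such as `"aabbc"` or `"abcabc"` reach a configuration with no applicable transition and are rejected.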

The language $L$ is not context-free (this follows from the pumping lemma for context-free languages), so no PDA accepts it. It is context-sensitive and therefore recursive: the TM above halts on every input, so it decides $L$ rather than merely recognizing it.

Answer 4

We need strings over $\{0,1\}$ containing at least one 0 and at least one 1. A clean regular expression is:

$$(0+1)^*\,0\,(0+1)^*\,1\,(0+1)^* \;+\; (0+1)^*\,1\,(0+1)^*\,0\,(0+1)^*$$

The first term covers strings where some 0 appears before some 1; the second covers strings where a 1 appears before a 0. Their union therefore guarantees the presence of both symbols.

(Equivalently, in extended notation: all strings except those with no 0 or no 1, i.e. $(0+1)^* \setminus (1^* + 0^*)$ .)
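The expression can be cross-checked against the plain-language condition. This is a minimal Python sketch using the standard `re` module, with the union operator $+$ written as `|`; the helper names `matches`, `has_both`, and `agree_up_to` are illustrative. It compares the two on every binary string up to a chosen length.

```python
import re
from itertools import product

# The regular expression from the answer, with "+" (union) written as "|".
PATTERN = re.compile(r"(0|1)*0(0|1)*1(0|1)*|(0|1)*1(0|1)*0(0|1)*")


def matches(s):
    """True if the whole string s is described by the regular expression."""
    return PATTERN.fullmatch(s) is not None


def has_both(s):
    """Direct statement of the requirement: at least one 0 and one 1."""
    return "0" in s and "1" in s


def agree_up_to(max_len):
    """Compare the two definitions on every binary string up to max_len."""
    for n in range(max_len + 1):
        for chars in product("01", repeat=n):
            s = "".join(chars)
            if matches(s) != has_both(s):
                return False
    return True
```

An exhaustive comparison on short strings is evidence rather than proof, but it quickly catches mistakes such as forgetting one of the two orderings.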

Answer 5

Arden's Theorem

Statement: Let $P$ and $Q$ be two regular expressions over an alphabet $\Sigma$ . If $P$ does not contain the empty string $\varepsilon$ (i.e. $\varepsilon \notin P$ ), then the equation

$$R = Q + RP$$

has a unique solution:

$$R = QP^*$$

Use in Obtaining a Regular Expression from a Finite Automaton

For a finite automaton, write one equation per state describing the strings that reach that state. For each state $q$ , its equation is the sum, over all incoming transitions, of (source-state expression $\cdot$ input symbol); the start state's equation also includes $\varepsilon$ .

This yields a system of simultaneous equations of the form $R = Q + RP$ . Using Arden's theorem, we repeatedly substitute and solve each self-referential equation by replacing $R = Q + RP$ with $R = QP^*$ , eliminating variables one by one. The expression finally obtained for the final (accepting) state is the regular expression denoting the language accepted by the automaton.

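Why the $\varepsilon \notin P$ condition matters: it ensures uniqueness. If $P$ contained $\varepsilon$, then $R = QP^*$ would still be the smallest solution, but other solutions could exist; for example, with $P = \varepsilon$ the equation becomes $R = Q + R$, which every $R$ containing $Q$ satisfies.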
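As a small worked illustration, consider a hypothetical two-state automaton (not taken from the question paper): $q_1$ is the start state with a self-loop on $a$, there is a transition from $q_1$ to $q_2$ on $b$, and $q_2$ is the only final state with a self-loop on $b$. Writing one equation per state and applying Arden's theorem twice gives:

```latex
\begin{aligned}
q_1 &= \varepsilon + q_1 a \\
q_2 &= q_1 b + q_2 b \\[4pt]
q_1 &= \varepsilon\, a^* = a^* && \text{(Arden with } Q = \varepsilon,\ P = a) \\
q_2 &= a^* b + q_2 b = a^* b\, b^* && \text{(Arden with } Q = a^* b,\ P = b)
\end{aligned}
```

The expression for the final state, $a^* b\, b^*$, is the regular expression for this automaton. In both applications $P$ is a single symbol, so the condition $\varepsilon \notin P$ holds.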

Answer 6

Ambiguous Grammar

A context-free grammar $G$ is said to be ambiguous if there exists at least one string $w \in L(G)$ that has two or more distinct derivation trees (equivalently, two distinct leftmost or two distinct rightmost derivations). Ambiguity is a property of the grammar, not of the language.

Example

Consider the expression grammar:

$$E \to E + E \mid E * E \mid id$$

Take the string $id + id * id$ . It has two different leftmost derivations / parse trees:

Parse 1 (treats $+$ as the top operator):

$$E \Rightarrow E + E \Rightarrow id + E \Rightarrow id + E * E \Rightarrow id + id * E \Rightarrow id + id * id$$

Parse 2 (treats $*$ as the top operator):

$$E \Rightarrow E * E \Rightarrow E + E * E \Rightarrow id + E * E \Rightarrow id + id * E \Rightarrow id + id * id$$

Since the same string $id + id * id$ admits two distinct parse trees (one grouping as $id + (id * id)$ , the other as $(id + id) * id$ ), the grammar is ambiguous.
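The number of parse trees can be counted directly. This is a minimal Python sketch for this particular grammar only, assuming the input is already split into tokens; the function name `count_parse_trees` is illustrative. Every tree for a string with more than one token has an operator at the root, so the count is a sum over the possible root operators.

```python
from functools import lru_cache


def count_parse_trees(tokens):
    """Count parse trees of a token list under E -> E + E | E * E | id."""
    tokens = tuple(tokens)

    @lru_cache(maxsize=None)
    def count(i, j):
        # Number of parse trees deriving tokens[i:j] from E.
        if j - i == 1:
            return 1 if tokens[i] == "id" else 0
        total = 0
        for k in range(i + 1, j - 1):
            if tokens[k] in ("+", "*"):  # operator at the root of the tree
                total += count(i, k) * count(k + 1, j)
        return total

    return count(0, len(tokens))
```

A result greater than 1 for any string is enough to show the grammar is ambiguous; `count_parse_trees("id + id * id".split())` finds the two groupings described above.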

Answer 7

Eliminating Left Recursion from $A \to Aa \mid b$

The production $A \to Aa$ is immediately left-recursive (the variable $A$ appears as the leftmost symbol of its own body).

General rule: a rule of the form

$$A \to A\alpha \mid \beta$$

(where $\beta$ does not start with $A$) is rewritten using a new variable $A'$ as:

$$A \to \beta A', \qquad A' \to \alpha A' \mid \varepsilon$$

Applying it here with $\alpha = a$ and $\beta = b$:

$$\boxed{A \to b\,A', \qquad A' \to a\,A' \mid \varepsilon}$$

This grammar is right-recursive and generates exactly the same language $\{b a^n \mid n \ge 0\}$ (i.e. $b$ followed by zero or more $a$ 's), but is now suitable for top-down (e.g. recursive-descent / LL) parsing.
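The general rule translates into a few lines of code. This is a minimal Python sketch handling immediate left recursion only, with production bodies represented as lists of symbols and the empty list standing for $\varepsilon$; the function name and this representation are assumptions made for illustration.

```python
def eliminate_immediate_left_recursion(head, bodies):
    """Rewrite A -> A alpha | beta as A -> beta A', A' -> alpha A' | epsilon."""
    new_head = head + "'"
    # Bodies of the form A alpha: keep only alpha.
    alphas = [body[1:] for body in bodies if body and body[0] == head]
    # Bodies that do not start with A: the betas.
    betas = [body for body in bodies if not body or body[0] != head]
    if not alphas:
        return {head: bodies}  # nothing to do
    return {
        head: [beta + [new_head] for beta in betas],
        new_head: [alpha + [new_head] for alpha in alphas] + [[]],
    }
```

For the grammar in this question, `eliminate_immediate_left_recursion("A", [["A", "a"], ["b"]])` returns the productions $A \to bA'$ and $A' \to aA' \mid \varepsilon$ shown in the box above.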

Answer 8

Greibach Normal Form (GNF)

A CFG is in GNF if every production has the form

$$A \to a\,\alpha$$

where $a$ is a single terminal and $\alpha \in V^*$ is a (possibly empty) string of variables. Thus every body begins with exactly one terminal followed only by non-terminals.

Conversion Procedure (CFG → GNF)

Pre-requisite: first remove $\varepsilon$-productions, unit productions, and useless symbols, then convert the grammar to Chomsky Normal Form.

Step 1 — Order the variables. Rename the variables $A_1, A_2, \dots, A_n$ .

Step 2 — Make productions increasing in index. Modify productions so that every rule $A_i \to A_j \gamma$ has $j > i$ :

If $j < i$ , substitute every production of $A_j$ into the body of $A_i$ , then repeat.
If $j = i$ , it is left recursion; eliminate it using a new variable $B_i$ :

$$A_i \to \beta \mid \beta B_i, \qquad B_i \to \alpha \mid \alpha B_i$$

(where $A_i \to A_i\alpha \mid \beta$ ).

Step 3 — Make leading symbols terminals (back-substitution). Now $A_n$ 's bodies already start with a terminal. Working backwards from $A_n$ to $A_1$ , substitute the (already-GNF) productions of $A_j$ into any rule whose body starts with $A_j$ , so that every leading symbol becomes a terminal.

Step 4 — Fix the new variables $B_i$ . Apply the same back-substitution to the introduced $B_i$ variables so their bodies also begin with a terminal.

The result is an equivalent grammar in GNF. GNF is important because it guarantees each derivation step consumes exactly one input terminal, which directly yields an equivalent PDA and bounds derivation length to $|w|$ steps for a string $w$ .
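As a short worked illustration of the procedure, take the small CNF grammar $A_1 \to A_1 A_2 \mid a$, $A_2 \to b$ (a hypothetical example chosen for brevity, not part of the question paper). It generates $a b^*$.

```latex
\begin{aligned}
&\text{Given:} && A_1 \to A_1 A_2 \mid a, \qquad A_2 \to b \\
&\text{Step 2 } (j = i,\ \alpha = A_2,\ \beta = a): && A_1 \to a \mid a B_1, \qquad B_1 \to A_2 \mid A_2 B_1 \\
&\text{Step 3:} && A_2 \to b \ \text{ already starts with a terminal} \\
&\text{Step 4 (substitute } A_2 \text{ into } B_1): && B_1 \to b \mid b B_1 \\
&\text{Result:} && A_1 \to a \mid a B_1, \qquad A_2 \to b, \qquad B_1 \to b \mid b B_1
\end{aligned}
```

Every body now starts with exactly one terminal followed only by variables, and the language is still $a b^*$.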

Answer 9

Instantaneous Description (ID) of a PDA

An instantaneous description captures the complete momentary configuration of a PDA. It is a triple:

$$(q, w, \gamma)$$

$q$ = the current state
$w$ = the remaining unread input string
$\gamma$ = the current stack contents (conventionally written with the top of the stack on the left)

A single move is written with the turnstile relation $\vdash$: if $\delta(q, a, Z)$ contains $(p, \beta)$, where $a \in \Sigma \cup \{\varepsilon\}$, then

$$(q,\ a w,\ Z\gamma) \;\vdash\; (p,\ w,\ \beta\gamma).$$

$\vdash^*$ denotes zero or more moves.

Acceptance by Final State

Starting from $(q_0, w, Z_0)$ , the PDA accepts $w$ by final state if there is a sequence of moves leading to an accepting state, regardless of the stack contents:

$$L(M) = \{\, w \mid (q_0,\ w,\ Z_0) \vdash^* (p,\ \varepsilon,\ \gamma),\ p \in F \,\}$$

i.e. the whole input is consumed and the machine ends in some final state $p \in F$ .

Acceptance by Empty Stack

The PDA accepts $w$ by empty stack if, after consuming all input, the stack becomes completely empty; the state $p$ reached may be any state, so the set $F$ is irrelevant here:

$$N(M) = \{\, w \mid (q_0,\ w,\ Z_0) \vdash^* (p,\ \varepsilon,\ \varepsilon) \text{ for some } p \in Q \,\}$$

Equivalence: the two acceptance modes define the same class of languages (the context-free languages); any PDA accepting by final state can be converted to one accepting by empty stack and vice versa.

Answer 10

Multi-Tape Turing Machine

A multi-tape Turing Machine has $k \ge 2$ tapes, each with its own independent read/write head. The machine reads the symbols under all $k$ heads simultaneously and, in one move, can write a new symbol on each tape and move each head independently Left, Right, or stay (S).

Working / Transition Function

For a $k$ -tape machine the transition function is

$$\delta: Q \times \Gamma^k \to Q \times \Gamma^k \times \{L, R, S\}^k.$$

On a single step the control unit, based on the current state and the $k$ scanned symbols $(a_1, \dots, a_k)$ , does the following at once:

Moves to a new state,
Writes a symbol on each of the $k$ tapes,
Moves each of the $k$ heads independently.

Initially the input is placed on tape 1 and the other tapes hold blanks; the extra tapes serve as scratch / working memory, which often makes algorithms much simpler to express (e.g. copying, counting, or comparing substrings).

Power

A multi-tape TM is no more powerful than a standard single-tape TM: any $k$ -tape TM can be simulated by a single-tape TM (storing the $k$ tapes interleaved on one tape with head-position markers). The simulation costs at most a quadratic slowdown ( $O(T^2)$ steps to simulate $T$ steps), so the class of languages recognized (recursively enumerable) is identical. Multi-tape machines are mainly a convenience that simplifies design and improves time efficiency.
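To make the convenience concrete, here is a minimal Python sketch of a hypothetical two-tape machine for $\{a^n b^n \mid n \ge 1\}$ (this machine and the names `DELTA` and `run` are illustrative, not part of the original answer). It copies the $a$'s onto tape 2, then matches each $b$ on tape 1 against one $a$ on tape 2, so a single left-to-right sweep of the input suffices.

```python
BLANK = "B"
R, L, S = +1, -1, 0  # head moves: right, left, stay

# (state, (sym1, sym2)) -> (next state, (write1, write2), (move1, move2))
DELTA = {
    # copy each a from tape 1 onto tape 2
    ("copy", ("a", BLANK)): ("copy", ("a", "a"), (R, R)),
    # first b: step tape-2 head back onto the last copied a
    ("copy", ("b", BLANK)): ("match", ("b", BLANK), (S, L)),
    # cancel one b on tape 1 against one a on tape 2
    ("match", ("b", "a")): ("match", ("b", BLANK), (R, L)),
    # both tapes exhausted at the same time: accept
    ("match", (BLANK, BLANK)): ("acc", (BLANK, BLANK), (S, S)),
}


def run(word, max_steps=10_000):
    """Simulate the two-tape machine; a missing transition means reject."""
    tapes = [dict(enumerate(word)), {}]  # input on tape 1, tape 2 blank
    heads = [0, 0]
    state = "copy"
    for _ in range(max_steps):
        if state == "acc":
            return True
        scanned = tuple(t.get(h, BLANK) for t, h in zip(tapes, heads))
        if (state, scanned) not in DELTA:
            return False
        state, writes, moves = DELTA[(state, scanned)]
        for i in range(2):  # write and move on each tape independently
            tapes[i][heads[i]] = writes[i]
            heads[i] += moves[i]
    raise RuntimeError("step limit reached")
```

The second tape acts as a counter, so the head on tape 1 never moves left. A single-tape machine that marks symbols in pairs has to shuttle back and forth across the input instead.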

Answer 11

Universal Turing Machine (UTM)

A Universal Turing Machine is a single Turing Machine $U$ that can simulate the behaviour of any other Turing Machine on any input. It takes as input an encoding $\langle M \rangle$ of an arbitrary TM $M$ together with an encoded input string $w$ , written as $\langle M, w \rangle$ , and then:

simulates the computation of $M$ on $w$ step by step, and
accepts/halts exactly when $M$ would accept/halt on $w$ (and loops if $M$ loops).

Formally, $U$ on $\langle M, w \rangle$ produces the same result that $M$ produces on $w$ . $U$ keeps three pieces of information on its tape: the encoded description of $M$ , the simulated tape contents of $M$ , and $M$ 's current state.

Significance

Stored-program concept: the UTM shows that a single fixed machine can run any algorithm if the program (the description of $M$ ) is supplied as data. This is the theoretical foundation of the modern general-purpose, stored-program computer (von Neumann architecture).
Existence of a universal model: it proves computation is programmable — one machine, many tasks — rather than needing a new machine per problem.
Undecidability: the UTM is central to proving the Halting Problem undecidable; the universal/diagonal construction shows there is no algorithm to decide whether an arbitrary $M$ halts on $w$ .
It establishes the notion of a recursively enumerable universal language $L_u = \{\langle M, w\rangle \mid M \text{ accepts } w\}$ , which is RE but not recursive.

Answer 12

Recursive vs Recursively Enumerable Languages

Both classes are defined in terms of Turing Machines, but differ in whether the TM is guaranteed to halt.

Aspect	Recursive (Decidable)	Recursively Enumerable (RE)
TM behaviour	Some TM for $L$ halts on every input: it halts and accepts strings in $L$, and halts and rejects strings not in $L$.	Some TM halts and accepts every string in $L$, but on strings not in $L$ it may either reject or run forever (loop).
Decidability	The language is decidable; membership can always be decided.	The language is only semi-decidable (recognizable); membership can be confirmed but non-membership may never be confirmed.
Also called	Decidable / Turing-decidable.	Turing-recognizable / Type-0.
Closure under complement	Closed — if $L$ is recursive, so is $\overline{L}$ .	Not closed under complement in general.
Relationship	Every recursive language is RE.	RE is a strict superset: there exist RE languages that are not recursive (e.g. the universal language $L_u$ , the Halting language).

Key theorem: $L$ is recursive iff both $L$ and its complement $\overline{L}$ are recursively enumerable.

Example: $\{a^n b^n c^n \mid n \ge 1\}$ is recursive (decidable). The Halting Problem language is recursively enumerable but not recursive.
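The "if" direction of the key theorem can be illustrated by running two recognizers in alternation. This is a minimal Python sketch with a deliberately simple toy language (perfect squares among the non-negative integers), chosen only because both recognizers are easy to write; the generator-based encoding and all function names are assumptions for illustration. Each recognizer alone would search forever on a non-member, but exactly one of them eventually succeeds, so the combined procedure always halts.

```python
from itertools import count


def recognize_square(n):
    """Semi-decider for L: yields True once it finds k with k*k == n."""
    for k in count():
        if k * k == n:
            yield True
            return
        yield False  # one step done, no answer yet


def recognize_non_square(n):
    """Semi-decider for the complement: finds k with k*k < n < (k+1)^2."""
    for k in count():
        if k * k < n < (k + 1) * (k + 1):
            yield True
            return
        yield False


def decide(n, rec_l, rec_co_l):
    """Alternate one step of each recognizer; exactly one of them succeeds."""
    in_l, in_co_l = rec_l(n), rec_co_l(n)
    while True:
        if next(in_l):
            return True   # the recognizer for L accepted
        if next(in_co_l):
            return False  # the recognizer for the complement accepted
```

For a language such as the Halting language no recognizer for the complement exists, which is why this construction cannot turn its recognizer into a decider.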

Level	BSc CSIT (TU)
Stream	Science
Subject	Theory of Computation (BSc CSIT, CSC257)
Year	2075 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions
