BSc CSIT (TU) Science Image Processing (BSc CSIT, CSC413) Question Paper 2075 Nepal
This is the official BSc CSIT (TU) (Science stream) Image Processing (BSc CSIT, CSC413) question paper for 2075, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Image Processing (BSc CSIT, CSC413) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) exam or solving previous years' question papers, this 2075 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain the Fourier transform and its application in digital image processing. Discuss its important properties with derivations.
Fourier Transform in Digital Image Processing
Definition
The Fourier Transform (FT) decomposes an image (a spatial-domain signal) into its constituent sinusoidal frequency components. It maps an image from the spatial domain to the frequency domain .
For a digital image of size , the 2-D Discrete Fourier Transform (DFT) is:
and the inverse DFT is:
Each value is complex: , giving a magnitude (spectrum) and a phase .
Applications in Image Processing
- Frequency-domain filtering — low-pass (smoothing/blurring), high-pass (sharpening), band-pass and notch filtering.
- Image enhancement by suppressing/boosting selected frequencies.
- Image restoration — inverse and Wiener filtering for deblurring.
- Image compression (basis for transform coding; DCT used in JPEG).
- Texture analysis and pattern recognition via the spectrum.
- Fast convolution — convolution in spatial domain becomes multiplication in frequency domain.
Important Properties (with derivations)
1. Separability. The 2-D DFT separates into two successive 1-D DFTs (rows then columns):
This reduces computation from to , and allows the FFT.
2. Linearity. , directly from the linearity of the summation.
3. Translation (shift). Shifting in space introduces a phase factor:
and multiplying in space by an exponential shifts the spectrum. Setting gives , used to center the spectrum.
4. Periodicity. The DFT and IDFT are periodic with periods and : , since .
5. Conjugate symmetry. For real : , hence is symmetric about the origin.
6. Rotation. Rotating the image by an angle rotates its spectrum by the same angle (shown using polar coordinates ).
7. Convolution theorem. . Proof sketch: take the DFT of the convolution sum and apply the shift property to obtain the product of transforms. This is the foundation of frequency-domain filtering.
8. Average value. , i.e. the DC term equals the average intensity scaled by .
What is spatial filtering? Explain low-pass, high-pass, and band-pass filtering with their masks and effects on the image.
Spatial Filtering
Spatial filtering processes an image directly on its pixels by moving a small mask (kernel/window) over the image and replacing each pixel with a function of its neighbourhood. For a linear filter this is correlation/convolution:
where are the mask coefficients. The behaviour depends entirely on the mask.
1. Low-Pass (Smoothing) Filtering
Passes low frequencies, attenuates high frequencies → blurs / smooths the image and reduces noise. All coefficients are positive and the mask sums to 1 (averaging). A averaging mask:
Effect: removes fine detail and noise but blurs edges; useful before downsampling or for noise reduction.
2. High-Pass (Sharpening) Filtering
Passes high frequencies (edges, detail), attenuates low frequencies → sharpens / enhances edges. The mask has a positive centre, negative surround, and coefficients sum to 0:
Effect: highlights edges and fine detail; flat regions become near zero. Adding the high-pass result back to the original gives high-boost / unsharp masking.
3. Band-Pass Filtering
Passes a range (band) of frequencies between two cut-offs while attenuating both very low and very high frequencies. It is obtained as the difference of two low-pass filters (e.g. Difference of Gaussians):
Effect: retains features of a specific scale/size while removing slowly varying background and high-frequency noise; used in texture analysis and feature/blob detection.
Summary Table
| Filter | Mask sum | Passes | Effect |
|---|---|---|---|
| Low-pass | low freq. | smoothing, noise removal, blur | |
| High-pass | high freq. | edge/detail sharpening | |
| Band-pass | mid band | selects features of a given scale |
Explain image compression. Discuss the JPEG compression algorithm step by step.
Image Compression
Image compression reduces the number of bits required to represent an image by removing redundancy, lowering storage and transmission cost.
Types of redundancy: coding redundancy (inefficient code words), spatial/inter-pixel redundancy (neighbouring pixels are correlated), and psychovisual redundancy (information the human eye cannot perceive).
Categories: Lossless (e.g. RLE, Huffman, LZW — exact reconstruction) and Lossy (e.g. JPEG — higher compression with acceptable quality loss).
JPEG Compression Algorithm (step by step)
JPEG is a lossy, transform-based scheme operating on blocks.
1. Colour transform & sub-sampling. Convert RGB → YCbCr. Because the eye is less sensitive to colour, the chrominance channels (Cb, Cr) are typically sub-sampled (e.g. 4:2:0).
2. Block partition. Divide each component into non-overlapping blocks.
3. Level shift & 2-D DCT. Subtract 128 from each pixel, then apply the Forward Discrete Cosine Transform to each block:
with , otherwise . This concentrates energy in low-frequency coefficients (top-left, DC term).
4. Quantization (lossy step). Divide each DCT coefficient by a value from a quantization table and round:
High-frequency coefficients (large values) become zero — this is where compression and quality loss occur. A quality factor scales .
5. Zig-zag scanning. Read the block in a zig-zag order to group the (now mostly zero) high-frequency coefficients into long runs of zeros, producing a 1-D sequence.
6. Entropy coding.
- DC coefficient: coded as the difference (DPCM) from the previous block's DC.
- AC coefficients: run-length encoded (run of zeros, value) then Huffman (or arithmetic) coded.
The result is the compressed bit-stream. Decompression reverses these steps: entropy decode → de-zig-zag → de-quantize () → inverse DCT → level shift → YCbCr→RGB.
Diagram (in words)
Input → Color/sub-sample → 8×8 blocks → DCT → Quantize → Zig-zag → RLE + Huffman → Bit-stream.
Section B: Short Answer Questions
Attempt any EIGHT questions.
What is the relationship between pixels (neighbours and connectivity)?
Relationship Between Pixels: Neighbours and Connectivity
Neighbours of a pixel at :
- 4-neighbours : the 4 horizontal/vertical pixels .
- Diagonal neighbours : the 4 diagonal pixels .
- 8-neighbours : all 8 surrounding pixels.
Connectivity decides whether two pixels belong to the same object. Let be the set of intensity values defining connectivity (e.g. for a binary image). Two pixels (with values in ) are:
- 4-connected: if .
- 8-connected: if .
- m-connected (mixed): if , or and contains no pixel from . Mixed connectivity removes the ambiguous multiple paths that arise with 8-connectivity.
Connectivity leads to paths, connected components, regions, and boundaries, and is fundamental to labelling and segmentation.
Explain contrast stretching.
Contrast Stretching
Contrast stretching (normalization) is a point (intensity-transformation) enhancement technique that expands the range of intensity levels in a low-contrast image so that it spans the full available range (e.g. –), making the image clearer.
For an input intensity with minimum and maximum , the linear stretch maps it to :
A more general piecewise-linear transformation uses two control points and to control the slope: a steep slope between and increases contrast in that band, while the ends may be clipped.
Special cases:
- If , , at a threshold thresholding (binary image).
- A gentle S-shaped curve brightens mid-tones.
Effect: a dull, low-contrast image (histogram concentrated in a narrow band) becomes one whose histogram is spread across the full range, improving visual detail. Unlike histogram equalization, the mapping here is user-specified/linear rather than derived from the histogram.
What is histogram specification (matching)?
Histogram Specification (Matching)
Histogram specification (matching) is an enhancement technique that transforms an input image so that its histogram matches a specified (target) histogram, rather than the uniform histogram produced by histogram equalization. It gives control to emphasise particular intensity ranges.
Procedure
Let be input intensities with PDF and the desired output intensities with target PDF .
1. Equalize the input image: (CDF of input).
2. Compute the equalizing transform of the target: (CDF of target), with .
3. Map: .
Discrete steps
- Compute the CDF of the input histogram.
- Compute the CDF of the specified histogram.
- For each input level , find the level whose is closest to , and map .
Use: when histogram equalization over-enhances or when matching one image's tonal distribution to a reference image is required.
Differentiate between mean filter and median filter.
Mean Filter vs Median Filter
Both are spatial smoothing filters applied over a neighbourhood (e.g. ), but they differ in operation and behaviour.
| Aspect | Mean (Averaging) Filter | Median Filter |
|---|---|---|
| Type | Linear filter | Non-linear (order-statistic) filter |
| Operation | Replaces pixel with the average of its neighbourhood | Replaces pixel with the median of its neighbourhood |
| Output value | May be a value not present in the image | Always an actual pixel value from the window |
| Edges | Blurs edges | Preserves edges much better |
| Best for | Gaussian / uniform random noise | Salt-and-pepper (impulse) noise |
| Effect of outliers | Outliers strongly affect the average | Outliers are rejected by sorting |
| Cost | Cheap (sum/divide) | Higher (requires sorting each window) |
Example (3×3 window, salt-pepper): values → mean (corrupted), median (correct). This shows the median's robustness to impulse noise.
What is the Discrete Cosine Transform (DCT)?
Discrete Cosine Transform (DCT)
The DCT is a real-valued orthogonal transform that expresses a signal/image as a sum of cosine basis functions of different frequencies. Unlike the DFT it uses only cosines (no imaginary part), and it has strong energy compaction — most of the signal energy is packed into a few low-frequency coefficients — which makes it ideal for compression (used in JPEG and MPEG).
1-D DCT
where and for .
2-D DCT (for images)
Properties / uses
- Real and orthogonal, separable, invertible.
- Energy compaction → excellent for transform coding/compression.
- The term is the DC (average) coefficient.
- Reduces blocking and gives near-optimal decorrelation for highly correlated images, approaching the Karhunen–Loève transform.
Explain the Laplacian operator for edge detection.
Laplacian Operator for Edge Detection
The Laplacian is a second-order derivative operator used to detect edges. It is isotropic (rotation-invariant) and responds to intensity transitions (edges) as zero-crossings, where the second derivative changes sign.
For an image :
Using finite differences the discrete form is:
Common masks
Characteristics
- Produces double edges and detects edges via zero-crossings, giving precise localisation.
- It is very sensitive to noise (because it differentiates twice), so the image is usually smoothed first, e.g. the Laplacian of Gaussian (LoG / Marr–Hildreth) operator.
- Gives no edge direction information (unlike gradient operators).
- For sharpening: (subtract when centre coefficient is negative) enhances edges.
What is the difference between erosion and dilation?
Erosion vs Dilation
Both are fundamental morphological operations using a structuring element (SE) on a (usually binary) image set .
Dilation
Grows / thickens objects:
The SE is placed at each pixel; output is 1 if the SE overlaps any foreground pixel. Effects: enlarges objects, fills small holes and gaps, connects nearby components.
Erosion
Shrinks / thins objects:
Output is 1 only if the SE fits completely within the foreground. Effects: shrinks objects, removes small isolated specks and thin protrusions, breaks thin bridges.
Comparison
| Erosion | Dilation | |
|---|---|---|
| Condition | SE fully inside | SE overlaps |
| Object size | Shrinks / thins | Grows / thickens |
| Holes/gaps | Enlarges | Shrinks / fills |
| Noise | Removes small white specks | Removes small black holes |
| Duality | — |
Note: erosion and dilation are duals with respect to complementation and reflection. Combining them gives opening (erosion then dilation) and closing (dilation then erosion).
Define image degradation model.
Image Degradation Model
The image degradation/restoration model describes how an original (true) image is corrupted to produce the observed degraded image , so that restoration can recover an estimate .
The degradation is modelled as a degradation function (e.g. blur, motion, optical system) acting on , followed by additive noise .
Spatial domain
where is convolution and is the point spread function (PSF) of the degradation.
Frequency domain
Using the convolution theorem:
Restoration applies the inverse/Wiener filter to using a knowledge or estimate of and the noise statistics to obtain , and hence via the inverse transform.
Block diagram (in words)
f(x,y) → [Degradation H] →(+ noise η)→ g(x,y) → [Restoration filter] → f̂(x,y)
Write short notes on the Hadamard transform.
Hadamard Transform — Short Notes
The Hadamard (Walsh–Hadamard) transform is a real, orthogonal image transform whose basis functions take only values +1 and −1 (rectangular/square waves) instead of sinusoids. This makes it computationally very cheap — it needs only additions and subtractions, no multiplications.
Hadamard matrix
The matrix is built recursively (Sylvester construction). The smallest is
It is symmetric and orthogonal: , so the inverse equals the (scaled) transpose.
2-D transform
For an image :
Properties / uses
- Real, orthogonal, separable, energy-compacting (less than DCT).
- The number of sign changes per row is the sequency (analogous to frequency).
- Used in image compression, coding, feature extraction, and fast computation where speed matters more than optimal compaction.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) question paper 2075?
- The full BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2075 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Image Processing (BSc CSIT, CSC413) 2075 paper come with solutions?
- Yes. Every question on this Image Processing (BSc CSIT, CSC413) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2075 paper?
- The BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2075 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Image Processing (BSc CSIT, CSC413) past paper free?
- Yes — reading and attempting this Image Processing (BSc CSIT, CSC413) past paper on Kekkei is completely free.