BSc CSIT (TU) Science Image Processing (BSc CSIT, CSC413) Question Paper 2081 Nepal
This is the official BSc CSIT (TU) (Science stream) Image Processing (BSc CSIT, CSC413) question paper for 2081, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Image Processing (BSc CSIT, CSC413) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) exam or solving previous years' question papers, this 2081 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain digital image processing fundamentals. Discuss image sampling, quantization, and the basic relationships between pixels.
Digital Image Processing Fundamentals
A digital image is a two-dimensional function , where are spatial coordinates and the value of at any point is the intensity (gray level). When , , and the amplitude are all finite and discrete, the image is digital. Each discrete element is a pixel (picture element).
Image Acquisition Pipeline
A continuous scene is converted to a digital image in two steps:
- Sampling – digitizing the spatial coordinates .
- Quantization – digitizing the amplitude (intensity) values.
1. Image Sampling
Sampling divides the continuous image into a grid of points, producing the pixel array:
- More samples → higher spatial resolution → finer detail.
- Insufficient sampling causes aliasing (the Nyquist criterion: sampling rate must be at least twice the highest spatial frequency).
2. Quantization
Each sampled value is mapped to one of discrete intensity levels, where is the number of bits per pixel. For an 8-bit image, levels (–). Too few levels produce false contouring. Storage of an image is bits.
3. Basic Relationships Between Pixels
For a pixel at :
- Neighbors:
- 4-neighbors : .
- Diagonal neighbors : .
- 8-neighbors .
- Adjacency: 4-, 8-, and m-adjacency (mixed adjacency, used to remove path ambiguity).
- Connectivity / Region / Boundary: pixels are connected if adjacent and their values satisfy a similarity criterion ; a maximal connected set forms a region.
- Distance measures between and :
- Euclidean:
- City-block ():
- Chessboard ():
Conclusion
Sampling and quantization determine an image's spatial resolution and intensity resolution respectively, while pixel relationships (neighborhood, adjacency, connectivity, distance) form the basis of region-level operations such as segmentation and morphology.
What is histogram processing? Explain histogram equalization and histogram specification with mathematical formulation and an example.
Histogram Processing
The histogram of a digital image with intensity levels in is the discrete function
where is the -th intensity and is the number of pixels with that intensity. The normalized histogram is , which approximates the probability of occurrence of level . Histogram processing modifies this distribution to improve contrast.
Histogram Equalization
Goal: spread intensities to produce an approximately uniform histogram, maximizing global contrast. The transformation is the cumulative distribution function (CDF):
Example. A 3-bit () image ():
| CDF | round | ||||
|---|---|---|---|---|---|
| 0 | 790 | 0.19 | 0.19 | 1.33 | 1 |
| 1 | 1023 | 0.25 | 0.44 | 3.08 | 3 |
| 2 | 850 | 0.21 | 0.65 | 4.55 | 5 |
| 3 | 656 | 0.16 | 0.81 | 5.67 | 6 |
| 4 | 329 | 0.08 | 0.89 | 6.23 | 6 |
| 5 | 245 | 0.06 | 0.95 | 6.65 | 7 |
| 6 | 122 | 0.03 | 0.98 | 6.86 | 7 |
| 7 | 81 | 0.02 | 1.00 | 7.00 | 7 |
Pixels are remapped using the round column, stretching the originally dark image across the full range.
Histogram Specification (Matching)
Instead of a uniform output, we force the histogram to match a specified target shape . Steps:
- Equalize the input: .
- Equalize the target: .
- For each , find , i.e. the level whose equalized value is closest to .
This gives the mapping .
Equalization vs Specification
| Aspect | Equalization | Specification |
|---|---|---|
| Output histogram | (Approx.) uniform | User-defined shape |
| Control | Automatic | Targeted |
| Use | General contrast enhancement | When a particular tonal distribution is desired |
Explain the Fourier transform in image processing. Discuss frequency-domain filtering using ideal and Butterworth filters with examples.
Fourier Transform in Image Processing
The Fourier transform (FT) decomposes an image into its sinusoidal frequency components, moving from the spatial domain to the frequency domain . Slowly varying regions map to low frequencies; edges and fine detail map to high frequencies.
The 2-D Discrete Fourier Transform (DFT) of an image:
and its inverse:
Frequency-Domain Filtering
Filtering is performed by point-wise multiplication with a transfer function :
General steps: (1) shift origin by multiplying by , (2) compute the DFT, (3) multiply by , (4) inverse DFT, (5) take real part and undo the shift. Let be the distance of from the centred origin and the cutoff.
1. Ideal Low-Pass Filter (ILPF)
It passes all frequencies inside a circle of radius and blocks the rest. Although it gives the sharpest possible cut, its abrupt transition causes ringing artifacts (concentric ripples) because of the sinc nature of its spatial response.
2. Butterworth Low-Pass Filter (BLPF)
where is the filter order. The transition is smooth and controllable:
- Low order → gentle roll-off, little ringing.
- High order → approaches the ideal filter (more ringing).
Examples / Effect
- Low-pass (ILPF/BLPF) → blurring and noise smoothing (removes high frequencies).
- High-pass (obtained as ) → edge sharpening (removes low frequencies). The Butterworth high-pass is .
Comparison
| Filter | Transition | Ringing | Control |
|---|---|---|---|
| Ideal | Abrupt (brick-wall) | Severe | None |
| Butterworth | Smooth | Negligible–moderate (order-dependent) | Via order |
Thus the Butterworth filter is generally preferred in practice because it trades a small amount of sharpness for greatly reduced ringing.
Section B: Short Answer Questions
Attempt any EIGHT questions.
Define resolution and aspect ratio of an image.
Resolution is the amount of detail an image holds, given by its number of pixels, written as (e.g. ). Spatial resolution refers to the smallest discernible detail (pixel density, often dpi), while intensity (gray-level) resolution refers to the number of distinct intensity levels . Higher resolution means finer detail.
Aspect ratio is the proportional relationship between an image's width and height, expressed as (e.g. , ). It must be preserved when resizing to avoid distortion (stretching/squashing).
Explain log transformation and its use.
Log Transformation
The log transformation is a point (intensity) transformation given by
where is the input intensity (), is the output, and is a scaling constant (often to fit the output to ).
Characteristics & Use
- It maps a narrow range of low (dark) input values to a wider range of output values, and compresses high (bright) values.
- Therefore it expands the dark regions and brightens the image while compressing the dynamic range.
- Main use: displaying images with a very large dynamic range, such as the Fourier spectrum, where a few very large values would otherwise dominate and hide low-magnitude detail. Applying the log lets all components become visible on a normal display.
- The inverse-log (exponential) transformation does the opposite, enhancing bright regions.
What is convolution in spatial filtering?
Convolution in Spatial Filtering
Spatial filtering processes a pixel using a small mask (kernel) of size slid over the image. Convolution is the core operation that computes each output pixel as a weighted sum of the input pixel and its neighbours.
For a kernel of size , the 2-D convolution is
Key Points
- The kernel is rotated by 180° before sliding (this distinguishes convolution from correlation, which uses the kernel as-is).
- For each position, multiply overlapping kernel and image values, sum them, and place the result at the centre pixel.
- Borders are handled by padding (zero, replicate, or mirror).
Example: convolving with averages each neighbourhood, producing a smoothing (blur) effect. Different kernels produce smoothing, sharpening, or edge detection. Convolution is linear and shift-invariant, which is why the same kernel applies uniformly across the image.
Differentiate between the DFT and the DCT.
DFT vs DCT
| Feature | DFT (Discrete Fourier Transform) | DCT (Discrete Cosine Transform) |
|---|---|---|
| Basis functions | Complex exponentials (sine + cosine) | Real cosine functions only |
| Output | Complex (magnitude + phase) | Real-valued |
| Symmetry assumed | Periodic extension of the signal | Even (mirror-symmetric) extension |
| Energy compaction | Lower | High – energy concentrated in few low-frequency coefficients |
| Boundary artifacts | Can show discontinuities/ringing at edges | Reduced, due to symmetric extension |
| Computation | More (complex arithmetic) | Less (real arithmetic) |
| Typical use | Frequency-domain filtering, spectral analysis, convolution | Compression (JPEG, MPEG) |
1-D DCT: .
Summary: The DFT is general-purpose (complex, used for filtering/analysis), whereas the DCT is real-valued with superior energy compaction, making it the transform of choice for image and video compression.
Explain watershed segmentation.
Watershed Segmentation
The watershed transform is a region-based segmentation method that treats a grayscale image as a topographic surface, where intensity is interpreted as elevation. Bright pixels are hills/ridges and dark pixels are valleys (catchment basins).
Flooding Analogy
Imagine piercing a hole at each local minimum and slowly flooding the surface from below:
- Water rises uniformly and fills each catchment basin.
- Where waters from two different basins are about to merge, a dam (watershed line) is built.
- When flooding completes, the dams form closed contours that segment the image into regions.
It is usually applied to a gradient image, so basins correspond to homogeneous regions and watershed lines lie along strong edges.
Over-segmentation and the Fix
Noise and small intensity fluctuations create many spurious minima, causing over-segmentation (too many tiny regions). This is controlled by marker-controlled watershed: internal and external markers are defined first, and flooding starts only from those markers, yielding meaningful regions.
Advantages: produces continuous, closed boundaries; intuitive. Disadvantage: sensitive to noise → over-segmentation without markers.
What is the Canny edge detector?
Canny Edge Detector
The Canny edge detector (J. Canny, 1986) is an optimal multi-stage algorithm designed to satisfy three criteria: good detection (low error rate), good localization (edges close to true edges), and single response per edge. Its steps are:
- Smoothing – convolve the image with a Gaussian filter to reduce noise.
- Gradient computation – find intensity gradient magnitude and direction (e.g. using Sobel):
- Non-maximum suppression – thin the edges by keeping only pixels that are local maxima of along the gradient direction, suppressing all others.
- Double thresholding – classify pixels using two thresholds and into strong (), weak (–), and non-edge ().
- Edge tracking by hysteresis – keep weak edges only if they are connected to a strong edge; discard isolated weak edges.
Significance
Canny produces thin, continuous, well-localized edges and is robust to noise, making it one of the most widely used edge detectors compared with simpler operators (Sobel, Prewitt) that lack thinning and hysteresis.
Explain dilation and erosion with examples.
Dilation and Erosion (Morphological Operations)
Morphological operations process a binary (or grayscale) image with a small structuring element (SE) , based on set theory.
Dilation
The SE is slid over the image; if it touches (overlaps) any foreground pixel, the centre is set to foreground. Effect: grows / thickens objects, fills small holes and gaps, and connects nearby components.
Example: with a SE, an isolated foreground pixel expands into a block; a one-pixel gap in a line is bridged.
Erosion
The centre stays foreground only if the entire SE fits inside the object. Effect: shrinks / thins objects, removes small isolated noise pixels and thin protrusions.
Example: with a SE, an object's boundary layer is stripped off by one pixel; an isolated single foreground pixel is removed entirely.
Duality and Combinations
They are duals: .
- Opening = erosion then dilation () → removes small objects/noise, smooths contours.
- Closing = dilation then erosion () → fills small holes and gaps.
What is the JPEG compression standard?
JPEG Compression Standard
JPEG (Joint Photographic Experts Group) is the most widely used lossy compression standard for continuous-tone still images. It exploits the human visual system's reduced sensitivity to high-frequency detail and to colour (chrominance).
Baseline Encoding Steps
- Colour transform & subsampling – convert RGB to ; subsample the chroma channels (e.g. 4:2:0) since the eye is less sensitive to colour detail.
- Block splitting – divide each channel into pixel blocks; level-shift by subtracting 128.
- Forward DCT – apply the 2-D Discrete Cosine Transform to each block, concentrating energy in the top-left (low-frequency) coefficients.
- Quantization – divide each DCT coefficient by a value from a quantization table and round. This is the lossy step; higher-frequency coefficients (less visible) are quantized more coarsely. A quality factor scales the table.
- Entropy coding – reorder coefficients in a zig-zag scan (groups zeros together), apply run-length encoding plus DPCM on the DC term, then Huffman (or arithmetic) coding.
Decoding
The inverse steps (entropy decode → dequantize → inverse DCT → colour reconstruct) recover the image. Because quantization discards data, the result is an approximation; lower quality → higher compression and more blocking artifacts.
Note: JPEG also defines a lossless mode, while JPEG 2000 uses the wavelet transform instead of the DCT.
Write short notes on colour image processing models.
Colour Image Processing Models
A colour model is a specification of a coordinate system in which each colour is represented as a point. The main models used in image processing are:
1. RGB (Red, Green, Blue)
An additive model where colours are formed by combining the three primaries. Represented as a unit cube; suited to hardware such as monitors, cameras, and scanners. A pixel is a triplet ; 24-bit colour gives M colours.
2. CMY / CMYK (Cyan, Magenta, Yellow, blacK)
A subtractive model used for printing. Related to RGB by . Black (K) is added for true black and ink economy.
3. HSI (Hue, Saturation, Intensity)
Separates colour information (hue, saturation) from intensity. Because it decouples intensity from chromaticity, it matches human colour perception and is ideal for image-processing algorithms (e.g. you can enhance brightness or segment by hue independently). Related variants: HSV/HSB and HSL.
4. / YUV
= luminance, = chrominance. Used in video and JPEG/MPEG compression because chroma can be subsampled with little perceived loss.
Summary
| Model | Type | Main use |
|---|---|---|
| RGB | Additive | Display hardware |
| CMYK | Subtractive | Printing |
| HSI/HSV | Perceptual | Image-processing algorithms |
| Luma/chroma | Compression, video |
Frequently asked questions
- Where can I find the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) question paper 2081?
- The full BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2081 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Image Processing (BSc CSIT, CSC413) 2081 paper come with solutions?
- Yes. Every question on this Image Processing (BSc CSIT, CSC413) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2081 paper?
- The BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2081 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Image Processing (BSc CSIT, CSC413) past paper free?
- Yes — reading and attempting this Image Processing (BSc CSIT, CSC413) past paper on Kekkei is completely free.