BSc CSIT (TU) Science Image Processing (BSc CSIT, CSC413) Question Paper 2080 Nepal
This is the official BSc CSIT (TU) (Science stream) Image Processing (BSc CSIT, CSC413) question paper for 2080, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Image Processing (BSc CSIT, CSC413) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) exam or solving previous years' question papers, this 2080 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain image enhancement techniques in the spatial and frequency domains with suitable examples and their comparative advantages.
Image Enhancement
Image enhancement is the process of improving the visual appearance of an image (or making it more suitable for a specific analysis task) by accentuating certain features while suppressing others. The two broad approaches are spatial-domain and frequency-domain methods.
1. Spatial-Domain Techniques
These operate directly on the pixels of the image. A spatial operation is expressed as:
where is the input image, is the output and is an operator defined over a neighbourhood of .
(a) Point processing (intensity transformations) — acts on a single pixel.
- Negative: (highlights white detail in dark regions).
- Log transform: (expands dark values, compresses bright ones).
- Power-law (gamma): .
- Contrast stretching and thresholding.
- Histogram equalization: redistributes intensities to span the full range for better contrast.
(b) Spatial filtering (mask/neighbourhood processing) — a kernel is convolved with the image.
- Smoothing (low-pass): averaging / Gaussian masks remove noise and blur.
- Sharpening (high-pass): Laplacian, unsharp masking, high-boost filtering enhance edges.
- Order-statistic filters: median filter for impulse noise.
2. Frequency-Domain Techniques
The image is transformed with the 2-D DFT , multiplied by a filter transfer function , and transformed back:
- Low-pass filters (Ideal, Butterworth, Gaussian) attenuate high frequencies → smoothing / noise removal / blurring.
- High-pass filters attenuate low frequencies → sharpening / edge enhancement.
- Homomorphic filtering simultaneously compresses dynamic range and enhances contrast by separating illumination and reflectance.
- Band-pass / band-reject (notch) filters remove periodic noise.
3. Comparative Advantages
| Aspect | Spatial domain | Frequency domain |
|---|---|---|
| Operation | Direct on pixels (convolution) | Multiplication after DFT |
| Cost | Cheap for small kernels | Cheaper for large kernels (FFT ) |
| Intuition | Easy, local control | Frequency-selective, global control |
| Best for | Point ops, small smoothing/sharpening masks | Periodic-noise removal, homomorphic filtering, large-support filters |
| Design | Kernel design can be ad hoc | Filter directly specified by frequency response |
Example: Removing periodic interference (e.g. scan-line noise) is awkward in the spatial domain but trivial with a frequency-domain notch filter; conversely a quick 3×3 median to kill salt-and-pepper noise is simplest in the spatial domain.
Conclusion: Spatial methods are simple and computationally light for local operations, whereas frequency methods give precise, global, frequency-selective control and are efficient for large filters via the FFT. The two are linked by the convolution theorem, so a designer chooses the domain that is more convenient for the task.
What are image transforms? Explain the Fourier, cosine, Hadamard, and Haar transforms and their applications in image processing.
Image Transforms
An image transform maps an image from the spatial domain into another domain (usually a frequency / sequency domain) using a set of orthogonal basis functions. The forward transform expresses the image as a weighted sum of these basis images; the coefficients reveal information (frequency, energy compaction) that is hard to see in the pixel domain and is useful for compression, filtering, feature extraction and enhancement. A general 2-D separable transform is:
where is the forward transformation kernel.
1. Fourier Transform (DFT)
Uses complex sinusoidal basis functions. The 2-D DFT is:
- Produces magnitude (energy at each frequency) and phase (structure/position).
- Applications: frequency-domain filtering, fast convolution, periodic-noise removal, image analysis, computed via the FFT.
2. Discrete Cosine Transform (DCT)
Uses real cosine basis functions:
- Excellent energy compaction — most signal energy packs into a few low-frequency coefficients; real-valued (no phase).
- Applications: the heart of JPEG image compression (8×8 block DCT), MPEG video.
3. Walsh–Hadamard Transform
Basis functions are the Hadamard matrix entries, taking only values and :
- No multiplications needed (only additions/subtractions) → very fast and cheap hardware. Ordered by sequency (number of sign changes) rather than frequency.
- Applications: low-cost compression, image coding, spread-spectrum, fast computation.
4. Haar Transform
The oldest wavelet transform; basis functions are rectangular Haar wavelets that are localized in both space and frequency.
- Captures both average (approximation) and detail (edges) at multiple resolutions.
- Applications: multiresolution analysis, edge detection, the basis of wavelet-based compression (JPEG-2000-style), feature extraction.
Comparison
| Transform | Basis | Real/Complex | Strength |
|---|---|---|---|
| Fourier | Complex sinusoids | Complex | Frequency analysis, filtering |
| DCT | Cosines | Real | Best energy compaction (JPEG) |
| Hadamard | ±1 (Walsh) | Real | Fastest, cheapest |
| Haar | Wavelets | Real | Localized, multiresolution |
Conclusion: All four are orthogonal transforms used for compression and analysis. Fourier suits frequency filtering, DCT dominates practical compression, Hadamard wins on computational cost, and Haar provides spatial-frequency localization for multiresolution processing.
Explain image segmentation. Discuss thresholding, edge-based, and region-based segmentation methods in detail.
Image Segmentation
Image segmentation is the process of partitioning an image into multiple meaningful, non-overlapping regions (sets of pixels) that are homogeneous with respect to some property (intensity, colour, texture), so that objects and boundaries can be identified. Formally the image is divided into regions such that:
- (complete),
- for (disjoint),
- each is connected and satisfies a homogeneity predicate , while for adjacent regions.
Segmentation algorithms exploit two basic properties of intensity: discontinuity (edges) and similarity (regions/thresholds).
1. Thresholding
Separates objects from background using one or more intensity thresholds. For a single threshold :
- Global thresholding: a single for the whole image; chosen from the histogram. Otsu's method picks that maximises between-class variance.
- Adaptive/local thresholding: varies across the image to handle uneven illumination.
- Pros: simple, fast, works well for bimodal histograms. Cons: sensitive to noise and illumination; ignores spatial context.
2. Edge-Based Segmentation
Detects boundaries where intensity changes abruptly (discontinuity), then links edges into contours.
- First-derivative (gradient) operators: Roberts, Prewitt, Sobel detect edges as gradient maxima; magnitude .
- Second-derivative: Laplacian / Laplacian of Gaussian (LoG, Marr–Hildreth) find edges at zero-crossings.
- Canny edge detector: Gaussian smoothing → gradient → non-maximum suppression → double thresholding with hysteresis; gives thin, well-localized edges.
- Detected edge pixels are then linked (local processing / Hough transform) to form region boundaries.
- Pros: good localization of boundaries. Cons: sensitive to noise, edges may be broken/incomplete.
3. Region-Based Segmentation
Groups pixels directly into regions based on similarity.
- Region growing: starts from seed pixels and appends neighbouring pixels that satisfy a similarity criterion (e.g. intensity within a tolerance) until no more pixels qualify.
- Region splitting and merging: uses a quadtree — split a region into quadrants if it is not homogeneous (), then merge adjacent homogeneous regions.
- Pros: produces connected regions, more robust to noise, uses spatial connectivity. Cons: needs seed/criterion selection, can be computationally heavy, sensitive to seed choice.
Summary
| Method | Based on | Strength | Weakness |
|---|---|---|---|
| Thresholding | Similarity (intensity) | Simple, fast | Poor with non-uniform illumination |
| Edge-based | Discontinuity | Good boundary localization | Broken edges, noise-sensitive |
| Region-based | Similarity + connectivity | Connected, robust regions | Slow, seed-dependent |
Conclusion: Thresholding and region methods rely on pixel similarity, while edge methods rely on discontinuity. In practice they are often combined (e.g. edges + region growing) for reliable segmentation.
Section B: Short Answer Questions
Attempt any EIGHT questions.
What is the difference between spatial and intensity resolution?
Spatial resolution refers to the smallest discernible detail in an image — it is determined by the number of pixels (sampling density), usually stated as rows × columns or dots-per-inch (dpi). More pixels (finer sampling) → higher spatial resolution and more spatial detail; reducing it causes blocky/pixelated images.
Intensity (gray-level) resolution refers to the smallest discernible change in gray level — it is determined by the number of bits used to quantize the intensity. For bits there are gray levels (e.g. 8 bits → 256 levels). More levels → finer tonal/brightness detail; reducing it causes false contouring (banding).
| Spatial resolution | Intensity resolution | |
|---|---|---|
| Determined by | Sampling (number of pixels) | Quantization (bits per pixel) |
| Measures | Spatial detail | Brightness/gray-level detail |
| Degradation effect | Pixelation/blockiness | False contouring (banding) |
In short, spatial resolution = how many pixels, while intensity resolution = how many gray levels per pixel.
Explain power-law (gamma) transformation.
Power-law (gamma) transformation is a point intensity transformation of the form:
where is the input intensity (often normalized to ), is the output intensity, and and are positive constants.
- : maps a narrow range of dark input values to a wider range of output values → brightens the image and expands contrast in dark regions.
- : maps a narrow range of bright input values to a wider output range → darkens the image and expands contrast in bright regions.
- : linear (identity) mapping.
Gamma correction uses this law to compensate for the non-linear response of display devices (CRT/LCD), which inherently follow with –; an inverse power law is applied so the displayed image looks correct. It is also widely used for general contrast manipulation, MRI/X-ray enhancement and aerial-image enhancement.
Unlike the fixed log transform, gamma gives a family of curves controlled by , so the amount of brightening/darkening can be tuned.
What is the use of a median filter in noise removal?
A median filter is a non-linear, order-statistic spatial filter used mainly to remove impulse (salt-and-pepper) noise while preserving edges.
Operation: A window (e.g. 3×3) slides over the image. For each pixel, the gray values inside the window are sorted and the median value replaces the centre pixel:
Why it works for noise removal:
- Salt-and-pepper noise creates extreme (very high or very low) outlier pixels. Outliers fall at the ends of the sorted list, so the median rejects them, whereas a mean (averaging) filter would smear them across neighbours.
- Because the output is an actual pixel value from the neighbourhood (not an average), the median filter preserves sharp edges and causes far less blurring than a linear averaging filter.
Limitations: Less effective against Gaussian noise; very large windows blur fine detail; computationally costlier (requires sorting).
Summary: The median filter is the preferred choice for removing impulse noise with good edge preservation.
Explain the properties of the 2D DFT.
Properties of the 2-D DFT
For an image with transform , the key properties are:
-
Separability: the 2-D DFT can be computed as two successive 1-D DFTs — first along rows, then along columns — allowing use of the FFT for speed.
-
Linearity: .
-
Translation (shift): a spatial shift adds a phase term, ; multiplying by an exponential shifts the spectrum (used to centre the spectrum with ).
-
Periodicity: — the DFT is periodic in both directions.
-
Conjugate symmetry: for a real image, , so (magnitude spectrum is symmetric).
-
Rotation: rotating the image by an angle rotates its spectrum by the same angle (shown in polar coordinates).
-
Convolution theorem: spatial convolution corresponds to multiplication in the frequency domain, (basis of frequency-domain filtering).
-
Scaling / distributivity, and DC term: = average intensity; scaling spatial coordinates inversely scales the frequency axes.
-
Average value & Parseval (energy conservation): total energy is preserved between domains.
These properties make the 2-D DFT a practical tool for filtering, convolution and analysis.
Differentiate between point, line, and edge detection.
All three are intensity-discontinuity detectors that use spatial masks (typically based on the second derivative / Laplacian), but they detect different geometric features.
| Feature | What it detects | Idea / mask | Output |
|---|---|---|---|
| Point detection | An isolated point (single pixel) that differs sharply from its surroundings | Laplacian mask, e.g. centre , neighbours ; flag pixel if $ | R |
| Line detection | Thin lines one pixel wide in a particular orientation | Directional masks for horizontal, vertical, , ; choose the largest response | Line segments |
| Edge detection | A boundary between two regions of different intensity (a step/ramp change) | Gradient (Sobel/Prewitt – first derivative) or Laplacian zero-crossings (second derivative) | Region boundaries |
Distinguishing points:
- A point is a 0-D discontinuity (single pixel), a line is a 1-D feature (thin, oriented), and an edge separates two extended regions.
- Point and line detection typically use the Laplacian (second derivative); edge detection commonly uses the gradient (first derivative) whose magnitude peaks at edges, or the second derivative's zero-crossings.
- Edges are the most common and important for segmentation; points and lines are special, sparser features.
What is hit-or-miss transformation?
The hit-or-miss transform (HMT) is a basic morphological operation used to detect specific shapes/patterns in a binary image — it locates pixel configurations that simultaneously match the foreground and match the background around them.
It uses a pair of disjoint structuring elements : must fit inside the object (a hit) and must fit inside the background/complement (a miss). The transform is defined as:
where is erosion, is the complement of , and are disjoint ().
Interpretation: A pixel survives only if the foreground pattern fits the object and the background pattern fits the surrounding background at that location — i.e. an exact local shape match.
Applications: template/shape matching, corner and endpoint detection, isolated-pixel detection, and as a building block for thinning, thickening and skeletonization.
In short, the hit-or-miss transform is the morphological tool for exact pattern/shape detection in binary images.
Explain predictive coding in compression.
Predictive coding is a lossless or lossy compression technique that reduces redundancy by coding the difference (prediction error) between each pixel and a value predicted from previously coded neighbouring pixels, instead of coding the pixel itself. Because neighbouring pixels are highly correlated, this error is small and concentrated near zero, so it can be entropy-coded with far fewer bits.
Basic scheme (DPCM):
- A predictor estimates the current pixel from past pixels: (e.g. ).
- Compute the prediction error / residual: .
- Encode (quantize if lossy) and apply entropy coding (Huffman/arithmetic).
- The decoder uses the same predictor and adds the decoded error back to reconstruct the pixel: .
Lossless vs lossy: if the residual is coded exactly, reconstruction is perfect (lossless, e.g. used in lossless JPEG); if a quantizer is inserted (DPCM), it becomes lossy with higher compression.
Advantages: simple, removes inter-pixel (spatial/temporal) redundancy, low complexity. Used in lossless JPEG, JPEG-LS, and motion-compensated prediction in video.
What is the role of a Gaussian filter?
A Gaussian filter is a linear, low-pass smoothing filter whose kernel weights follow a 2-D Gaussian function:
The standard deviation controls the amount of smoothing (larger → wider kernel → more blurring).
Role / uses:
- Noise reduction: smooths the image by weighted averaging, suppressing Gaussian/random noise while giving the centre pixel the most weight, so it blurs less harshly than a box (mean) filter.
- Edge/feature preparation: used as the pre-smoothing stage in edge detectors (e.g. Canny, Laplacian of Gaussian / Marr–Hildreth) to reduce noise before differentiation.
- Scale-space / multiresolution: repeated Gaussian smoothing builds image pyramids (e.g. for SIFT).
Advantages:
- It is separable (a 2-D Gaussian = product of two 1-D Gaussians) → computationally efficient.
- It has no sharp cutoff, so its frequency response is smooth → no ringing artifacts (unlike the ideal low-pass filter).
- Rotationally symmetric and optimally trades off spatial and frequency localization.
Trade-off: smoothing also blurs edges; larger removes more noise but loses more detail.
Write short notes on image restoration techniques.
Image Restoration Techniques
Image restoration aims to recover an original image from a degraded version by modelling the degradation process and applying its inverse — unlike enhancement, it is objective and based on the known/estimated degradation. The degradation model is:
where is the degradation (blur) function and is additive noise. The goal is to estimate .
Main techniques:
-
Noise-only restoration (spatial filters): when only noise is present, use mean filters (arithmetic, geometric), order-statistic filters (median, max, min) and adaptive filters to remove noise based on its statistics.
-
Inverse filtering: estimate . Simple but amplifies noise badly where is small/zero.
-
Wiener (minimum mean-square-error) filtering: balances inverse filtering and noise smoothing:
It incorporates the noise-to-signal power ratio and gives the best MSE estimate.
-
Constrained least-squares filtering: restores using a smoothness (Laplacian) constraint, needing only the noise variance — more robust than Wiener when statistics are unknown.
-
Geometric / blind deconvolution: correct geometric distortions or estimate when it is unknown.
Summary: Restoration models degradation ( + noise) and inverts it; in practice Wiener and constrained least-squares filters are preferred over plain inverse filtering because they control noise amplification.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) question paper 2080?
- The full BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2080 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Image Processing (BSc CSIT, CSC413) 2080 paper come with solutions?
- Yes. Every question on this Image Processing (BSc CSIT, CSC413) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2080 paper?
- The BSc CSIT (TU) Image Processing (BSc CSIT, CSC413) 2080 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Image Processing (BSc CSIT, CSC413) past paper free?
- Yes — reading and attempting this Image Processing (BSc CSIT, CSC413) past paper on Kekkei is completely free.