BSc CSIT (TU) Science Multimedia Computing (BSc CSIT, CSC467) Question Paper 2074 Nepal

Q: Where can I find the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) question paper 2074?

The full BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2074 Regular (annual) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Multimedia Computing (BSc CSIT, CSC467) 2074 paper come with solutions?

Yes. Every question on this Multimedia Computing (BSc CSIT, CSC467) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2074 paper?

The BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2074 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.

Q: Is practising this Multimedia Computing (BSc CSIT, CSC467) past paper free?

Yes — reading and attempting this Multimedia Computing (BSc CSIT, CSC467) past paper on Kekkei is completely free.

Question

1. Long answer (10 marks)

Explain the JPEG image compression standard. Describe its main steps - DCT, quantization, zig-zag ordering and entropy (Huffman) coding - with the help of a block diagram.


Answer 1

JPEG Image Compression Standard

JPEG (Joint Photographic Experts Group) is a widely used lossy compression standard for continuous-tone still images. It exploits the limitations of the human visual system (which is less sensitive to high-frequency detail and to chrominance than luminance) to discard information that is not visually significant.

Block Diagram (pipeline)

Source Image
   |
[Color transform RGB -> YCbCr + chroma subsampling]
   |
[Divide into 8x8 blocks]
   |
[Forward DCT] --> [Quantization] --> [Zig-zag ordering] --> [Run-length + DPCM] --> [Entropy (Huffman) coding]
   |
Compressed bitstream (JFIF)

Main Steps

1. Color transform & subsampling. RGB is converted to $Y$ (luminance), $C_b, C_r$ (chrominance). Chrominance is subsampled (e.g. 4:2:0) since the eye is less sensitive to color detail.

2. Block splitting. Each component is divided into non-overlapping 8x8 pixel blocks.

3. Discrete Cosine Transform (DCT). Each 8x8 block of pixel values $f(x,y)$ is transformed into 64 frequency coefficients $F(u,v)$:

$$F(u,v)=\tfrac{1}{4}C(u)C(v)\sum_{x=0}^{7}\sum_{y=0}^{7} f(x,y)\cos\frac{(2x+1)u\pi}{16}\cos\frac{(2y+1)v\pi}{16}$$

where $C(k)=1/\sqrt{2}$ for $k=0$ and $C(k)=1$ otherwise. The top-left coefficient $F(0,0)$ is the DC term (proportional to the block average); the rest are AC terms representing increasing spatial frequencies. Energy is compacted into a few low-frequency coefficients.

4. Quantization. Each coefficient is divided by a value from an 8x8 quantization table $Q(u,v)$ and rounded:

$$F_q(u,v)=\operatorname{round}\!\left(\frac{F(u,v)}{Q(u,v)}\right)$$

Larger step sizes are used for high frequencies, so many of those coefficients become zero. This is the lossy step and controls the quality/size trade-off.

5. Zig-zag ordering. The 8x8 quantized block is read in a zig-zag order from low to high frequency. This groups the long runs of zeros at the end, which compress well.

6. Entropy (Huffman) coding. The DC coefficient is coded differentially (DPCM) against the previous block's DC. AC coefficients are coded as (run-length of zeros, value) pairs, then Huffman-coded to produce the final compressed bitstream. (Arithmetic coding is an optional alternative.)
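Steps 5 and 6 can be sketched in a few lines of Python. This is a minimal illustration, not a conforming JPEG encoder: the quantized block values and the previous DC value are hypothetical, the function names are made up for this example, and the final Huffman stage as well as the special handling JPEG uses for zero runs longer than 15 are left out.

```python
def zigzag_order(n=8):
    """Return the (row, col) positions of an n x n block in zig-zag scan order."""
    order = []
    for s in range(2 * n - 1):                  # s = row + col selects one anti-diagonal
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even diagonals are walked from bottom-left to top-right, odd ones the other way.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order


def ac_run_length(ac_coeffs):
    """Turn zig-zag ordered AC coefficients into (zero_run, value) pairs."""
    pairs, run = [], 0
    for v in ac_coeffs:
        if v == 0:
            run += 1
        else:
            pairs.append((run, v))
            run = 0
    if run:                                     # trailing zeros collapse into one marker
        pairs.append("EOB")
    return pairs


# Hypothetical quantized block: a few non-zero low-frequency values, zeros elsewhere.
block = [[0] * 8 for _ in range(8)]
block[0][0], block[0][1], block[1][0], block[1][1] = 26, -3, 2, 4

zz = [block[r][c] for r, c in zigzag_order()]
prev_dc = 20                                    # hypothetical DC of the previous block
dc_diff = zz[0] - prev_dc                       # DPCM: only the difference is coded
symbols = ac_run_length(zz[1:])
print(dc_diff, symbols)
```

The DC difference and the (run, value) pairs are the symbols that would then be passed to the Huffman coder.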

Decoding

Decoding reverses the pipeline: entropy decode, dequantize, inverse DCT, recombine blocks, and convert YCbCr back to RGB.

Answer 2

MPEG Video Compression Standard

MPEG (Moving Picture Experts Group) standards (MPEG-1, MPEG-2, etc.) compress digital video by removing both spatial redundancy within a frame (using JPEG-like intra-frame DCT coding) and temporal redundancy between successive frames (using motion-compensated prediction).

Frame Types

Frame	Coding	Reference used	Compression
I-frame (Intra)	Coded independently like a JPEG image	None	Lowest
P-frame (Predictive)	Predicted from a previous I or P frame	Past frame	Medium
B-frame (Bidirectional)	Predicted from both previous and future I/P frames	Past + future	Highest

I-frames allow random access and act as recovery/refresh points but use the most bits.
P-frames store only the motion-compensated difference from a past reference.
B-frames interpolate between a past and a future reference, giving the best compression; they are not used as references themselves.

Motion Estimation and Compensation

Each frame is divided into macroblocks (typically 16x16 pixels).

Motion estimation: For each macroblock, the encoder searches a region of the reference frame to find the best-matching block, producing a motion vector (dx, dy). Block-matching with a cost such as SAD (Sum of Absolute Differences) is used.
Motion compensation: The prediction for a macroblock is the block in the reference frame displaced by the motion vector. Only the residual (difference between actual and predicted block) plus the motion vector is encoded. The residual is DCT-transformed, quantized and entropy-coded.

This greatly reduces data because consecutive video frames are highly similar.
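Block matching with SAD can be sketched as an exhaustive search. The version below is an illustration under simplifying assumptions: frames are plain lists of rows holding one luminance value per pixel, the search is full-pel only, and the function names, frame size and test pattern are invented for the example. Here the motion vector points from the current block to its best match in the reference frame.

```python
def sad(cur, ref, cx, cy, rx, ry, n):
    """Sum of absolute differences between the n x n block of `cur` at (cx, cy)
    and the n x n block of `ref` at (rx, ry)."""
    return sum(abs(cur[cy + j][cx + i] - ref[ry + j][rx + i])
               for j in range(n) for i in range(n))


def motion_vector(cur, ref, cx, cy, n=16, search=7):
    """Full search in a +/-search window; returns ((dx, dy), best_sad)."""
    h, w = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            rx, ry = cx + dx, cy + dy
            if 0 <= rx <= w - n and 0 <= ry <= h - n:   # candidate must lie inside the frame
                cost = sad(cur, ref, cx, cy, rx, ry, n)
                if best is None or cost < best[1]:
                    best = ((dx, dy), cost)
    return best


def residual(cur, ref, cx, cy, mv, n=16):
    """Difference between the actual block and its motion-compensated prediction."""
    dx, dy = mv
    return [[cur[cy + j][cx + i] - ref[cy + dy + j][cx + dx + i]
             for i in range(n)] for j in range(n)]


# Hypothetical 12x12 frames: a 4x4 patch moves 3 pixels right and 2 pixels down.
W = H = 12
ref = [[0] * W for _ in range(H)]
cur = [[0] * W for _ in range(H)]
for j in range(4):
    for i in range(4):
        ref[3 + j][2 + i] = 10 + 4 * j + i
        cur[5 + j][5 + i] = 10 + 4 * j + i

mv, cost = motion_vector(cur, ref, cx=5, cy=5, n=4, search=4)
res = residual(cur, ref, 5, 5, mv, n=4)
print(mv, cost)
```

Because the patch is found exactly, the residual is all zeros and only the motion vector carries information. Practical encoders use faster search strategies than this full search.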

Group of Pictures (GOP)

A GOP is a repeating sequence of frames beginning with an I-frame, e.g.:

I B B P B B P B B P ...

GOP size (distance between I-frames) trades off compression vs random-access/error-resilience.
Because B-frames depend on a future P/I frame, the display order differs from the transmission/decoding order (the future reference is sent before the B-frames that use it).
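The reordering can be illustrated with a short sketch. The frame labels and the function name are hypothetical; the rule implemented is simply that each B-frame is emitted after the next I/P reference it depends on.

```python
def decode_order(display):
    """Reorder display-order frame labels so every B-frame follows its future reference."""
    out, pending_b = [], []
    for frame in display:
        if frame[0] == "B":
            pending_b.append(frame)     # must wait for the next I/P reference
        else:
            out.append(frame)           # the reference frame is sent first
            out.extend(pending_b)       # then the B-frames that needed it
            pending_b = []
    return out + pending_b              # trailing B-frames are left at the end


display = ["I1", "B2", "B3", "P4", "B5", "B6", "P7"]
print(decode_order(display))
```

For the display order I1 B2 B3 P4 B5 B6 P7 this gives the transmission order I1 P4 B2 B3 P7 B5 B6.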

Answer 3

Entropy and Source Coding for Multimedia

Entropy is the average information content of a source. For a source with symbols of probability $p_i$:

$$H=-\sum_i p_i\log_2 p_i \quad\text{(bits/symbol)}$$

It is the theoretical lower bound on the average number of bits per symbol for lossless coding. Source (entropy) coding assigns shorter codes to frequent symbols to approach this bound.
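As a quick numerical check of the formula, the sketch below evaluates $H$ for a few example distributions. The distributions are illustrative choices, not data from the question.

```python
from math import log2


def entropy(probs):
    """Shannon entropy in bits/symbol; zero-probability symbols contribute nothing."""
    return -sum(p * log2(p) for p in probs if p > 0)


print(entropy([0.5, 0.5]))        # two equally likely symbols
print(entropy([0.25] * 4))        # four equally likely symbols
print(entropy([0.9, 0.1]))        # a skewed source
```

Equally likely symbols give the maximum (1 bit for two symbols, 2 bits for four), while the skewed source needs less than half a bit per symbol on average, which is the room that entropy coding exploits.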

Huffman Coding

A variable-length, prefix-free code built bottom-up: repeatedly merge the two least-probable symbols into a node whose probability is their sum, until one tree remains; assign 0/1 to branches. It produces the optimal integer-length prefix code.

Run-Length Encoding (RLE)

Replaces runs of identical symbols by a (count, value) pair, e.g. AAAAABBB -> 5A3B. Very effective for data with long repeats (e.g. zero runs after JPEG quantization, fax images).

Arithmetic Coding

Encodes an entire message as a single fractional number in $[0,1)$, recursively narrowing the interval according to each symbol's probability. It can use fractional bits per symbol and so approaches entropy more closely than Huffman, especially for skewed probabilities.
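The interval narrowing can be traced with exact fractions. This is only a sketch of the encoder's core idea: the two-symbol alphabet and its probabilities are hypothetical, and practical coders use integer arithmetic with renormalization rather than unbounded fractions.

```python
from fractions import Fraction


def narrow_interval(message, probs):
    """Return the final [low, high) interval for `message`.

    `probs` maps each symbol to its probability; symbols occupy [0, 1)
    in the dictionary's insertion order.
    """
    cum, start = {}, Fraction(0)
    for sym, p in probs.items():        # lower bound of each symbol's sub-interval
        cum[sym] = start
        start += p
    low, width = Fraction(0), Fraction(1)
    for sym in message:
        low += width * cum[sym]         # move to the symbol's sub-interval
        width *= probs[sym]             # shrink by the symbol's probability
    return low, low + width


probs = {"A": Fraction(3, 4), "B": Fraction(1, 4)}   # hypothetical model
low, high = narrow_interval("AAB", probs)
print(low, high)
```

For the message AAB the interval is [27/64, 9/16). Its width, 9/64, is the product of the symbol probabilities, so any number inside it identifies the message in about $-\log_2(9/64)\approx 2.8$ bits.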

Worked Huffman Example

Let symbols and frequencies be: A=5, B=2, C=1, D=1 (total 9).

Combine smallest: C(1)+D(1) = CD(2).
Combine B(2)+CD(2) = BCD(4).
Combine A(5)+BCD(4) = root(9).

Assigning 0/left, 1/right:

Symbol	Freq	Code	Length
A	5	0	1
B	2	10	2
C	1	110	3
D	1	111	3

Average length $=\frac{5\cdot1+2\cdot2+1\cdot3+1\cdot3}{9}=\frac{15}{9}\approx1.67$ bits/symbol, far better than 2 bits with fixed-length coding.
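To see how close this code comes to the entropy bound, the following sketch recomputes the average length from the table and compares it with $H$ for the same frequencies.

```python
from math import log2

freqs = {"A": 5, "B": 2, "C": 1, "D": 1}       # frequencies from the example
lengths = {"A": 1, "B": 2, "C": 3, "D": 3}     # code lengths from the table

total = sum(freqs.values())
avg_len = sum(freqs[s] * lengths[s] for s in freqs) / total
H = -sum((f / total) * log2(f / total) for f in freqs.values())

print(round(avg_len, 3), round(H, 3))
```

The entropy comes out at roughly 1.66 bits/symbol, so the Huffman code at 1.67 bits/symbol is within about one percent of the lower bound.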

Answer 4

Huffman Coding

Huffman coding is a lossless, variable-length prefix code that assigns shorter codewords to more frequent symbols and longer codewords to rare ones, minimizing the average code length.

Algorithm

List symbols with their frequencies/probabilities.
Repeatedly take the two lowest-frequency nodes and merge them into a new node whose frequency is their sum.
Repeat until a single tree (root) remains.
Label each left/right branch 0/1; the path from root to a leaf is that symbol's code.

Example

Symbols: A=45, B=13, C=12, D=16, E=9, F=5.

Merging the two smallest at each step (F+E=14, C+B=25, ...), a valid resulting code is:

Symbol	Freq	Code
A	45	0
B	13	101
C	12	100
D	16	111
E	9	1101
F	5	1100

The codes are prefix-free (no code is a prefix of another), so the bitstream decodes unambiguously. Frequent symbol A uses 1 bit while rare F uses 4 bits. Treating the frequencies as percentages, the average length is $(45\cdot1+13\cdot3+12\cdot3+16\cdot3+9\cdot4+5\cdot4)/100=2.24$ bits/symbol, below the 3 bits needed for fixed-length coding of 6 symbols.
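The algorithm above can be sketched with a priority queue. This is one possible implementation, using Python's standard `heapq` module; the function name is chosen for the example. The exact bit patterns depend on how ties and left/right labels are resolved, so they may differ from the table, but the code lengths are the same.

```python
import heapq
from itertools import count


def huffman_codes(freqs):
    """Build a Huffman code for a {symbol: frequency} mapping (symbols are strings)."""
    tie = count()                       # tie-breaker so the heap never compares tree nodes
    heap = [(f, next(tie), sym) for sym, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)       # two lowest-frequency nodes
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, next(tie), (left, right)))

    codes = {}

    def walk(node, prefix):
        if isinstance(node, tuple):             # internal node: (left, right)
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:                                   # leaf: a symbol
            codes[node] = prefix or "0"

    walk(heap[0][2], "")
    return codes


freqs = {"A": 45, "B": 13, "C": 12, "D": 16, "E": 9, "F": 5}
codes = huffman_codes(freqs)
avg = sum(freqs[s] * len(codes[s]) for s in freqs) / sum(freqs.values())
print(codes, avg)
```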

Answer 5

Run-Length Encoding (RLE)

Run-length encoding is a simple lossless compression technique that replaces consecutive repeated data values (a run) with a single value and a count, rather than storing each repetition.

Example

Input string:

WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWWWWWWWWWWWWWWWWWWWWBWWWWWWWWWWWWWW

RLE output:

12W1B12W3B24W1B14W

Here 67 characters are reduced to 18, encoding each run as <count><symbol>.
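A minimal encoder and decoder for this <count><symbol> format might look as follows. The sketch assumes the symbols are never digits, since otherwise the counts could not be told apart from the data.

```python
import re
from itertools import groupby


def rle_encode(text):
    """Encode each run of identical characters as <count><symbol>."""
    return "".join(f"{len(list(run))}{sym}" for sym, run in groupby(text))


def rle_decode(encoded):
    """Inverse of rle_encode; assumes symbols are non-digit characters."""
    return "".join(sym * int(n) for n, sym in re.findall(r"(\d+)(\D)", encoded))


# The example string, built run by run: 12W 1B 12W 3B 24W 1B 14W.
data = "W" * 12 + "B" + "W" * 12 + "BBB" + "W" * 24 + "B" + "W" * 14
encoded = rle_encode(data)
print(len(data), encoded, len(encoded))
print(rle_encode("ABCD"))      # no repetition: the output is longer than the input
```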

Characteristics

Best for data with long runs of identical values: simple graphics, icons, fax (CCITT) images, and the zero-runs produced after JPEG quantization.
Worst case: data with no repetition can become larger than the original.
It is fast and is often combined with other methods (e.g. RLE then Huffman in JPEG).

Answer 6

Role of DCT and Quantization in JPEG

Discrete Cosine Transform (DCT)

JPEG applies a forward DCT to each 8x8 block of pixels, converting spatial pixel values into 64 frequency coefficients:

$$F(u,v)=\tfrac{1}{4}C(u)C(v)\sum_{x=0}^{7}\sum_{y=0}^{7} f(x,y)\cos\frac{(2x+1)u\pi}{16}\cos\frac{(2y+1)v\pi}{16}$$

Here $C(k)=1/\sqrt{2}$ for $k=0$ and $C(k)=1$ otherwise. The coefficient $F(0,0)$ is the DC term (proportional to the block average); the rest are AC terms for increasing frequencies.
DCT performs energy compaction: most signal energy concentrates in a few low-frequency coefficients, while high-frequency coefficients (fine detail/noise) are small. The DCT itself discards no information: apart from rounding errors it is fully reversible through the inverse DCT.
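The formula can be evaluated directly, as in the sketch below. It is a straightforward and slow translation of the double sum, intended only for illustration; real codecs use fast factorized DCTs, and JPEG additionally subtracts 128 from 8-bit samples before the transform, which is omitted here.

```python
from math import cos, pi, sqrt


def dct_8x8(f):
    """Forward 2-D DCT of an 8x8 block given as a list of 8 rows."""
    def c(k):
        return 1 / sqrt(2) if k == 0 else 1.0

    F = [[0.0] * 8 for _ in range(8)]
    for u in range(8):
        for v in range(8):
            s = sum(f[x][y]
                    * cos((2 * x + 1) * u * pi / 16)
                    * cos((2 * y + 1) * v * pi / 16)
                    for x in range(8) for y in range(8))
            F[u][v] = 0.25 * c(u) * c(v) * s
    return F


flat = [[100] * 8 for _ in range(8)]    # a perfectly uniform block
F = dct_8x8(flat)
print(round(F[0][0], 6))
```

For the uniform block all the energy lands in the DC term, which equals 8 times the pixel value, and every AC coefficient is zero up to floating-point error. This is energy compaction in its most extreme form.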

Quantization

Each DCT coefficient is divided by a value from an 8x8 quantization table and rounded:

$$F_q(u,v)=\operatorname{round}\!\left(\frac{F(u,v)}{Q(u,v)}\right)$$

Larger quantization steps are applied to high-frequency coefficients (the eye is less sensitive to them), so many become zero.
This is the lossy step of JPEG; it is where actual compression and quality loss happen.
The quality factor scales $Q$: higher quality = finer steps = larger files.
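The effect of the quality factor can be sketched as follows. Two assumptions are made here: the base values are the first row of the example luminance table commonly used with JPEG, and the scaling rule follows the convention of the widely used IJG libjpeg library, which is not part of the JPEG standard itself. The DCT coefficients are hypothetical values chosen for illustration.

```python
BASE_ROW = [16, 11, 10, 16, 24, 40, 51, 61]    # first row of the example luminance table


def scale_table(base, quality):
    """Scale quantization steps for a quality setting 1..100 (libjpeg-style convention)."""
    quality = max(1, min(100, quality))
    scale = 5000 // quality if quality < 50 else 200 - 2 * quality
    return [max(1, min(255, (q * scale + 50) // 100)) for q in base]


def quantize(coeffs, table):
    # Note: Python's round() sends exact .5 ties to the nearest even integer.
    return [round(c / q) for c, q in zip(coeffs, table)]


def dequantize(levels, table):
    return [level * q for level, q in zip(levels, table)]


coeffs = [-415.4, -30.2, -61.2, 27.2, 56.1, -20.1, -2.4, 0.5]   # hypothetical DCT row
for quality in (10, 50, 90):
    table = scale_table(BASE_ROW, quality)
    print(quality, table, quantize(coeffs, table))
```

At quality 50 the table is unchanged, at quality 90 the steps shrink so more coefficients survive, and at quality 10 the steps grow so that most of the row quantizes to zero.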

Together: DCT reorganizes information so it can be discarded selectively, and quantization discards the least visually important information, producing many zeros that subsequent zig-zag, run-length and Huffman coding compress efficiently.

Answer 7

I-frames vs P-frames vs B-frames (MPEG)

Feature	I-frame (Intra)	P-frame (Predictive)	B-frame (Bidirectional)
Coding	Self-contained, JPEG-like intra coding	Predicted from a past I/P frame	Predicted from past and future I/P frames
Reference frames	None	One previous frame	One previous + one future frame
Compression	Lowest (most bits)	Medium	Highest (fewest bits)
Used as reference?	Yes	Yes	No
Random access	Provides entry/refresh points	No	No
Error propagation	Stops error drift	Can propagate errors	Does not propagate (not referenced)

Summary

I-frames are encoded independently using only spatial (intra-frame) redundancy, allowing random access but giving the least compression.
P-frames use motion-compensated prediction from a previous reference frame and store only the residual + motion vectors.
B-frames interpolate bidirectionally between a past and a future reference, achieving the best compression; because they need a future frame, the decoding order differs from the display order.

Answer 8

RGB vs CMYK Color Models

Aspect	RGB	CMYK
Components	Red, Green, Blue	Cyan, Magenta, Yellow, Black (Key)
Color mixing	Additive (adding light)	Subtractive (absorbing/subtracting light from white)
White / Black	All channels max = white; all zero = black	No ink = white (paper); all inks = black (K used for true black)
Primary use	Screens/emissive displays: monitors, TVs, cameras, web	Printing: inkjet/offset, magazines, packaging
Gamut	Larger; covers more bright/saturated colors	Smaller; cannot reproduce some bright RGB colors
Channels	3	4

Explanation

RGB is additive: colors are produced by emitting and combining red, green and blue light. Maximum of all three gives white; this matches how display devices generate color.
CMYK is subtractive: printed inks absorb (subtract) wavelengths from white light reflected off paper. Cyan, magenta and yellow theoretically make black, but in practice a separate black (K) ink is added for deeper blacks, sharper text and to save colored ink.
Designs created in RGB for screens must be converted to CMYK for printing, which may shift colors because CMYK has a smaller gamut.
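The relationship between the two models can be illustrated with the simple textbook conversion below. It is a device-independent approximation only: real print workflows convert through color profiles that account for the inks and paper, which this sketch ignores. The function name is chosen for the example.

```python
def rgb_to_cmyk(r, g, b):
    """Convert 8-bit RGB to (c, m, y, k) fractions in 0..1 with the naive formula."""
    r, g, b = r / 255, g / 255, b / 255
    k = 1 - max(r, g, b)                # black replaces the common dark component
    if k == 1:                          # pure black: avoid dividing by zero
        return 0.0, 0.0, 0.0, 1.0
    return ((1 - r - k) / (1 - k),
            (1 - g - k) / (1 - k),
            (1 - b - k) / (1 - k),
            k)


print(rgb_to_cmyk(255, 0, 0))       # red   -> magenta + yellow
print(rgb_to_cmyk(255, 255, 255))   # white -> no ink
print(rgb_to_cmyk(0, 0, 0))         # black -> K only
```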

Answer 9

Sampling and Quantization of Digital Audio

Converting a continuous (analog) sound wave into digital form requires two steps: sampling (discretizing time) and quantization (discretizing amplitude). Together they form Pulse Code Modulation (PCM).

Sampling

The amplitude of the analog signal is measured at regular time intervals at a fixed sampling rate $f_s$ (samples per second, Hz).
By the Nyquist theorem, $f_s$ must be at least twice the highest frequency in the signal to avoid aliasing:

$$f_s \ge 2 f_{max}$$

Example: human hearing reaches ~20 kHz, so at least 40 kHz is required; audio CDs use $f_s = 44.1$ kHz, which leaves a small margin above that minimum.

Quantization

Each sampled amplitude is rounded to the nearest level from a finite set of $2^n$ levels, where $n$ is the bit depth (e.g. 16 bits = 65,536 levels).
The rounding error introduces quantization noise; more bits = finer levels = higher signal-to-noise ratio and better fidelity.

Data Rate

$$\text{bit rate} = f_s \times n \times (\text{channels})$$

For CD-quality stereo: $44100 \times 16 \times 2 = 1.41$ Mbps. Higher rate/depth means better quality but larger files, motivating audio compression (e.g. MP3).
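Sampling, quantization and the bit-rate formula can be put together in a short sketch. The 1 kHz test tone, the 1 ms duration and the function names are assumptions made for the example; the quantizer is a plain uniform one mapping amplitudes in [-1, 1] to signed integers.

```python
from math import sin, pi


def pcm_encode(signal, duration, fs, bits):
    """Sample signal(t) (values in [-1, 1]) at fs Hz and quantize to signed integers."""
    max_level = 2 ** (bits - 1) - 1             # 32767 for 16 bits
    n = int(duration * fs)                      # number of samples taken
    return [round(signal(k / fs) * max_level) for k in range(n)]


def bit_rate(fs, bits, channels):
    """Uncompressed PCM bit rate in bits per second."""
    return fs * bits * channels


def tone(t):
    return sin(2 * pi * 1000 * t)               # hypothetical 1 kHz test tone


samples = pcm_encode(tone, 0.001, 44100, 16)
print(len(samples), samples[:4])
print(bit_rate(44100, 16, 2))
```

One millisecond at 44.1 kHz yields 44 samples, and the CD-quality stereo rate evaluates to 1,411,200 bit/s, i.e. the 1.41 Mbps quoted above.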

Answer 10

Lossy vs Lossless Compression

Aspect	Lossless	Lossy
Data recovery	Original reconstructed exactly	Approximate; some data permanently discarded
Compression ratio	Lower (typically 2:1 to 3:1)	Much higher (10:1 to 50:1 or more)
Quality	No quality loss	Quality degrades with higher compression
Basis	Removes statistical/redundant data	Removes perceptually unimportant data
Reversible?	Yes	No
Examples	ZIP, PNG, GIF, FLAC, Huffman, RLE, LZW	JPEG, MPEG, MP3, AAC, H.264

Explanation

Lossless compression encodes data so it can be perfectly restored. It exploits statistical redundancy (e.g. Huffman, RLE, LZW). Used where exact data matters: text, executables, archives, medical/legal images.
Lossy compression achieves much smaller sizes by discarding information the human eye/ear is unlikely to notice (high-frequency detail, inaudible sounds). The loss is irreversible but acceptable for photos, audio and video, so it is used in JPEG, MP3 and MPEG.

Trade-off: lossy gives far smaller files at the cost of fidelity; lossless preserves data exactly but compresses less.

Answer 11

Characteristics and Storage Requirements of Multimedia Data

Characteristics

Voluminous / large data size – images, audio and especially video produce huge amounts of data.
Heterogeneous (multiple media types) – text, graphics, images, audio, video and animation combined, each with different formats.
Time-dependent (continuous) media – audio and video must be presented at a fixed rate; they have temporal/real-time constraints.
High bandwidth and processing demands – capture, transmission and playback need high data rates.
Need for synchronization – different streams (e.g. audio with video) must stay aligned.
Highly compressible – contains much redundancy, so compression is essential.

Storage Requirements

Uncompressed multimedia is extremely large:

Image: $\text{width}\times\text{height}\times\text{bits per pixel}$. A 1024x768 24-bit image $\approx 2.36$ MB.
Audio: $f_s \times \text{bit depth} \times \text{channels}$. CD stereo = 1.41 Mbps $\approx$ 10.6 MB/minute.
Video: $\text{width}\times\text{height}\times\text{bits per pixel}\times\text{frames per second}$. Raw 720x480, 24-bit, 30 fps $\approx$ 249 Mbps, i.e. about 1.9 GB per minute.
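These figures can be reproduced with a few lines of arithmetic. The helper names are chosen for this example, and sizes are reported in decimal units (1 MB = 10^6 bytes).

```python
def image_bytes(width, height, bits_per_pixel):
    """Uncompressed image size in bytes."""
    return width * height * bits_per_pixel // 8


def audio_bytes(fs, bit_depth, channels, seconds):
    """Uncompressed PCM audio size in bytes."""
    return fs * bit_depth * channels * seconds // 8


def video_bits_per_second(width, height, bits_per_pixel, fps):
    """Uncompressed video bit rate in bits per second."""
    return width * height * bits_per_pixel * fps


img = image_bytes(1024, 768, 24)
aud = audio_bytes(44100, 16, 2, 60)
vid = video_bits_per_second(720, 480, 24, 30)

print(img / 1e6, "MB per image")
print(aud / 1e6, "MB per minute of audio")
print(vid / 1e6, "Mbps,", vid / 8 * 60 / 1e9, "GB per minute of video")
```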

Because of these enormous sizes, multimedia systems rely on compression (JPEG, MP3, MPEG) and on high-capacity, high-throughput storage and networks.

Answer 12

Multimedia Synchronization

Multimedia synchronization is the coordination of the temporal and spatial relationships between different media objects so they are presented to the user in the correct, intended order and timing (e.g. keeping audio aligned with video, or showing a subtitle at the right moment).

Intra-media vs Inter-media Synchronization

Aspect	Intra-media synchronization	Inter-media synchronization
Scope	Within a single continuous medium	Between two or more media streams
Goal	Maintain the correct internal timing/playback rate of one stream	Maintain temporal alignment across different streams
Example	Playing video frames at a steady 30 fps; smooth audio playback without gaps	Lip-sync: aligning the audio track with the video track; showing a slide with its narration
Concern	Jitter, frame dropping, constant data rate	Skew/drift between streams

Summary

Intra-media synchronization preserves the timing within one medium (e.g. constant frame/sample rate).
Inter-media synchronization preserves timing between media (the classic example being audio-video lip-sync). Proper synchronization is essential for a coherent multimedia presentation.

Level	BSc CSIT (TU)
Stream	Science
Subject	Multimedia Computing (BSc CSIT, CSC467)
Year	2074 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions
