Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

What is a multimedia system? Explain the characteristics of multimedia data, the storage and coding requirements, and the components of a multimedia computing system.

Multimedia System

A multimedia system is a computer-controlled, integrated system that can capture, store, process, transmit and present information in more than one medium — combining text, graphics, images, audio, video and animation — in a synchronized and interactive manner. At least one continuous (time-dependent) medium such as audio or video must be involved.

Characteristics of Multimedia Data

  • Voluminous (large data size): Uncompressed audio, image and especially video produce huge data volumes (e.g. raw video ≈ tens of MB per second).
  • Time dependence (continuous media): Audio and video are isochronous — samples must be delivered at fixed rates; late data is useless.
  • Need for synchronization: Different media streams (e.g. lip-sync of audio and video) must be temporally coordinated.
  • High bandwidth and real-time requirements: Streaming demands guaranteed throughput and bounded delay/jitter.
  • Compressibility: Multimedia data contains spatial/temporal redundancy and so is amenable to compression.
  • Interactivity: Users can navigate, seek, pause and control the presentation.

Storage and Coding Requirements

Because raw data is so large, multimedia systems must compress data using coding standards:

MediumTypical coding standard
ImageJPEG, PNG, GIF
AudioMP3, AAC, PCM/WAV
VideoMPEG-1/2/4, H.264/H.265

Example of raw size: a 640×480640\times480 true-color (24-bit) image needs 640×480×3=921,600640\times480\times3 = 921{,}600 bytes (0.9\approx 0.9 MB). One second of such video at 25 fps needs 22\approx 22 MB, so compression and high-capacity storage (CD/DVD/SSD) are essential.

Components of a Multimedia Computing System

  1. Capture devices – cameras, microphones, scanners, video/audio digitizers.
  2. Storage devices – hard disks/SSD, optical media (CD/DVD/Blu-ray), large-capacity servers.
  3. Processing hardware – CPU/GPU, sound and graphics cards, codecs (compression/decompression).
  4. Communication/network – high-bandwidth networks for streaming and conferencing.
  5. Presentation/output devices – display screens, speakers, projectors.
  6. Software – authoring tools, media players, operating-system multimedia support (file formats, drivers, real-time scheduling).

Together these support the acquisition → compression/storage → processing → transmission → synchronized presentation pipeline that defines a multimedia computing system.

multimedia-systems
2long10 marks

What is multimedia synchronization? Explain intra-media and inter-media synchronization, and discuss the reference model for multimedia synchronization.

Multimedia Synchronization

Multimedia synchronization is the task of maintaining the correct temporal and spatial relationships among the media objects that make up a multimedia presentation, so that they are rendered in the intended order and at the intended times (e.g. audio matching the moving lips in a video).

Intra-media (Intra-stream) Synchronization

This maintains the internal timing within a single continuous medium. Each media unit (e.g. a video frame or audio sample) must be presented at the correct instant relative to the previous units, preserving the playback rate.

  • Example: displaying a 25 fps video so that one frame appears every 40 ms; or playing audio samples at exactly 44.1 kHz.
  • Goal: avoid jitter and rate variation inside one stream.

Inter-media (Inter-stream) Synchronization

This maintains the temporal relationship between two or more different media streams.

  • Classic example: lip-sync between the audio stream and the video stream (skew must stay within about ±80 ms to be unnoticed).
  • Other examples: a slide changing exactly when the narration reaches a point; subtitles appearing with the spoken line.
  • Goal: keep the relative skew between streams within acceptable limits.

Reference Model for Multimedia Synchronization (Four-Layer Model)

A widely used model structures synchronization into four layers, each offering a service to the layer above:

  1. Media Layer – deals with a single continuous media stream as a sequence of Logical Data Units (LDUs); provides device-level read/write of samples/frames (intra-stream timing).
  2. Stream Layer – operates on continuous streams and groups of streams; provides guarantees on throughput, delay and jitter, and handles intra-stream synchronization for grouped streams.
  3. Object Layer – integrates all media streams and time-independent objects (text, images) into a complete presentation; hides the difference between continuous and discrete media and handles inter-stream coordination.
  4. Specification Layer – an open layer where authors specify the synchronization requirements (using interval-based, axes-based, hierarchical or event-based specification methods); it maps the author's intent down to the object layer.

This layered reference model separates what must be synchronized (specification) from how it is enforced at runtime (object/stream/media layers).

synchronization
3long10 marks

Explain digital image representation and color models (RGB, CMYK, YUV/YCbCr). Discuss how color is sampled and the concept of chroma subsampling in multimedia.

Digital Image Representation

A digital image is a 2-D array of pixels (picture elements). Each pixel stores an intensity/color value. For a color image with bit depth bb bits per channel and 3 channels, each pixel needs 3b3b bits; a W×HW\times H true-color (24-bit) image needs W×H×3W\times H\times 3 bytes. Spatial resolution = pixel count; color depth = bits per pixel.

Color Models

RGB (Red, Green, Blue) — additive

Colors are produced by adding light in the three primaries. Used by emissive devices (monitors, cameras, scanners). (0,0,0)(0,0,0) = black, (255,255,255)(255,255,255) = white. Suited to display, not to printing.

CMYK (Cyan, Magenta, Yellow, Black) — subtractive

Colors are produced by subtracting (absorbing) light using inks on paper. Used in printing. Black (K) is added because mixing C, M, Y inks gives a muddy dark, not true black, and saves ink. Approximate conversion: C=1R,  M=1G,  Y=1BC=1-R',\; M=1-G',\; Y=1-B' (on normalized values), then K is extracted.

YUV / YCbCr — luminance + chrominance

Separates brightness from color:

  • Y = luma (brightness),
  • U/Cb = blue-difference chroma, V/Cr = red-difference chroma.

Conversion (BT.601):

Y=0.299R+0.587G+0.114BY = 0.299R + 0.587G + 0.114B Cb=0.564(BY),Cr=0.713(RY)C_b = 0.564(B-Y),\qquad C_r = 0.713(R-Y)

This model is used in TV, JPEG and MPEG because it decorrelates color and lets the chroma be compressed more heavily than luma.

Color Sampling and Chroma Subsampling

The human eye is more sensitive to brightness (luma) than to color (chroma). Therefore the chroma channels can be sampled at a lower spatial resolution than luma without visible quality loss — this is chroma subsampling.

Notation J:a:bJ{:}a{:}b over a 4×24\times2 block:

  • 4:4:4 – no subsampling; full chroma for every pixel.
  • 4:2:2 – chroma sampled at half horizontal resolution (shared by 2 pixels).
  • 4:2:0 – chroma sampled at half horizontal and half vertical resolution (one chroma sample per 2×22\times2 block). Used in JPEG, MPEG, H.264.

Benefit: 4:2:0 stores chroma at one quarter the samples, cutting data size by about 50% versus 4:4:4 with little perceptual loss — a key reason YCbCr is preferred over RGB for compression.

colorimage
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

What is Huffman coding? Construct a Huffman code for a given set of symbols and explain its working.

Huffman Coding

Huffman coding is a lossless, variable-length, entropy-coding technique that assigns shorter codewords to more frequent symbols and longer codewords to rarer ones, producing an optimal prefix-free code (no codeword is a prefix of another).

Construction (example)

Symbols and frequencies: A=45, B=13, C=12, D=16, E=9, F=5 (total 100).

  1. Treat each symbol as a leaf node with its frequency.
  2. Repeatedly take the two lowest-frequency nodes and merge them into a parent whose frequency is their sum.
  3. Repeat until one tree (root) remains.
  4. Label left edges 0 and right edges 1; the path from root to a leaf is that symbol's code.

Merging order: (F5+E9)=14, (C12+B13)=25, (14+D16)=30, (25+30)=55, (45+55)=100.

Resulting codes:

SymbolFreqCodeBits
A4501
B131013
C121003
D161113
E911014
F511004

Average length =(451+133+123+163+94+54)/100=224/100=2.24= (45\cdot1+13\cdot3+12\cdot3+16\cdot3+9\cdot4+5\cdot4)/100 = 224/100 = 2.24 bits/symbol, versus 3 bits for fixed-length coding — a clear saving.

Working

Encoding replaces each symbol with its codeword. Decoding walks the tree bit-by-bit from the root until a leaf is reached; the prefix property guarantees unambiguous decoding. Huffman coding is used inside JPEG, MP3 and ZIP.

huffman
5short5 marks

Explain run-length encoding (RLE) with an example.

Run-Length Encoding (RLE)

Run-length encoding is a simple lossless compression technique that replaces consecutive repetitions (runs) of the same symbol with a single value and a count, instead of storing each repeated symbol separately.

Principle

A run of nn identical symbols s is stored as the pair (n, s) (or (s, n)). It is most effective when data contains long runs of identical values, e.g. simple images, fax (bi-level) images, and icons.

Example

Input string:

AAAAABBBCCDAA

Encoded as count–symbol pairs:

5A 3B 2C 1D 2A

The 13-character input is represented by 5 pairs.

For a binary/fax line such as 0000000111110000 it becomes 7(0) 5(1) 4(0).

Characteristics

  • Lossless – original data is fully recoverable.
  • Very fast and simple to implement.
  • Best for low-complexity / repetitive data; for data with few runs (e.g. natural photos), it can even expand the data, so it is often combined with other schemes (e.g. RLE of zero runs in JPEG after DCT/quantization).
run-length
6short5 marks

Explain the role of DCT and quantization in JPEG compression.

Role of DCT and Quantization in JPEG Compression

JPEG compresses an image in 8×88\times8 pixel blocks (after RGB→YCbCr conversion and optional chroma subsampling). Two key stages are the DCT and quantization.

Discrete Cosine Transform (DCT)

The 2-D DCT transforms each 8×88\times8 block of pixel values from the spatial domain to the frequency domain:

F(u,v)=14C(u)C(v)x=07y=07f(x,y)cos ⁣(2x+1)uπ16cos ⁣(2y+1)vπ16F(u,v)=\tfrac14 C(u)C(v)\sum_{x=0}^{7}\sum_{y=0}^{7} f(x,y)\cos\!\frac{(2x+1)u\pi}{16}\cos\!\frac{(2y+1)v\pi}{16}
  • It produces 64 DCT coefficients: one DC coefficient (average/low frequency, top-left) and 63 AC coefficients (higher frequencies).
  • Image energy is compacted into a few low-frequency coefficients; most high-frequency coefficients are near zero.
  • The DCT itself is lossless and reversible; it only reorganizes the information so redundancy can be removed.

Quantization

Each DCT coefficient is divided by a value from an 8×88\times8 quantization table and rounded:

FQ(u,v)=round ⁣(F(u,v)Q(u,v))F_Q(u,v)=\text{round}\!\left(\frac{F(u,v)}{Q(u,v)}\right)
  • This is the only lossy step in JPEG.
  • High-frequency coefficients (to which the eye is less sensitive) get larger quantization step sizes, so many become zero and are discarded.
  • The quality/compression trade-off is controlled here: larger QQ values → more zeros → smaller file but lower quality.

Together

DCT concentrates the visually important information into a few coefficients; quantization then throws away the perceptually unimportant (mostly high-frequency) detail. The resulting block — full of zeros — is then zig-zag scanned and run-length + Huffman coded to achieve the final compression.

jpeg
7short5 marks

Differentiate between I-frames, P-frames and B-frames in MPEG.

I-, P- and B-frames in MPEG

MPEG video exploits temporal redundancy between successive frames using three frame (picture) types organized in a Group of Pictures (GOP).

FeatureI-frame (Intra)P-frame (Predicted)B-frame (Bidirectional)
CodingCoded independently (like a JPEG image)Predicted from a previous I/P framePredicted from previous and following I/P frames
CompressionLowest (largest size)Higher than IHighest (smallest size)
Motion compensationNoneForward predictionForward + backward prediction
Random access / seekingYes (entry point)NoNo
Error propagationStops herePropagates to dependent framesUsed as reference: no (cannot be referenced by others)

Details

  • I-frame (Intra-coded): self-contained; uses only spatial (intra-frame) compression. Acts as a reference and a random-access/refresh point, and starts each GOP. Largest in size.
  • P-frame (Predictive-coded): stores only the difference from the most recent preceding I- or P-frame, using forward motion compensation. Smaller than I but depends on the earlier frame.
  • B-frame (Bidirectionally predicted): predicted from both a past and a future reference frame, giving the best compression. B-frames are never used as references, so they cause no error propagation. (Because they need a future frame, encoding/decoding order differs from display order.)

A typical GOP looks like I B B P B B P B B P ….

mpeg
8short5 marks

Differentiate between the RGB and CMYK color models.

RGB vs CMYK Color Models

AspectRGBCMYK
PrimariesRed, Green, BlueCyan, Magenta, Yellow, Black (Key)
Color mixingAdditive (adds light)Subtractive (absorbs/subtracts light)
Base / 'no color'Black is absence of light (0,0,0)(0,0,0)White is the (blank) paper
'Full' resultWhite = all colors added (255,255,255)Black ≈ all inks combined
Used byEmissive devices – monitors, TVs, cameras, scanners, webPrinting – inkjet/laser/offset presses
Channels3 channels4 channels
Color gamutWider; can show vivid, bright colorsNarrower; cannot reproduce all RGB colors

Explanation

  • RGB is additive: colored light is emitted and combined; adding all three at full intensity yields white. Ideal for screens that produce their own light.
  • CMYK is subtractive: colored inks on white paper absorb part of the incident white light and reflect the rest; combining inks removes more light, tending toward black. The extra K (black) is used because mixed CMY inks give a muddy brown rather than pure black, and using K saves ink and gives sharper text.
  • Conversion (approx., normalized 0–1): C=1R,  M=1G,  Y=1BC=1-R,\;M=1-G,\;Y=1-B, then K=min(C,M,Y)K=\min(C,M,Y) is factored out. Because gamuts differ, some bright RGB screen colors cannot be printed exactly in CMYK.
color-model
9short5 marks

Explain sampling and quantization of digital audio.

Sampling and Quantization of Digital Audio

Converting a continuous (analog) sound wave into a digital signal involves two steps: sampling (discretizing time) and quantization (discretizing amplitude) — together forming PCM (Pulse Code Modulation).

Sampling

The continuous waveform is measured at regular time intervals; each measurement is a sample. The sampling rate (frequency) fsf_s is the number of samples per second (Hz).

  • Nyquist theorem: to capture a signal whose highest frequency is fmaxf_{max} without aliasing, fs2fmaxf_s \ge 2 f_{max}.
  • Example: human hearing reaches ~20 kHz, so CD audio uses fs=44.1f_s = 44.1 kHz (2×20 kHz\ge 2\times20\text{ kHz}).
  • Too low a rate causes aliasing (high frequencies fold into wrong, lower frequencies).

Quantization

Each sampled amplitude (a real value) is rounded to the nearest level out of a finite set determined by the bit depth nn. With nn bits there are 2n2^n levels.

  • Example: 16-bit audio → 216=65,5362^{16}=65{,}536 levels.
  • The rounding error is quantization noise; more bits → finer steps → lower noise and higher dynamic range (≈ 6.02n6.02n dB).

Resulting Data Size

Bitrate=fs×bit depth×channels\text{Bitrate} = f_s \times \text{bit depth} \times \text{channels}

For CD-quality stereo: 44100×16×2=1,411,20044100 \times 16 \times 2 = 1{,}411{,}200 bits/s ≈ 1.41 Mbit/s (≈ 10 MB/minute uncompressed), which is why audio compression (MP3/AAC) is used.

audio
10short5 marks

Differentiate between lossy and lossless compression with examples.

Lossy vs Lossless Compression

AspectLosslessLossy
Data recoveryOriginal is exactly reconstructedReconstruction is approximate; some data permanently lost
Compression ratioLower (typically 2:1–4:1)Much higher (10:1 to 100:1+)
QualityNo quality lossSome quality degradation (often imperceptible)
How it worksRemoves statistical redundancy onlyDiscards perceptually unimportant information
ExamplesPNG, GIF, ZIP, FLAC, Huffman, RLE, LZWJPEG, MP3, AAC, MPEG/H.264 video
Use casesText, source code, medical/technical images, archivesPhotos, music, streaming video where small size matters

Explanation

  • Lossless compression encodes data more efficiently (e.g. Huffman or RLE) so the exact original bits are recoverable. It is mandatory where every bit matters (programs, text, legal/medical images).
  • Lossy compression exploits limits of human perception (e.g. the eye's insensitivity to high spatial frequencies, the ear's masking effects) to throw away data that the user is unlikely to notice. This yields far smaller files, which is why it dominates images, audio and video.

Examples: A .png logo (lossless) can be restored bit-for-bit; a .jpg photo (lossy) at high compression shows blocking artifacts and cannot be restored to the original.

compression
11short5 marks

What are the characteristics of multimedia data? Explain the storage requirements.

Characteristics of Multimedia Data and Storage Requirements

Characteristics of Multimedia Data

  • Voluminous (large size): Uncompressed image, audio and especially video generate very large data volumes.
  • Continuous / time-dependent media: Audio and video are isochronous — units must be delivered and played at a fixed rate; late data is worthless.
  • Real-time and high-bandwidth needs: Capture and playback require guaranteed throughput and bounded delay and jitter.
  • Need for synchronization: Multiple streams (e.g. audio + video lip-sync) must keep their temporal relationship.
  • Redundancy / compressibility: Contains spatial and temporal redundancy, so it compresses well (JPEG, MPEG, MP3).
  • Interactivity: Supports user control such as seek, pause and navigation.

Storage Requirements

Because raw data is so large, multimedia needs high-capacity, high-throughput storage and compression.

Example (raw sizes):

  • 640×480640\times480 true-color (24-bit) image: 640×480×3=921,6000.9640\times480\times3 = 921{,}600 \approx 0.9 MB.
  • CD-quality stereo audio: 44100×16×21.4144100\times16\times2 \approx 1.41 Mbit/s ≈ 10 MB/minute.
  • Raw video at 25 fps of the above frame ≈ 0.9 MB×25220.9\text{ MB}\times25 \approx 22 MB/s — over a GB per minute.

Consequences for storage:

  1. Large capacity media (HDD/SSD, optical CD/DVD/Blu-ray, servers).
  2. High transfer rate / low access time to sustain real-time playback.
  3. Compression (lossy and lossless) to make storage and transmission feasible.
  4. Specialized file systems/buffering to deliver continuous media with bounded jitter.
multimedia-data
12short5 marks

What is multimedia synchronization? Differentiate intra-media and inter-media synchronization.

Multimedia Synchronization

Multimedia synchronization is the process of maintaining the correct temporal (and spatial) relationships between the media objects in a presentation, so they are played in the intended order and timing (e.g. audio matching video, subtitles matching speech).

Intra-media vs Inter-media Synchronization

AspectIntra-media (intra-stream)Inter-media (inter-stream)
ScopeWithin a single continuous mediumBetween two or more different media streams
GoalMaintain correct timing/rate of units inside one stream (avoid jitter)Maintain correct relative timing (skew) across streams
ExamplePlaying a 25 fps video so a frame appears exactly every 40 ms; audio at 44.1 kHzLip-sync of audio with video; subtitle appearing with the spoken line; slide changing with narration
ConcernPlayback rate, jitterRelative skew between streams

Explanation

  • Intra-media synchronization ensures the internal continuity of one stream: each logical data unit (frame/sample) is presented at the right instant relative to the previous one, so playback is smooth and at the correct rate.
  • Inter-media synchronization coordinates several streams together. The classic case is lip-sync, where audio–video skew should stay within roughly ±80 ms to remain unnoticed. It also covers discrete media, e.g. an image or text appearing at a defined moment in the audio.
synchronization

Frequently asked questions

Where can I find the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) question paper 2081?
The full BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Multimedia Computing (BSc CSIT, CSC467) 2081 paper come with solutions?
Yes. Every question on this Multimedia Computing (BSc CSIT, CSC467) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 paper?
The BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Multimedia Computing (BSc CSIT, CSC467) past paper free?
Yes — reading and attempting this Multimedia Computing (BSc CSIT, CSC467) past paper on Kekkei is completely free.