BSc CSIT (TU) Science Multimedia Computing (BSc CSIT, CSC467) Question Paper 2081 Nepal
This is the official BSc CSIT (TU) (Science stream) Multimedia Computing (BSc CSIT, CSC467) question paper for 2081, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Multimedia Computing (BSc CSIT, CSC467) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) exam or solving previous years' question papers, this 2081 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
What is a multimedia system? Explain the characteristics of multimedia data, the storage and coding requirements, and the components of a multimedia computing system.
Multimedia System
A multimedia system is a computer-controlled, integrated system that can capture, store, process, transmit and present information in more than one medium — combining text, graphics, images, audio, video and animation — in a synchronized and interactive manner. At least one continuous (time-dependent) medium such as audio or video must be involved.
Characteristics of Multimedia Data
- Voluminous (large data size): Uncompressed audio, image and especially video produce huge data volumes (e.g. raw video ≈ tens of MB per second).
- Time dependence (continuous media): Audio and video are isochronous — samples must be delivered at fixed rates; late data is useless.
- Need for synchronization: Different media streams (e.g. lip-sync of audio and video) must be temporally coordinated.
- High bandwidth and real-time requirements: Streaming demands guaranteed throughput and bounded delay/jitter.
- Compressibility: Multimedia data contains spatial/temporal redundancy and so is amenable to compression.
- Interactivity: Users can navigate, seek, pause and control the presentation.
Storage and Coding Requirements
Because raw data is so large, multimedia systems must compress data using coding standards:
| Medium | Typical coding standard |
|---|---|
| Image | JPEG, PNG, GIF |
| Audio | MP3, AAC, PCM/WAV |
| Video | MPEG-1/2/4, H.264/H.265 |
Example of raw size: a true-color (24-bit) image needs bytes ( MB). One second of such video at 25 fps needs MB, so compression and high-capacity storage (CD/DVD/SSD) are essential.
Components of a Multimedia Computing System
- Capture devices – cameras, microphones, scanners, video/audio digitizers.
- Storage devices – hard disks/SSD, optical media (CD/DVD/Blu-ray), large-capacity servers.
- Processing hardware – CPU/GPU, sound and graphics cards, codecs (compression/decompression).
- Communication/network – high-bandwidth networks for streaming and conferencing.
- Presentation/output devices – display screens, speakers, projectors.
- Software – authoring tools, media players, operating-system multimedia support (file formats, drivers, real-time scheduling).
Together these support the acquisition → compression/storage → processing → transmission → synchronized presentation pipeline that defines a multimedia computing system.
What is multimedia synchronization? Explain intra-media and inter-media synchronization, and discuss the reference model for multimedia synchronization.
Multimedia Synchronization
Multimedia synchronization is the task of maintaining the correct temporal and spatial relationships among the media objects that make up a multimedia presentation, so that they are rendered in the intended order and at the intended times (e.g. audio matching the moving lips in a video).
Intra-media (Intra-stream) Synchronization
This maintains the internal timing within a single continuous medium. Each media unit (e.g. a video frame or audio sample) must be presented at the correct instant relative to the previous units, preserving the playback rate.
- Example: displaying a 25 fps video so that one frame appears every 40 ms; or playing audio samples at exactly 44.1 kHz.
- Goal: avoid jitter and rate variation inside one stream.
Inter-media (Inter-stream) Synchronization
This maintains the temporal relationship between two or more different media streams.
- Classic example: lip-sync between the audio stream and the video stream (skew must stay within about ±80 ms to be unnoticed).
- Other examples: a slide changing exactly when the narration reaches a point; subtitles appearing with the spoken line.
- Goal: keep the relative skew between streams within acceptable limits.
Reference Model for Multimedia Synchronization (Four-Layer Model)
A widely used model structures synchronization into four layers, each offering a service to the layer above:
- Media Layer – deals with a single continuous media stream as a sequence of Logical Data Units (LDUs); provides device-level read/write of samples/frames (intra-stream timing).
- Stream Layer – operates on continuous streams and groups of streams; provides guarantees on throughput, delay and jitter, and handles intra-stream synchronization for grouped streams.
- Object Layer – integrates all media streams and time-independent objects (text, images) into a complete presentation; hides the difference between continuous and discrete media and handles inter-stream coordination.
- Specification Layer – an open layer where authors specify the synchronization requirements (using interval-based, axes-based, hierarchical or event-based specification methods); it maps the author's intent down to the object layer.
This layered reference model separates what must be synchronized (specification) from how it is enforced at runtime (object/stream/media layers).
Explain digital image representation and color models (RGB, CMYK, YUV/YCbCr). Discuss how color is sampled and the concept of chroma subsampling in multimedia.
Digital Image Representation
A digital image is a 2-D array of pixels (picture elements). Each pixel stores an intensity/color value. For a color image with bit depth bits per channel and 3 channels, each pixel needs bits; a true-color (24-bit) image needs bytes. Spatial resolution = pixel count; color depth = bits per pixel.
Color Models
RGB (Red, Green, Blue) — additive
Colors are produced by adding light in the three primaries. Used by emissive devices (monitors, cameras, scanners). = black, = white. Suited to display, not to printing.
CMYK (Cyan, Magenta, Yellow, Black) — subtractive
Colors are produced by subtracting (absorbing) light using inks on paper. Used in printing. Black (K) is added because mixing C, M, Y inks gives a muddy dark, not true black, and saves ink. Approximate conversion: (on normalized values), then K is extracted.
YUV / YCbCr — luminance + chrominance
Separates brightness from color:
- Y = luma (brightness),
- U/Cb = blue-difference chroma, V/Cr = red-difference chroma.
Conversion (BT.601):
This model is used in TV, JPEG and MPEG because it decorrelates color and lets the chroma be compressed more heavily than luma.
Color Sampling and Chroma Subsampling
The human eye is more sensitive to brightness (luma) than to color (chroma). Therefore the chroma channels can be sampled at a lower spatial resolution than luma without visible quality loss — this is chroma subsampling.
Notation over a block:
- 4:4:4 – no subsampling; full chroma for every pixel.
- 4:2:2 – chroma sampled at half horizontal resolution (shared by 2 pixels).
- 4:2:0 – chroma sampled at half horizontal and half vertical resolution (one chroma sample per block). Used in JPEG, MPEG, H.264.
Benefit: 4:2:0 stores chroma at one quarter the samples, cutting data size by about 50% versus 4:4:4 with little perceptual loss — a key reason YCbCr is preferred over RGB for compression.
Section B: Short Answer Questions
Attempt any EIGHT questions.
What is Huffman coding? Construct a Huffman code for a given set of symbols and explain its working.
Huffman Coding
Huffman coding is a lossless, variable-length, entropy-coding technique that assigns shorter codewords to more frequent symbols and longer codewords to rarer ones, producing an optimal prefix-free code (no codeword is a prefix of another).
Construction (example)
Symbols and frequencies: A=45, B=13, C=12, D=16, E=9, F=5 (total 100).
- Treat each symbol as a leaf node with its frequency.
- Repeatedly take the two lowest-frequency nodes and merge them into a parent whose frequency is their sum.
- Repeat until one tree (root) remains.
- Label left edges
0and right edges1; the path from root to a leaf is that symbol's code.
Merging order: (F5+E9)=14, (C12+B13)=25, (14+D16)=30, (25+30)=55, (45+55)=100.
Resulting codes:
| Symbol | Freq | Code | Bits |
|---|---|---|---|
| A | 45 | 0 | 1 |
| B | 13 | 101 | 3 |
| C | 12 | 100 | 3 |
| D | 16 | 111 | 3 |
| E | 9 | 1101 | 4 |
| F | 5 | 1100 | 4 |
Average length bits/symbol, versus 3 bits for fixed-length coding — a clear saving.
Working
Encoding replaces each symbol with its codeword. Decoding walks the tree bit-by-bit from the root until a leaf is reached; the prefix property guarantees unambiguous decoding. Huffman coding is used inside JPEG, MP3 and ZIP.
Explain run-length encoding (RLE) with an example.
Run-Length Encoding (RLE)
Run-length encoding is a simple lossless compression technique that replaces consecutive repetitions (runs) of the same symbol with a single value and a count, instead of storing each repeated symbol separately.
Principle
A run of identical symbols s is stored as the pair (n, s) (or (s, n)). It is most effective when data contains long runs of identical values, e.g. simple images, fax (bi-level) images, and icons.
Example
Input string:
AAAAABBBCCDAA
Encoded as count–symbol pairs:
5A 3B 2C 1D 2A
The 13-character input is represented by 5 pairs.
For a binary/fax line such as 0000000111110000 it becomes 7(0) 5(1) 4(0).
Characteristics
- Lossless – original data is fully recoverable.
- Very fast and simple to implement.
- Best for low-complexity / repetitive data; for data with few runs (e.g. natural photos), it can even expand the data, so it is often combined with other schemes (e.g. RLE of zero runs in JPEG after DCT/quantization).
Explain the role of DCT and quantization in JPEG compression.
Role of DCT and Quantization in JPEG Compression
JPEG compresses an image in pixel blocks (after RGB→YCbCr conversion and optional chroma subsampling). Two key stages are the DCT and quantization.
Discrete Cosine Transform (DCT)
The 2-D DCT transforms each block of pixel values from the spatial domain to the frequency domain:
- It produces 64 DCT coefficients: one DC coefficient (average/low frequency, top-left) and 63 AC coefficients (higher frequencies).
- Image energy is compacted into a few low-frequency coefficients; most high-frequency coefficients are near zero.
- The DCT itself is lossless and reversible; it only reorganizes the information so redundancy can be removed.
Quantization
Each DCT coefficient is divided by a value from an quantization table and rounded:
- This is the only lossy step in JPEG.
- High-frequency coefficients (to which the eye is less sensitive) get larger quantization step sizes, so many become zero and are discarded.
- The quality/compression trade-off is controlled here: larger values → more zeros → smaller file but lower quality.
Together
DCT concentrates the visually important information into a few coefficients; quantization then throws away the perceptually unimportant (mostly high-frequency) detail. The resulting block — full of zeros — is then zig-zag scanned and run-length + Huffman coded to achieve the final compression.
Differentiate between I-frames, P-frames and B-frames in MPEG.
I-, P- and B-frames in MPEG
MPEG video exploits temporal redundancy between successive frames using three frame (picture) types organized in a Group of Pictures (GOP).
| Feature | I-frame (Intra) | P-frame (Predicted) | B-frame (Bidirectional) |
|---|---|---|---|
| Coding | Coded independently (like a JPEG image) | Predicted from a previous I/P frame | Predicted from previous and following I/P frames |
| Compression | Lowest (largest size) | Higher than I | Highest (smallest size) |
| Motion compensation | None | Forward prediction | Forward + backward prediction |
| Random access / seeking | Yes (entry point) | No | No |
| Error propagation | Stops here | Propagates to dependent frames | Used as reference: no (cannot be referenced by others) |
Details
- I-frame (Intra-coded): self-contained; uses only spatial (intra-frame) compression. Acts as a reference and a random-access/refresh point, and starts each GOP. Largest in size.
- P-frame (Predictive-coded): stores only the difference from the most recent preceding I- or P-frame, using forward motion compensation. Smaller than I but depends on the earlier frame.
- B-frame (Bidirectionally predicted): predicted from both a past and a future reference frame, giving the best compression. B-frames are never used as references, so they cause no error propagation. (Because they need a future frame, encoding/decoding order differs from display order.)
A typical GOP looks like I B B P B B P B B P ….
Differentiate between the RGB and CMYK color models.
RGB vs CMYK Color Models
| Aspect | RGB | CMYK |
|---|---|---|
| Primaries | Red, Green, Blue | Cyan, Magenta, Yellow, Black (Key) |
| Color mixing | Additive (adds light) | Subtractive (absorbs/subtracts light) |
| Base / 'no color' | Black is absence of light | White is the (blank) paper |
| 'Full' result | White = all colors added (255,255,255) | Black ≈ all inks combined |
| Used by | Emissive devices – monitors, TVs, cameras, scanners, web | Printing – inkjet/laser/offset presses |
| Channels | 3 channels | 4 channels |
| Color gamut | Wider; can show vivid, bright colors | Narrower; cannot reproduce all RGB colors |
Explanation
- RGB is additive: colored light is emitted and combined; adding all three at full intensity yields white. Ideal for screens that produce their own light.
- CMYK is subtractive: colored inks on white paper absorb part of the incident white light and reflect the rest; combining inks removes more light, tending toward black. The extra K (black) is used because mixed CMY inks give a muddy brown rather than pure black, and using K saves ink and gives sharper text.
- Conversion (approx., normalized 0–1): , then is factored out. Because gamuts differ, some bright RGB screen colors cannot be printed exactly in CMYK.
Explain sampling and quantization of digital audio.
Sampling and Quantization of Digital Audio
Converting a continuous (analog) sound wave into a digital signal involves two steps: sampling (discretizing time) and quantization (discretizing amplitude) — together forming PCM (Pulse Code Modulation).
Sampling
The continuous waveform is measured at regular time intervals; each measurement is a sample. The sampling rate (frequency) is the number of samples per second (Hz).
- Nyquist theorem: to capture a signal whose highest frequency is without aliasing, .
- Example: human hearing reaches ~20 kHz, so CD audio uses kHz ().
- Too low a rate causes aliasing (high frequencies fold into wrong, lower frequencies).
Quantization
Each sampled amplitude (a real value) is rounded to the nearest level out of a finite set determined by the bit depth . With bits there are levels.
- Example: 16-bit audio → levels.
- The rounding error is quantization noise; more bits → finer steps → lower noise and higher dynamic range (≈ dB).
Resulting Data Size
For CD-quality stereo: bits/s ≈ 1.41 Mbit/s (≈ 10 MB/minute uncompressed), which is why audio compression (MP3/AAC) is used.
Differentiate between lossy and lossless compression with examples.
Lossy vs Lossless Compression
| Aspect | Lossless | Lossy |
|---|---|---|
| Data recovery | Original is exactly reconstructed | Reconstruction is approximate; some data permanently lost |
| Compression ratio | Lower (typically 2:1–4:1) | Much higher (10:1 to 100:1+) |
| Quality | No quality loss | Some quality degradation (often imperceptible) |
| How it works | Removes statistical redundancy only | Discards perceptually unimportant information |
| Examples | PNG, GIF, ZIP, FLAC, Huffman, RLE, LZW | JPEG, MP3, AAC, MPEG/H.264 video |
| Use cases | Text, source code, medical/technical images, archives | Photos, music, streaming video where small size matters |
Explanation
- Lossless compression encodes data more efficiently (e.g. Huffman or RLE) so the exact original bits are recoverable. It is mandatory where every bit matters (programs, text, legal/medical images).
- Lossy compression exploits limits of human perception (e.g. the eye's insensitivity to high spatial frequencies, the ear's masking effects) to throw away data that the user is unlikely to notice. This yields far smaller files, which is why it dominates images, audio and video.
Examples: A .png logo (lossless) can be restored bit-for-bit; a .jpg photo (lossy) at high compression shows blocking artifacts and cannot be restored to the original.
What are the characteristics of multimedia data? Explain the storage requirements.
Characteristics of Multimedia Data and Storage Requirements
Characteristics of Multimedia Data
- Voluminous (large size): Uncompressed image, audio and especially video generate very large data volumes.
- Continuous / time-dependent media: Audio and video are isochronous — units must be delivered and played at a fixed rate; late data is worthless.
- Real-time and high-bandwidth needs: Capture and playback require guaranteed throughput and bounded delay and jitter.
- Need for synchronization: Multiple streams (e.g. audio + video lip-sync) must keep their temporal relationship.
- Redundancy / compressibility: Contains spatial and temporal redundancy, so it compresses well (JPEG, MPEG, MP3).
- Interactivity: Supports user control such as seek, pause and navigation.
Storage Requirements
Because raw data is so large, multimedia needs high-capacity, high-throughput storage and compression.
Example (raw sizes):
- true-color (24-bit) image: MB.
- CD-quality stereo audio: Mbit/s ≈ 10 MB/minute.
- Raw video at 25 fps of the above frame ≈ MB/s — over a GB per minute.
Consequences for storage:
- Large capacity media (HDD/SSD, optical CD/DVD/Blu-ray, servers).
- High transfer rate / low access time to sustain real-time playback.
- Compression (lossy and lossless) to make storage and transmission feasible.
- Specialized file systems/buffering to deliver continuous media with bounded jitter.
What is multimedia synchronization? Differentiate intra-media and inter-media synchronization.
Multimedia Synchronization
Multimedia synchronization is the process of maintaining the correct temporal (and spatial) relationships between the media objects in a presentation, so they are played in the intended order and timing (e.g. audio matching video, subtitles matching speech).
Intra-media vs Inter-media Synchronization
| Aspect | Intra-media (intra-stream) | Inter-media (inter-stream) |
|---|---|---|
| Scope | Within a single continuous medium | Between two or more different media streams |
| Goal | Maintain correct timing/rate of units inside one stream (avoid jitter) | Maintain correct relative timing (skew) across streams |
| Example | Playing a 25 fps video so a frame appears exactly every 40 ms; audio at 44.1 kHz | Lip-sync of audio with video; subtitle appearing with the spoken line; slide changing with narration |
| Concern | Playback rate, jitter | Relative skew between streams |
Explanation
- Intra-media synchronization ensures the internal continuity of one stream: each logical data unit (frame/sample) is presented at the right instant relative to the previous one, so playback is smooth and at the correct rate.
- Inter-media synchronization coordinates several streams together. The classic case is lip-sync, where audio–video skew should stay within roughly ±80 ms to remain unnoticed. It also covers discrete media, e.g. an image or text appearing at a defined moment in the audio.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) question paper 2081?
- The full BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Multimedia Computing (BSc CSIT, CSC467) 2081 paper come with solutions?
- Yes. Every question on this Multimedia Computing (BSc CSIT, CSC467) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 paper?
- The BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2081 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Multimedia Computing (BSc CSIT, CSC467) past paper free?
- Yes — reading and attempting this Multimedia Computing (BSc CSIT, CSC467) past paper on Kekkei is completely free.