BSc CSIT (TU) Science Multimedia Computing (BSc CSIT, CSC467) Question Paper 2079 Nepal
This is the official BSc CSIT (TU) (Science stream) Multimedia Computing (BSc CSIT, CSC467) question paper for 2079, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Multimedia Computing (BSc CSIT, CSC467) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) exam or solving previous years' question papers, this 2079 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
What is multimedia synchronization? Explain intra-media and inter-media synchronization, and discuss the reference model for multimedia synchronization.
Multimedia Synchronization
Multimedia synchronization is the process of maintaining the correct temporal (time), spatial (position), and content relationships between different media objects (text, audio, video, images, animation) during their capture, storage, transmission, and presentation. Its purpose is to ensure that media are played back at the right time and in the right order, so the presentation is meaningful to the user (e.g., audio matching the lip movements in a video).
Intra-media (Intra-stream) Synchronization
This maintains the temporal relationship within a single continuous media stream.
- It ensures a fixed, constant playout rate for the units (frames/samples) of one stream.
- Example: a video must display its frames at a constant rate (e.g., 25 frames/second); audio samples must be played at a constant sampling rate (e.g., 44,100 samples/second).
- Failure causes jitter, frames appearing too fast/slow, or jerky playback.
Inter-media (Inter-stream) Synchronization
This maintains the temporal relationship between two or more different media streams.
- The most common case is lip synchronization (audio aligned with video). The tolerable skew is roughly ms before users notice the mismatch.
- It also covers relationships such as a slide appearing together with its narration, or subtitles matching speech.
- Failure causes audio leading/lagging the video, or annotations appearing at the wrong moment.
Reference Model for Multimedia Synchronization
A layered four-layer reference model is used to describe synchronization at different levels of abstraction:
| Layer | Function | Synchronization handled |
|---|---|---|
| Media Layer | Operates on a single continuous media stream as a sequence of Logical Data Units (LDUs). | Intra-media (device-level playout rate). |
| Stream Layer | Operates on continuous streams as a whole; provides Quality of Service (QoS) and ensures intra-stream timing and basic inter-stream timing. | Intra- and simple inter-stream sync. |
| Object Layer | Operates on all media types together; hides the difference between continuous and discrete (time-independent) media. | Inter-media (e.g., text + audio + video). |
| Specification Layer | The top, application/authoring layer; lets authors specify synchronization relationships using models such as timeline, hierarchical (Petri-net / OCPN), reference-point, or scripting. | High-level specification of sync. |
Each layer offers an interface (service) to the layer above and uses the services of the layer below. The Specification Layer is open (no fixed implementation) and includes models like the Object Composition Petri Net (OCPN), timeline-based, and reference-point models for expressing how media should be coordinated.
Explain digital image representation and color models (RGB, CMYK, YUV/YCbCr). Discuss how color is sampled and the concept of chroma subsampling in multimedia.
Digital Image Representation and Color Models
Digital Image Representation
A digital image is a 2-D array (matrix) of pixels (picture elements). Each pixel stores a numeric value representing brightness/color.
- Spatial resolution: number of pixels (width height), e.g., .
- Color/bit depth: number of bits per pixel. A grayscale image uses 8 bits (0–255); a true-color image uses 24 bits (8 bits each for R, G, B).
- Storage (uncompressed) .
Color Models
RGB (Red, Green, Blue) — an additive model used by displays, cameras and scanners. Colors are produced by adding red, green and blue light; = black, = white. Device-dependent.
CMYK (Cyan, Magenta, Yellow, Black) — a subtractive model used in color printing. Inks absorb (subtract) light; combining C, M, Y gives a muddy dark, so a separate K (black) channel is added for true black and detail. Conversion (simplified): , , on normalized values, then .
YUV / YCbCr — separates luminance (Y = brightness) from chrominance (color). carries the gray-scale information; and carry color difference. Conversion from RGB (BT.601):
This model exploits the fact that the human eye is more sensitive to brightness than to color, which is the basis for compression.
Color Sampling and Chroma Subsampling
Because the eye perceives less detail in color than in luminance, the chrominance channels can be sampled at a lower resolution than luminance — this is chroma subsampling. It reduces data with little visible quality loss. It is denoted as (luma:chroma in a region):
| Scheme | Meaning | Cb/Cr horizontal resolution |
|---|---|---|
| 4:4:4 | No subsampling | Full (each pixel has its own color) |
| 4:2:2 | Color sampled at half horizontal rate | 1/2 |
| 4:2:0 | Color sampled at half horizontal and half vertical | 1/4 (used in JPEG, MPEG, H.264) |
Example: 4:2:0 stores one Cb and one Cr value for every block of luma pixels, roughly halving the total data versus 4:4:4.
Explain the JPEG image compression standard. Describe its main steps - DCT, quantization, zig-zag ordering and entropy (Huffman) coding - with the help of a block diagram.
JPEG Image Compression Standard
JPEG (Joint Photographic Experts Group) is a widely used lossy compression standard for continuous-tone still images. It achieves high compression by discarding high-frequency detail the human eye cannot easily perceive.
Block Diagram (encoder)
Input Image -> Color Transform (RGB->YCbCr) -> 8x8 Block split
-> [Forward DCT] -> [Quantization] -> [Zig-zag scan]
-> [DC: DPCM] [AC: Run-Length] -> [Entropy (Huffman) coding] -> Compressed bitstream
Main Steps
1. Color transform and downsampling. RGB is converted to YCbCr, and the chroma channels are subsampled (commonly 4:2:0).
2. Block splitting. The image is divided into pixel blocks, processed independently. Pixel values are level-shifted (subtract 128).
3. Forward DCT (Discrete Cosine Transform). Each block is transformed from the spatial domain to the frequency domain:
The result is one DC coefficient (top-left, average value) and 63 AC coefficients. Energy is concentrated in the low-frequency (top-left) coefficients.
4. Quantization. Each DCT coefficient is divided by a value from a quantization table and rounded:
High-frequency coefficients are divided by larger values, so many become zero. This is the lossy step and controls the quality/compression trade-off.
5. Zig-zag ordering. The quantized block is read in a zig-zag pattern from low to high frequency. This groups the many trailing zeros together into a long run, making them easy to compress.
6. Entropy (Huffman) coding.
- The DC coefficient is coded differentially (DPCM) relative to the previous block's DC.
- The AC coefficients are run-length encoded as (run-of-zeros, value) pairs and then Huffman coded (variable-length codes giving short codes to frequent symbols). This is lossless.
At the decoder, the steps are reversed: entropy decoding -> de-zig-zag -> dequantization -> inverse DCT -> color transform back to RGB, reconstructing an approximation of the original image.
Section B: Short Answer Questions
Attempt any EIGHT questions.
What is Huffman coding? Construct a Huffman code for a given set of symbols and explain its working.
Huffman Coding
Huffman coding is a lossless, variable-length, prefix-free entropy coding technique that assigns shorter codes to more frequent symbols and longer codes to rarer symbols, minimizing the average code length. No code is a prefix of another, so the stream is uniquely decodable.
Algorithm
- List all symbols with their frequencies (probabilities).
- Repeatedly take the two lowest-frequency nodes and merge them into a new node whose frequency is their sum.
- Continue until one tree remains.
- Label left edges
0and right edges1; the path from root to a leaf is that symbol's code.
Example
Symbols and frequencies: A=45, B=13, C=12, D=16, E=9, F=5 (total 100).
Building the tree (merge two smallest each step): F(5)+E(9)=14; C(12)+B(13)=25; 14+D(16)=30; 25+30=55; 55+A(45)=100.
Resulting codes:
| Symbol | Freq | Code | Length |
|---|---|---|---|
| A | 45 | 0 | 1 |
| C | 12 | 100 | 3 |
| B | 13 | 101 | 3 |
| F | 5 | 1100 | 4 |
| E | 9 | 1101 | 4 |
| D | 16 | 111 | 3 |
Average length bits/symbol, versus 3 bits with fixed-length coding — a clear saving. Decoding walks the tree bit-by-bit from the root until a leaf is reached.
Explain run-length encoding (RLE) with an example.
Run-Length Encoding (RLE)
Run-length encoding is a simple lossless compression technique that replaces sequences of consecutive identical symbols (a run) with a single (symbol, count) pair. It is most effective on data with many long runs, such as simple graphics, fax images, and the long zero-runs in JPEG.
Working
Scan the data; whenever a value repeats consecutively, store it once together with the number of repetitions.
Example
Input string:
AAAAABBBCCDAA
RLE output (value, count):
5A 3B 2C 1D 2A -> encoded as: A5 B3 C2 D1 A2
The 13-character input is represented by 10 characters. For an image scan line like WWWWWWWWWWBBWWW (12 W, 2 B, ...), it becomes 12W2B3W, a large saving.
Note: RLE can expand data with few runs (e.g., ABCD becomes A1B1C1D1), so it works best on highly repetitive data.
Explain the role of DCT and quantization in JPEG compression.
Role of DCT and Quantization in JPEG
Discrete Cosine Transform (DCT)
The DCT converts each block of pixels from the spatial domain to the frequency domain, producing one DC coefficient (the block's average) and 63 AC coefficients.
- It concentrates the image energy into a few low-frequency coefficients (top-left), while high-frequency coefficients are usually small.
- DCT itself is reversible/lossless (apart from rounding); it does not compress, it prepares the data so that the redundant high-frequency information can be removed efficiently.
Quantization
Quantization divides each DCT coefficient by a value from a quantization table and rounds the result:
- Larger divisors are used for high frequencies (which the eye is less sensitive to), so many small high-frequency coefficients become zero.
- This is the only lossy step in JPEG and the main source of compression; the quantization table (scaled by a quality factor) controls the quality vs. compression trade-off.
- The resulting blocks of zeros are then efficiently compressed by zig-zag scanning, run-length and Huffman coding.
Summary: DCT re-organizes the data by frequency so that quantization can discard perceptually unimportant high-frequency detail, enabling high compression with little visible loss.
Differentiate between I-frames, P-frames and B-frames in MPEG.
I-frames, P-frames and B-frames in MPEG
MPEG video compresses a sequence using three frame (picture) types arranged in a Group of Pictures (GOP), exploiting both spatial and temporal redundancy.
| Feature | I-frame (Intra) | P-frame (Predictive) | B-frame (Bi-directional) |
|---|---|---|---|
| Coding | Coded independently, like a JPEG image (intra-coded only) | Predicted from a previous I or P frame using motion compensation | Predicted from both previous and following I/P frames |
| References | None | One (past) | Two (past and future) |
| Compression | Lowest (largest size) | Higher than I | Highest (smallest size) |
| Error resilience / random access | Best — acts as an access point; errors do not propagate | Errors propagate to later P/B frames | Not used as a reference (in classic MPEG), so errors don't propagate |
I-frame: a self-contained reference frame; needed for random access (seeking) and to start/refresh a GOP.
P-frame: stores only the motion vectors and differences relative to a preceding reference frame, giving good compression.
B-frame: interpolated from frames before and after it, giving the highest compression; this requires the encoder/decoder to reorder frames (decode order differs from display order).
A typical GOP display order: I B B P B B P B B P ...
Differentiate between the RGB and CMYK color models.
RGB vs CMYK Color Models
| Feature | RGB | CMYK |
|---|---|---|
| Full form | Red, Green, Blue | Cyan, Magenta, Yellow, Black (Key) |
| Type | Additive (adds light) | Subtractive (subtracts/absorbs light) |
| Primary action | Mixing colored light | Mixing colored inks/pigments |
| Black & white | All channels 0 = black; all max = white | No ink = white (paper); C+M+Y+K = black |
| Number of components | 3 | 4 |
| Used by | Monitors, TVs, cameras, scanners, web | Color printers, printing presses |
| Color gamut | Wider (more vivid, bright colors) | Narrower (cannot reproduce some bright RGB colors) |
Key idea: RGB is device-of-light based — increasing all values moves toward white. CMYK is ink/pigment based — increasing values absorbs more light and moves toward black. A separate K (black) channel is added in CMYK because mixing C, M, Y inks yields a muddy brown rather than true black, and using black ink also saves the costlier colored inks. Images created in RGB on screen must be converted to CMYK for accurate printing, which may shift some out-of-gamut colors.
Explain sampling and quantization of digital audio.
Sampling and Quantization of Digital Audio
Converting an analog (continuous) audio signal into digital form requires two steps performed by an Analog-to-Digital Converter (ADC): sampling (discretizing time) and quantization (discretizing amplitude).
Sampling
Sampling measures the amplitude of the continuous signal at regular time intervals. The number of samples per second is the sampling rate / frequency (), measured in Hz.
- By the Nyquist–Shannon theorem, must be at least twice the highest frequency in the signal to avoid aliasing: .
- Human hearing extends to ~20 kHz, so CD-quality audio uses Hz (44.1 kHz).
Quantization
Each sampled amplitude is rounded to the nearest of a finite set of levels, represented by a fixed number of bits — the bit depth . The number of levels .
- 8-bit gives 256 levels; 16-bit (CD quality) gives levels.
- The rounding error is called quantization error/noise; more bits = smaller error and higher signal-to-noise ratio.
Storage / Bit Rate
Example (CD stereo): bits/s Mbps.
Together, sampling rate and bit depth determine the quality and size of the digital audio.
Differentiate between lossy and lossless compression with examples.
Lossy vs Lossless Compression
| Feature | Lossless Compression | Lossy Compression |
|---|---|---|
| Data recovery | Original is exactly reconstructed (no data lost) | Some data permanently discarded; only an approximation is recovered |
| Compression ratio | Lower (typically 2:1 to 3:1) | Much higher (10:1 to 100:1) |
| Quality | No quality loss | Quality reduced (often imperceptibly) |
| Reversibility | Fully reversible | Irreversible |
| Basis | Removes statistical / coding redundancy | Removes perceptually irrelevant information |
| Used for | Text, program files, medical/legal images, archives | Photos, audio, video streaming |
| Examples | Huffman, RLE, LZW, ZIP, PNG, GIF, FLAC | JPEG, MPEG, MP3, AAC, H.264 |
Lossless is essential where every bit matters (e.g., executables, text, medical imaging) — the decompressed file is bit-for-bit identical to the original.
Lossy sacrifices unimportant detail (exploiting limits of human vision/hearing) to achieve far higher compression, which is acceptable for everyday photos, music and video where small quality loss is unnoticeable.
What are the characteristics of multimedia data? Explain the storage requirements.
Characteristics of Multimedia Data and Storage Requirements
Characteristics of Multimedia Data
- Large volume / high data size — images, audio and especially video produce huge amounts of data, demanding large storage and high bandwidth.
- Real-time / continuous nature — audio and video are continuous (time-dependent) media that must be captured and played back at a fixed rate; late data is useless.
- Temporal requirements / QoS — needs guaranteed throughput, low delay and low jitter; requires intra- and inter-media synchronization.
- Heterogeneity — combines different media types (text, graphics, image, audio, video, animation) with different formats and needs.
- Voluminous and compressible — contains much redundancy, so it is usually stored/transmitted in compressed form (JPEG, MPEG, MP3).
- Interactivity — often requires random access, fast-forward, rewind and user interaction.
Storage Requirements (with examples)
Uncompressed multimedia is very large. Storage size formulas:
- Image: . Example: a 24-bit image MB.
- Audio: . CD stereo Mbps, i.e. ~10 MB per minute.
- Video: frame size frames/sec. Raw SD video can exceed 20 MB/second (over 1 GB/minute).
Because of these sizes, multimedia systems rely heavily on compression and on storage/transmission media with large capacity and high transfer rates (e.g., DVD, Blu-ray, SSDs, streaming servers with QoS).
What is multimedia synchronization? Differentiate intra-media and inter-media synchronization.
Multimedia Synchronization: Intra-media vs Inter-media
Multimedia synchronization is the maintenance of the correct temporal, spatial and content relationships between media objects during capture, storage, transmission and, most importantly, playback, so the presentation is coherent (e.g., sound matching the picture).
Difference between Intra-media and Inter-media Synchronization
| Aspect | Intra-media (intra-stream) | Inter-media (inter-stream) |
|---|---|---|
| Scope | Timing within a single media stream | Timing between two or more media streams |
| Goal | Maintain a constant, correct playout rate of units in one stream | Keep different streams temporally aligned with each other |
| Example | Displaying video frames at exactly 25 fps; playing audio samples at 44.1 kHz | Lip-sync: audio matched to video; subtitles matched to speech |
| Problem if it fails | Jitter, jerky or too-fast/too-slow playback | Audio leads/lags video; annotations appear at the wrong time |
| Tolerance | Depends on the stream's frame/sample rate | Lip-sync skew tolerable up to about ms |
In short: intra-media synchronization ensures each individual stream plays smoothly at its own correct rate, whereas inter-media synchronization ensures separate streams stay coordinated with one another.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) question paper 2079?
- The full BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2079 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Multimedia Computing (BSc CSIT, CSC467) 2079 paper come with solutions?
- Yes. Every question on this Multimedia Computing (BSc CSIT, CSC467) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2079 paper?
- The BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2079 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Multimedia Computing (BSc CSIT, CSC467) past paper free?
- Yes — reading and attempting this Multimedia Computing (BSc CSIT, CSC467) past paper on Kekkei is completely free.