BSc CSIT (TU) Science Multimedia Computing (BSc CSIT, CSC467) Question Paper 2078 Nepal
This is the official BSc CSIT (TU) (Science stream) Multimedia Computing (BSc CSIT, CSC467) question paper for 2078, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Multimedia Computing (BSc CSIT, CSC467) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) exam or solving previous years' question papers, this 2078 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain entropy and source coding for multimedia. Discuss Huffman coding, run-length encoding and arithmetic coding, and construct a Huffman code for a given set of symbol frequencies.
Entropy and Source Coding for Multimedia
Entropy measures the average information content (uncertainty) of a source. For a source emitting symbols with probability , Shannon's entropy is:
Entropy is the theoretical lower bound on the average number of bits per symbol for lossless coding. Source coding removes statistical redundancy so the average code length approaches .
Huffman Coding
A prefix, variable-length code built by repeatedly merging the two least-probable symbols into a binary tree; shorter codes are assigned to frequent symbols. It is optimal among integer-length prefix codes, with average length satisfying .
Run-Length Encoding (RLE)
Replaces runs of identical symbols by a (value, count) pair, e.g. AAAAB -> (A,4)(B,1). Very effective for data with long uniform runs (fax, simple graphics) but poor for noisy data.
Arithmetic Coding
Encodes an entire message as a single fractional number in by successively narrowing an interval according to symbol probabilities. It can use fractional bits per symbol and so approaches the entropy bound more closely than Huffman, especially for skewed or small alphabets.
Worked Huffman Example
Given symbols with frequencies: A=45, B=13, C=12, D=16, E=9, F=5 (total 100).
- Merge F(5)+E(9)=14
- Merge C(12)+B(13)=25
- Merge 14+D(16)=30
- Merge 25+30=55
- Merge A(45)+55=100
Resulting codes (one valid assignment):
| Symbol | Freq | Code | Bits |
|---|---|---|---|
| A | 45 | 0 | 1 |
| C | 12 | 100 | 3 |
| B | 13 | 101 | 3 |
| F | 5 | 1100 | 4 |
| E | 9 | 1101 | 4 |
| D | 16 | 111 | 3 |
Average length bits/symbol, compared with 3 bits for fixed-length coding.
Explain digital audio representation. Discuss sampling, quantization, PCM, and audio compression techniques including MPEG audio (MP3) with the role of psychoacoustic models.
Digital Audio Representation
Sound is a continuous (analog) pressure wave. To process it digitally it must be sampled (discretized in time) and quantized (discretized in amplitude).
Sampling
The analog signal is measured at regular intervals at the sampling rate . By the Nyquist theorem, to avoid aliasing (e.g. CD audio uses 44.1 kHz to cover the 20 kHz audible range).
Quantization
Each sample's amplitude is rounded to one of levels for -bit resolution (e.g. 16-bit = 65,536 levels). Quantization introduces quantization noise; signal-to-noise ratio improves about 6 dB per bit.
PCM (Pulse Code Modulation)
The standard uncompressed representation: a stream of quantized sample values. Bit rate . CD stereo Mbps.
Audio Compression and MPEG Audio (MP3)
- Lossless (FLAC, ALS) removes redundancy only.
- Lossy (MP3 = MPEG-1/2 Layer III) removes perceptually irrelevant data.
MP3 pipeline: the signal is split into frequency sub-bands using a filter bank and MDCT; a psychoacoustic model analyses each frame to decide what can be discarded; bits are allocated and quantized accordingly; finally Huffman coding packs the result. MP3 achieves roughly 10:1–12:1 compression with near-transparent quality at 128 kbps.
Role of Psychoacoustic Models
They exploit limits of human hearing:
- Absolute threshold of hearing — very quiet tones are inaudible.
- Frequency (simultaneous) masking — a loud tone hides nearby weaker tones.
- Temporal masking — sounds before/after a loud sound are masked.
Bits are allocated so that the quantization noise stays below the computed masking threshold, making the loss inaudible.
What is a multimedia system? Explain the characteristics of multimedia data, the storage and coding requirements, and the components of a multimedia computing system.
Multimedia System
A multimedia system is a computer-based system that can capture, store, process, transmit and present information in more than one medium — combining text, graphics, images, audio, video and animation in an integrated, often interactive, way.
Characteristics of Multimedia Data
- Large volume / high storage demand (especially audio and video).
- High bandwidth for transmission.
- Real-time and continuous nature — audio/video must be played within timing (latency, jitter) constraints.
- Time dependency / synchronization — media streams (e.g. lip-sync) must stay aligned.
- Heterogeneity — different media have different data and processing requirements.
Storage and Coding Requirements
Raw multimedia is huge: uncompressed CD audio is ~1.4 Mbps and SD video can exceed 100 Mbps. Therefore systems rely on compression/coding (JPEG, MPEG, MP3, H.264) to reduce storage and bandwidth, plus fast storage with high sustained throughput (SSD/RAID) and indexing for random access.
Components of a Multimedia Computing System
- Hardware: capture devices (camera, microphone, scanner), high-speed CPU/GPU, large RAM, fast storage, sound/graphics cards, display and audio output.
- Software: operating system with multimedia/real-time support, codecs, authoring tools (e.g. Adobe), media players and editors.
- Storage media: hard disks, SSDs, optical media, cloud storage.
- Networking/communication: high-bandwidth networks and streaming protocols (RTP/RTSP) for delivery.
- Compression standards that tie the components together.
Section B: Short Answer Questions
Attempt any EIGHT questions.
Define entropy in the context of coding. How is it related to compression?
Entropy in coding is the average information content of a source, bits per symbol, representing the minimum average number of bits needed to losslessly encode each symbol.
Relation to compression: Entropy is the theoretical lower bound for lossless compression. A good entropy coder (Huffman, arithmetic) produces an average code length approaching . Low-entropy (highly predictable/redundant) data compresses well; high-entropy (random) data cannot be compressed below .
Explain arithmetic coding and how it differs from Huffman coding.
Arithmetic coding represents an entire message as a single number (interval) in . Starting from the full interval, it is recursively subdivided in proportion to each symbol's probability; the final interval (or any number inside it) encodes the whole message. It can assign fractional bits per symbol.
Differences from Huffman coding:
| Aspect | Huffman | Arithmetic |
|---|---|---|
| Unit | One code per symbol | One number for whole message |
| Bits/symbol | Integer | Fractional |
| Optimality | Optimal only for integer lengths | Approaches entropy more closely |
| Skewed/small alphabet | Inefficient (min 1 bit/symbol) | Efficient |
| Adaptivity | Needs new tree | Easily adaptive |
Arithmetic coding generally achieves better compression but is computationally heavier and historically encumbered by patents.
Explain motion estimation and motion compensation in video compression.
Motion estimation and motion compensation exploit temporal redundancy between successive video frames, since consecutive frames are usually very similar.
Motion estimation: Each frame is divided into blocks (e.g. 16x16 macroblocks). For each block in the current frame, the encoder searches a region of a reference frame for the best-matching block, producing a motion vector (the displacement) — typically using block-matching with a criterion such as minimum SAD/MSE.
Motion compensation: Using the motion vectors, the encoder predicts the current frame from the reference frame. Only the motion vectors and the small residual (prediction error) are encoded (the residual is then DCT-transformed and quantized).
This greatly reduces bit rate and is fundamental to P-frames and B-frames in MPEG/H.26x codecs.
Write short notes on MP3 audio compression and psychoacoustic models.
MP3 Audio Compression
MP3 (MPEG-1/2 Audio Layer III) is a lossy audio compression format. The audio is divided into short frames, transformed into the frequency domain using a polyphase filter bank and MDCT, then quantized and Huffman coded. It typically gives 10:1–12:1 compression, with near-CD quality at about 128 kbps, by discarding perceptually irrelevant data rather than statistically redundant data.
Psychoacoustic Models
MP3's quality relies on a psychoacoustic model of human hearing:
- Absolute threshold of hearing: sounds quieter than the threshold are removed.
- Frequency masking: a strong tone makes nearby weaker tones inaudible.
- Temporal masking: sounds just before/after a loud sound are masked.
The model computes a masking threshold for each frame; the encoder allocates bits and sets quantization so that the resulting noise stays below this threshold, keeping the loss inaudible while saving bits.
What is computer animation? Differentiate between frame-based and key-frame animation.
Computer animation is the technique of creating the illusion of movement by rapidly displaying a sequence of images (frames) generated or manipulated by a computer, exploiting persistence of vision.
Frame-based vs Key-frame animation:
| Frame-based animation | Key-frame animation |
|---|---|
| Every individual frame is created/drawn explicitly | Only important (key) frames are defined |
| In-between frames are also drawn manually | In-between frames are generated automatically by tweening/interpolation |
| Labour-intensive, large storage | Less effort, smaller storage |
| Full control over each frame | Smooth motion with less manual work |
| Traditional cel/flip-book style | Used in modern tools (Flash, After Effects, 3D software) |
In key-frame animation the animator sets the start and end states (position, scale, colour) at key frames and the software computes the intermediate frames.
What is hypermedia? Differentiate between hypertext and hypermedia.
Hypermedia is an extension of hypertext in which the nodes and links connect not only text but also other media — images, audio, video, graphics and animation — allowing non-linear navigation through multimedia content (e.g. the modern World Wide Web).
Hypertext vs Hypermedia:
| Hypertext | Hypermedia |
|---|---|
| Links text-only documents | Links any media (text, image, audio, video) |
| Navigation through text nodes via hyperlinks | Navigation through multimedia nodes |
| Subset/special case | Superset that includes hypertext |
| Example: early text-based help systems | Example: modern web pages, interactive CD-ROMs |
In short, hypermedia = hypertext + multimedia: every hypermedia system is hypertext-based, but hypertext deals only with linked text.
Write short notes on multimedia streaming and the issues involved.
Multimedia Streaming
Streaming delivers audio/video so that playback begins while the rest of the content is still being downloaded, instead of waiting for a full download. A small buffer absorbs network variation; protocols such as RTP/RTSP or adaptive HTTP streaming (HLS, MPEG-DASH) are used. It may be on-demand (stored media) or live.
Issues Involved
- Bandwidth limitation: the network may not sustain the required data rate, causing rebuffering.
- Latency and delay: especially critical for live/interactive streams.
- Jitter: variable packet delay disrupts smooth playback; handled by buffering.
- Packet loss: UDP-based streaming may drop packets, degrading quality; needs error concealment/FEC.
- Synchronization: keeping audio and video (lip-sync) aligned.
- Heterogeneous clients: varying device/network capabilities require adaptive bit-rate streaming.
- QoS / continuous real-time delivery must be maintained.
Explain the Discrete Cosine Transform (DCT) and its importance in image compression.
Discrete Cosine Transform (DCT)
The DCT transforms a block of spatial-domain pixel values into the frequency domain, expressing the block as a sum of cosine basis functions. For an block (JPEG uses ), the 2-D forward DCT is:
where for and otherwise. is the DC coefficient (average); the rest are AC coefficients of increasing frequency.
Importance in Image Compression
- It concentrates energy into a few low-frequency coefficients (energy compaction).
- Human eyes are less sensitive to high frequencies, so after the DCT those coefficients can be heavily quantized (set to zero), giving large compression.
- The DCT itself is lossless and reversible; loss happens only in the quantization step.
- It is the core transform of JPEG and MPEG image/video coding.
Write short notes on common multimedia file formats (JPEG, GIF, PNG, MPEG).
Common Multimedia File Formats
JPEG (.jpg): A lossy image format using DCT-based compression with adjustable quality. Excellent for photographs and natural images (24-bit colour) but introduces artifacts and is unsuitable for sharp-edged graphics/text. Does not support transparency.
GIF (.gif): A lossless image format limited to an 8-bit (256-colour) palette using LZW compression. Supports transparency and simple animation; best for logos, icons and small animations, poor for photographs.
PNG (.png): A lossless format supporting up to 24/32-bit colour with alpha transparency and better compression than GIF (DEFLATE). Ideal for web graphics and images needing transparency, but no native animation and larger than JPEG for photos.
MPEG (.mpg/.mp4): A family of lossy video/audio compression standards (MPEG-1/2/4, H.264) using DCT plus motion estimation/compensation to exploit temporal redundancy. Used for DVD, digital TV and streaming video.
| Format | Type | Use |
|---|---|---|
| JPEG | Lossy image | Photographs |
| GIF | Lossless, 256 colours | Logos, simple animation |
| PNG | Lossless, alpha | Web graphics, transparency |
| MPEG | Lossy video/audio | Movies, streaming |
Frequently asked questions
- Where can I find the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) question paper 2078?
- The full BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2078 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Multimedia Computing (BSc CSIT, CSC467) 2078 paper come with solutions?
- Yes. Every question on this Multimedia Computing (BSc CSIT, CSC467) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2078 paper?
- The BSc CSIT (TU) Multimedia Computing (BSc CSIT, CSC467) 2078 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Multimedia Computing (BSc CSIT, CSC467) past paper free?
- Yes — reading and attempting this Multimedia Computing (BSc CSIT, CSC467) past paper on Kekkei is completely free.