Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

Explain the theory of estimation. Differentiate between point estimation and interval estimation and explain the properties of a good estimator.

Theory of Estimation

Estimation is the branch of statistical inference concerned with inferring the value of an unknown population parameter (e.g. mean μ\mu, variance σ2\sigma^2, proportion PP) from a sample statistic computed from sample data. The sample statistic used for this purpose is called an estimator, and a particular numerical value it takes is an estimate.

There are two broad approaches:

Point Estimation vs Interval Estimation

BasisPoint EstimationInterval Estimation
MeaningGives a single value as the estimate of the parameterGives a range (interval) within which the parameter is expected to lie
Formθ^\hat{\theta} (e.g. xˉ\bar{x} for μ\mu)(θ^L, θ^U)(\hat{\theta}_L,\ \hat{\theta}_U), e.g. xˉ±Zα/2σn\bar{x} \pm Z_{\alpha/2}\dfrac{\sigma}{\sqrt{n}}
Probability statementNo probability/confidence attachedAssociated with a confidence level (1α)(1-\alpha), e.g. 95%
ReliabilityDoes not indicate precision or errorIndicates precision via interval width and margin of error
Examplexˉ=50\bar{x}=50 estimates μ\muμ\mu lies in (47, 53)(47,\ 53) with 95% confidence

Properties of a Good Estimator

An estimator θ^\hat{\theta} of a parameter θ\theta is considered good if it possesses the following properties:

  1. Unbiasedness: θ^\hat{\theta} is unbiased if its expected value equals the parameter, i.e. E(θ^)=θE(\hat{\theta})=\theta. For example, the sample mean xˉ\bar{x} is an unbiased estimator of μ\mu.

  2. Consistency: θ^\hat{\theta} is consistent if it converges in probability to θ\theta as the sample size increases, i.e. θ^θ\hat{\theta}\to\theta as nn\to\infty. Larger samples give estimates closer to the true value.

  3. Efficiency: Among all unbiased estimators, the one with the minimum variance is the most efficient. If θ^1\hat{\theta}_1 and θ^2\hat{\theta}_2 are both unbiased and Var(θ^1)<Var(θ^2)Var(\hat{\theta}_1)<Var(\hat{\theta}_2), then θ^1\hat{\theta}_1 is more efficient.

  4. Sufficiency: An estimator is sufficient if it utilizes all the information in the sample relevant to the parameter, so that no other statistic can add further information about θ\theta.

A good estimator should ideally be unbiased, consistent, efficient and sufficient.

estimation
2long10 marks

Explain the chi-square test. Describe its applications in testing goodness of fit and independence of attributes.

The Chi-Square (χ2\chi^2) Test

The chi-square test is a non-parametric test based on the chi-square distribution, used to test hypotheses about categorical (attribute) data by comparing observed frequencies (O)(O) with expected frequencies (E)(E).

The test statistic is:

χ2=(OiEi)2Ei\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}

which follows a χ2\chi^2 distribution with appropriate degrees of freedom. A large value of χ2\chi^2 indicates a large discrepancy between observed and expected frequencies, leading to rejection of H0H_0.

Conditions: observations independent, sample reasonably large, total frequency N>50N>50, and each expected frequency 5\geq 5 (otherwise pool cells).

Application 1: Test of Goodness of Fit

Used to test whether an observed frequency distribution fits a theoretical/expected distribution (e.g. uniform, Binomial, Poisson, Normal).

  • H0H_0: The observed data agree with the assumed theoretical distribution.
  • H1H_1: The observed data do not fit the assumed distribution.
  • Compute expected frequencies from the theoretical distribution.
  • Statistic: χ2=(OE)2E\chi^2 = \sum \dfrac{(O-E)^2}{E}.
  • Degrees of freedom =n1k= n - 1 - k, where nn = number of classes and kk = number of parameters estimated from the data.
  • If χcal2>χtab2\chi^2_{cal} > \chi^2_{tab} at level α\alpha, reject H0H_0 (poor fit).

Application 2: Test of Independence of Attributes

Used with a contingency table to test whether two attributes (e.g. gender and preference) are independent or associated.

  • H0H_0: The two attributes are independent.
  • H1H_1: The two attributes are associated (dependent).
  • For an r×cr \times c table, expected frequency of a cell:
Eij=(Row total)×(Column total)Grand totalE_{ij} = \frac{(\text{Row total}) \times (\text{Column total})}{\text{Grand total}}
  • Statistic: χ2=(OE)2E\chi^2 = \sum \dfrac{(O-E)^2}{E}.
  • Degrees of freedom =(r1)(c1)= (r-1)(c-1).
  • If χcal2>χtab2\chi^2_{cal} > \chi^2_{tab}, reject H0H_0 and conclude the attributes are associated.

Other Uses

  • Test for a specified population variance σ2\sigma^2.
  • Test of homogeneity (whether several populations have the same distribution).
chi-square
3long10 marks

Define a probability distribution. Explain the binomial distribution with its mean and variance, and state the conditions under which it is applied.

Probability Distribution

A probability distribution is a description that assigns a probability to each possible value of a random variable. For a discrete random variable XX taking values x1,x2,x_1, x_2, \dots, the function P(X=xi)=piP(X=x_i)=p_i is the probability mass function, satisfying:

pi0andipi=1p_i \geq 0 \quad \text{and} \quad \sum_i p_i = 1

For a continuous random variable, it is described by a probability density function f(x)f(x) with f(x)0f(x)\geq 0 and f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1.

Binomial Distribution

A discrete random variable XX follows a binomial distribution if it represents the number of successes in nn independent trials, each with probability of success pp (and failure q=1pq = 1-p). Its probability mass function is:

P(X=x)=(nx)pxqnx,x=0,1,2,,nP(X = x) = \binom{n}{x} p^x q^{\,n-x}, \quad x = 0, 1, 2, \dots, n

where (nx)=n!x!(nx)!\binom{n}{x} = \dfrac{n!}{x!(n-x)!}.

Mean and Variance

  • Mean: E(X)=npE(X) = np
  • Variance: Var(X)=npqVar(X) = npq
  • Standard deviation: npq\sqrt{npq}

Since q<1q<1, the variance npqnpq is always less than the mean npnp.

Conditions (Assumptions) for Application

The binomial distribution applies when:

  1. The experiment consists of a fixed number nn of trials.
  2. Each trial has only two mutually exclusive outcomes — success and failure.
  3. The trials are independent of one another.
  4. The probability of success pp remains constant from trial to trial.

Example

Tossing a fair coin 5 times and counting the number of heads, where n=5n=5, p=12p=\tfrac12.

probabilitydistribution
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

Explain how to construct a confidence interval for a population mean.

Confidence Interval for a Population Mean

A confidence interval (CI) for the population mean μ\mu is a range of values, computed from sample data, that is expected to contain μ\mu with a stated confidence level (1α)(1-\alpha), e.g. 95%.

Case 1: σ\sigma known (or large sample, n>30n>30) — use Z

xˉ±Zα/2σn\bar{x} \pm Z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}}

Case 2: σ\sigma unknown and small sample (n30n \leq 30) — use t

xˉ±tα/2,n1sn\bar{x} \pm t_{\alpha/2,\,n-1}\,\frac{s}{\sqrt{n}}

where ss is the sample standard deviation and degrees of freedom =n1= n-1.

Steps to Construct

  1. Compute the sample mean xˉ\bar{x} (and ss if σ\sigma unknown).
  2. Choose the confidence level (1α)(1-\alpha) and find the critical value Zα/2Z_{\alpha/2} (e.g. 1.961.96 for 95%) or tα/2,n1t_{\alpha/2,n-1}.
  3. Compute the standard error σn\dfrac{\sigma}{\sqrt n} (or sn\dfrac{s}{\sqrt n}).
  4. Compute the margin of error E=Zα/2σnE = Z_{\alpha/2}\dfrac{\sigma}{\sqrt n}.
  5. The interval is (xˉE, xˉ+E)(\bar{x}-E,\ \bar{x}+E).

Interpretation

For a 95% CI, if many samples were drawn and an interval computed for each, about 95% of those intervals would contain the true mean μ\mu.

confidence-interval
5short5 marks

Explain the F-test for the equality of two population variances.

F-Test for Equality of Two Population Variances

The F-test is used to test whether two independent normal populations have equal variances. It is based on the ratio of two independent sample variances, which follows the F-distribution.

Hypotheses

  • H0:σ12=σ22H_0: \sigma_1^2 = \sigma_2^2 (the two population variances are equal)
  • H1:σ12σ22H_1: \sigma_1^2 \neq \sigma_2^2 (they are unequal)

Test Statistic

From two independent random samples of sizes n1n_1 and n2n_2 with sample variances s12s_1^2 and s22s_2^2:

F=s12s22,where s12>s22F = \frac{s_1^2}{s_2^2}, \quad \text{where } s_1^2 > s_2^2

The larger variance is placed in the numerator so that F1F \geq 1. The unbiased sample variance is

s2=1n1(xixˉ)2.s^2 = \frac{1}{n-1}\sum (x_i - \bar{x})^2.

Degrees of Freedom

ν1=n11\nu_1 = n_1 - 1 (numerator), ν2=n21\nu_2 = n_2 - 1 (denominator).

Decision Rule

Compare FcalF_{cal} with the tabulated Fα,(ν1,ν2)F_{\alpha,(\nu_1,\nu_2)}:

  • If FcalFtabF_{cal} \leq F_{tab}, accept H0H_0 — variances are equal.
  • If Fcal>FtabF_{cal} > F_{tab}, reject H0H_0 — variances differ significantly.

Assumptions

Both populations are normal, and the two samples are independent and drawn randomly.

f-test
6short5 marks

Define index numbers and explain Laspeyres' and Paasche's price index methods.

Index Numbers

An index number is a statistical measure that expresses the relative change in a variable or a group of related variables (such as prices, quantities, or value) over time, place, or other characteristic, with respect to a chosen base period (taken as 100). It is often called an economic barometer.

A price index measures the average change in the prices of a basket of commodities between the base period (0) and the current period (1).

Laspeyres' Price Index

Uses base-year quantities (q0)(q_0) as weights:

P01L=p1q0p0q0×100P_{01}^{L} = \frac{\sum p_1 q_0}{\sum p_0 q_0} \times 100

It answers: what is the cost now of the base-year basket compared with its base-year cost? It tends to overstate price rises because it ignores substitution away from goods that became dearer.

Paasche's Price Index

Uses current-year quantities (q1)(q_1) as weights:

P01P=p1q1p0q1×100P_{01}^{P} = \frac{\sum p_1 q_1}{\sum p_0 q_1} \times 100

It answers: what would the current basket have cost in the base year versus now? It tends to understate price rises.

Comparison

BasisLaspeyresPaasche
WeightsBase-year quantities q0q_0Current-year quantities q1q_1
BiasUpward (overestimates)Downward (underestimates)
Data neededOnly base-year quantitiesCurrent-year quantities each period

Fisher's ideal index is the geometric mean of the two: P01F=PL×PPP_{01}^{F} = \sqrt{P^{L} \times P^{P}}.

index-numbers
7short5 marks

State and explain the addition and multiplication theorems of probability with examples.

Addition Theorem of Probability

Gives the probability of the union of events (occurrence of at least one event).

For any two events AA and BB:

P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)

If AA and BB are mutually exclusive (cannot occur together, AB=A\cap B=\varnothing):

P(AB)=P(A)+P(B)P(A \cup B) = P(A) + P(B)

Example: A card is drawn from 52 cards. P(King)=452P(\text{King}) = \tfrac{4}{52}, P(Heart)=1352P(\text{Heart}) = \tfrac{13}{52}, P(King of Hearts)=152P(\text{King of Hearts}) = \tfrac{1}{52}.

P(King or Heart)=452+1352152=1652=413P(\text{King or Heart}) = \frac{4}{52} + \frac{13}{52} - \frac{1}{52} = \frac{16}{52} = \frac{4}{13}

Multiplication Theorem of Probability

Gives the probability of the joint occurrence (intersection) of events.

For any two events AA and BB:

P(AB)=P(A)P(BA)=P(B)P(AB)P(A \cap B) = P(A)\cdot P(B \mid A) = P(B)\cdot P(A \mid B)

where P(BA)P(B\mid A) is the conditional probability of BB given AA.

If AA and BB are independent (occurrence of one does not affect the other):

P(AB)=P(A)P(B)P(A \cap B) = P(A)\cdot P(B)

Example: Two cards are drawn one after another without replacement.

P(both Kings)=452×351=122652=1221P(\text{both Kings}) = \frac{4}{52} \times \frac{3}{51} = \frac{12}{2652} = \frac{1}{221}

If drawn with replacement (independent): 452×452=1169\tfrac{4}{52}\times\tfrac{4}{52}=\tfrac{1}{169}.

probability
8short5 marks

Explain the Poisson distribution with its mean and variance and state its applications.

Poisson Distribution

The Poisson distribution is a discrete probability distribution that gives the probability of a given number of independent events occurring in a fixed interval of time, space, area, or volume, when these events occur at a constant average rate λ\lambda. It is the limiting case of the binomial distribution when nn \to \infty, p0p \to 0, with np=λnp = \lambda finite.

The probability mass function is:

P(X=x)=eλλxx!,x=0,1,2,P(X = x) = \frac{e^{-\lambda}\,\lambda^{x}}{x!}, \quad x = 0, 1, 2, \dots

where λ>0\lambda > 0 is the average number of occurrences and e2.718e \approx 2.718.

Mean and Variance

A characteristic property is that the mean and variance are equal:

Mean=E(X)=λ,Variance=Var(X)=λ\text{Mean} = E(X) = \lambda, \qquad \text{Variance} = Var(X) = \lambda

Conditions

  • Events occur independently.
  • The average rate λ\lambda is constant.
  • nn is large, pp is small (rare events).

Applications

Used to model the number of rare events, such as:

  1. Number of telephone calls received at an exchange per minute.
  2. Number of printing/typing errors per page of a book.
  3. Number of accidents on a highway per day.
  4. Number of defective items in a large batch.
  5. Number of customers/packets arriving at a server per unit time (queueing/network traffic).
poisson
9short5 marks

Define a random variable. Differentiate between discrete and continuous random variables with examples.

Random Variable

A random variable is a real-valued function that assigns a numerical value to each outcome (sample point) of a random experiment. It maps the sample space SS to the set of real numbers, i.e. X:SRX: S \to \mathbb{R}.

Example: In tossing two coins, if XX = number of heads, then XX takes values 0,1,20, 1, 2.

Discrete vs Continuous Random Variables

BasisDiscrete Random VariableContinuous Random Variable
ValuesTakes countable (finite or countably infinite) isolated valuesTakes any value within an interval (uncountable)
ProbabilityDescribed by a probability mass function P(X=x)P(X=x)Described by a probability density function f(x)f(x); P(X=x)=0P(X=x)=0
Total probabilityxP(X=x)=1\sum_x P(X=x) = 1f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1
Probability of a rangeSum over valuesP(aXb)=abf(x)dxP(a\le X\le b)=\int_a^b f(x)\,dx
ExamplesNumber of heads in tosses, number of defective items, number of accidentsHeight, weight, temperature, time, length

Examples

  • Discrete: Number of students present in a class; number of calls per hour.
  • Continuous: The height of a person (e.g. 165.3 cm); the time taken to run a race.
random-variable
10short5 marks

Define mathematical expectation. State and prove its properties.

Mathematical Expectation

The mathematical expectation (or expected value) of a random variable is the long-run average value it takes, weighted by probabilities. It is a measure of the central tendency of a probability distribution.

For a discrete random variable XX with p.m.f. P(X=xi)=piP(X=x_i)=p_i:

E(X)=ixipiE(X) = \sum_i x_i\,p_i

For a continuous random variable with density f(x)f(x):

E(X)=xf(x)dxE(X) = \int_{-\infty}^{\infty} x\,f(x)\,dx

Properties (with Proof)

1. Expectation of a constant: E(c)=cE(c) = c. Proof: E(c)=cpi=cpi=c1=c.E(c)=\sum c\,p_i = c\sum p_i = c\cdot 1 = c.

2. Constant multiplier: E(cX)=cE(X)E(cX) = c\,E(X). Proof: E(cX)=cxipi=cxipi=cE(X).E(cX)=\sum c x_i p_i = c\sum x_i p_i = c\,E(X).

3. Addition (linearity): E(X+Y)=E(X)+E(Y)E(X + Y) = E(X) + E(Y), for any random variables X,YX,Y. Proof (discrete): E(X+Y)=xy(x+y)P(x,y)=xyxP(x,y)+xyyP(x,y)=E(X)+E(Y).E(X+Y)=\sum_x\sum_y (x+y)P(x,y) = \sum_x\sum_y xP(x,y) + \sum_x\sum_y yP(x,y) = E(X)+E(Y).

4. Linear combination: E(aX+b)=aE(X)+bE(aX + b) = a\,E(X) + b (from properties 1–3).

5. Multiplication for independent variables: If XX and YY are independent, then E(XY)=E(X)E(Y)E(XY) = E(X)\,E(Y). Proof: For independence P(x,y)=P(x)P(y)P(x,y)=P(x)P(y), so E(XY)=xyxyP(x)P(y)=(xxP(x))(yyP(y))=E(X)E(Y).E(XY)=\sum_x\sum_y xy\,P(x)P(y) = \big(\sum_x xP(x)\big)\big(\sum_y yP(y)\big) = E(X)E(Y).

expectation
11short5 marks

Explain the t-test for testing the significance of the difference between two sample means.

t-Test for Difference Between Two Sample Means

This test checks whether the means of two independent small samples (n1,n230n_1, n_2 \le 30) drawn from two normal populations differ significantly, when the population variances are unknown but assumed equal.

Hypotheses

  • H0:μ1=μ2H_0: \mu_1 = \mu_2 (no significant difference in population means)
  • H1:μ1μ2H_1: \mu_1 \neq \mu_2 (two-tailed)

Test Statistic

t=xˉ1xˉ2S1n1+1n2t = \frac{\bar{x}_1 - \bar{x}_2}{S\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}

where the pooled standard deviation SS is

S2=(x1xˉ1)2+(x2xˉ2)2n1+n22=(n11)s12+(n21)s22n1+n22S^2 = \frac{\sum (x_1 - \bar{x}_1)^2 + \sum (x_2 - \bar{x}_2)^2}{n_1 + n_2 - 2} = \frac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1 + n_2 - 2}

Degrees of Freedom

ν=n1+n22\nu = n_1 + n_2 - 2

Decision Rule

Compare tcal|t_{cal}| with the tabulated value tα/2,νt_{\alpha/2,\,\nu}:

  • If tcalttab|t_{cal}| \leq t_{tab}, accept H0H_0 — means do not differ significantly.
  • If tcal>ttab|t_{cal}| > t_{tab}, reject H0H_0 — the difference is significant.

Assumptions

The two samples are independent and drawn from normal populations with equal (but unknown) variances.

t-test
12short5 marks

Explain the z-test for a large sample test of a single mean with an example.

Z-Test for a Single Mean (Large Sample)

For a large sample (n>30n > 30), the Z-test is used to test whether the sample mean xˉ\bar{x} differs significantly from a specified population mean μ0\mu_0. By the Central Limit Theorem, xˉ\bar{x} is approximately normally distributed.

Hypotheses

  • H0:μ=μ0H_0: \mu = \mu_0
  • H1:μμ0H_1: \mu \neq \mu_0 (two-tailed)

Test Statistic

Z=xˉμ0σ/nZ = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}

If the population standard deviation σ\sigma is unknown, the sample standard deviation ss is used (valid for large nn).

Decision Rule (5% level)

Compare Zcal|Z_{cal}| with the critical value Zα/2=1.96Z_{\alpha/2} = 1.96:

  • If Zcal1.96|Z_{cal}| \leq 1.96, accept H0H_0.
  • If Zcal>1.96|Z_{cal}| > 1.96, reject H0H_0.

Example

A sample of n=100n = 100 items has mean xˉ=52\bar{x} = 52 with σ=10\sigma = 10. Test whether the population mean is μ0=50\mu_0 = 50 at the 5% level.

Z=525010/100=21=2.0Z = \frac{52 - 50}{10/\sqrt{100}} = \frac{2}{1} = 2.0

Since Z=2.0>1.96|Z| = 2.0 > 1.96, we reject H0H_0 and conclude that the population mean differs significantly from 50.

z-test

Frequently asked questions

Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2078?
The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistics II (BSc CSIT, STA210) 2078 paper come with solutions?
Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 paper?
The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Statistics II (BSc CSIT, STA210) past paper free?
Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.