Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

What is sampling? Explain different methods of probability and non-probability sampling with their merits and demerits.

Sampling

Sampling is the process of selecting a representative subset (the sample) from a larger group (the population) so that conclusions about the whole population can be drawn by studying only the sample. It saves cost, time and effort and is often the only feasible approach when the population is large or testing is destructive.

Sampling methods are broadly classified into probability and non-probability sampling.

A. Probability Sampling

Every unit of the population has a known, non-zero chance of selection. Results can be generalised and sampling error can be estimated.

1. Simple Random Sampling (SRS) Each unit has an equal chance of selection (lottery method or random numbers).

  • Merits: unbiased, simple, error measurable.
  • Demerits: needs a complete sampling frame; units may be geographically scattered.

2. Stratified Sampling Population is divided into homogeneous strata and a random sample is drawn from each.

  • Merits: high precision, ensures representation of every subgroup.
  • Demerits: requires prior knowledge of strata; complex to organise.

3. Systematic Sampling Every kthk^{th} unit is selected after a random start, where k=N/nk = N/n.

  • Merits: simple, evenly spread over the frame.
  • Demerits: biased if the list has a periodic/hidden pattern.

4. Cluster / Multistage Sampling Population is divided into clusters; some clusters are selected at random and studied (fully or in further stages).

  • Merits: economical for wide geographical areas, no full frame needed.
  • Demerits: lower precision (high intra-cluster correlation).

B. Non-Probability Sampling

Selection is based on judgement/convenience; probability of selection is unknown and sampling error cannot be measured.

1. Convenience Sampling — units that are easiest to reach are chosen. Merit: fast, cheap. Demerit: highly biased, not representative.

2. Judgement (Purposive) Sampling — expert selects typical units. Merit: useful for small/specialised studies. Demerit: depends on investigator's bias.

3. Quota Sampling — units chosen until fixed quotas per category are met. Merit: quick, ensures group coverage. Demerit: selection within quota is biased.

4. Snowball Sampling — existing subjects refer further subjects. Merit: good for rare/hidden populations. Demerit: sample not representative.

Conclusion

Probability sampling allows valid statistical inference and is preferred for scientific surveys, whereas non-probability sampling is cheaper and quicker but cannot guarantee representativeness.

sampling
2long10 marks

What is analysis of variance (ANOVA)? Explain the procedure of one-way ANOVA with the construction of the ANOVA table.

Analysis of Variance (ANOVA)

ANOVA is a statistical technique (developed by R. A. Fisher) used to test whether the means of three or more populations are equal by partitioning the total variation in the data into components attributable to different sources. It compares the variance between groups with the variance within groups using the F-test.

Assumptions: observations are independent, drawn from normal populations, and the populations have equal variances (homoscedasticity).

One-Way ANOVA Procedure

Used when one factor (with kk levels/treatments) classifies the data.

Step 1 — Hypotheses

H0:μ1=μ2==μk(all means equal)H_0: \mu_1 = \mu_2 = \dots = \mu_k \quad (\text{all means equal}) H1:at least one mean differsH_1: \text{at least one mean differs}

Step 2 — Compute totals. Let TT = grand total of all NN observations and C=T2NC = \dfrac{T^2}{N} (correction factor).

Step 3 — Sum of squares

SST=xij2C(Total)SST = \sum x_{ij}^2 - C \quad(\text{Total}) SSB=i=1kTi2niC(Between treatments)SSB = \sum_{i=1}^{k} \frac{T_i^2}{n_i} - C \quad(\text{Between treatments}) SSE=SSTSSB(Within / Error)SSE = SST - SSB \quad(\text{Within / Error})

where TiT_i and nin_i are the total and size of the ithi^{th} group.

Step 4 — Degrees of freedom: Between = k1k-1, Within = NkN-k, Total = N1N-1.

Step 5 — Mean squares and F-ratio

MSB=SSBk1,MSE=SSENk,F=MSBMSEMSB = \frac{SSB}{k-1}, \qquad MSE = \frac{SSE}{N-k}, \qquad F = \frac{MSB}{MSE}

ANOVA Table

Source of VariationSum of Squaresd.f.Mean SquareF-ratio
Between treatmentsSSBSSBk1k-1MSB=SSB/(k1)MSB = SSB/(k-1)MSB/MSEMSB/MSE
Within (Error)SSESSENkN-kMSE=SSE/(Nk)MSE = SSE/(N-k)
TotalSSTSSTN1N-1

Step 6 — Decision. Compare calculated FF with the table value Fα,(k1,Nk)F_{\alpha,(k-1,N-k)}. If Fcal>FtabF_{cal} > F_{tab}, reject H0H_0 and conclude the treatment means differ significantly; otherwise accept H0H_0.

anova
3long10 marks

Explain the theory of estimation. Differentiate between point estimation and interval estimation and explain the properties of a good estimator.

Theory of Estimation

Estimation is the branch of statistical inference concerned with using sample data to estimate the unknown value of a population parameter (such as μ\mu, σ2\sigma^2 or PP). The sample statistic used for this purpose is called an estimator and a particular numerical value of it is an estimate. Estimation is of two types: point estimation and interval estimation.

Point Estimation vs Interval Estimation

BasisPoint EstimationInterval Estimation
MeaningGives a single value as the estimate of the parameterGives a range (interval) within which the parameter is expected to lie
Examplexˉ\bar{x} estimates μ\mu; s2s^2 estimates σ2\sigma^2xˉ±Zα/2σn\bar{x} \pm Z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}} for μ\mu
ReliabilityDoes not indicate the error/precisionCarries a confidence level (e.g. 95%) and shows precision
FormNumberInterval (L,U)(L, U)

A point estimate by itself tells nothing about how close it is to the true value, whereas an interval estimate quantifies the uncertainty.

Properties of a Good Estimator

1. Unbiasedness — The expected value of the estimator equals the parameter: E(θ^)=θE(\hat{\theta}) = \theta. (e.g. E(xˉ)=μE(\bar{x}) = \mu.)

2. Consistency — As the sample size nn \to \infty, the estimator converges to the true parameter: θ^Pθ\hat{\theta} \xrightarrow{P} \theta.

3. Efficiency — Among all unbiased estimators, the efficient one has the smallest variance, giving the most precise estimate.

4. Sufficiency — The estimator uses all the information in the sample about the parameter, so no other statistic can add more information.

An estimator that is unbiased, consistent, efficient and sufficient is regarded as the best (ideal) estimator.

estimation
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

Define Karl Pearson's coefficient of correlation and state its properties.

Karl Pearson's Coefficient of Correlation

Karl Pearson's coefficient of correlation, denoted rr, is a numerical measure of the degree and direction of linear relationship between two quantitative variables XX and YY. It is defined as the ratio of the covariance of the variables to the product of their standard deviations:

r=Cov(X,Y)σXσY=(XXˉ)(YYˉ)(XXˉ)2(YYˉ)2r = \frac{\text{Cov}(X,Y)}{\sigma_X\,\sigma_Y} = \frac{\sum (X-\bar{X})(Y-\bar{Y})}{\sqrt{\sum (X-\bar{X})^2}\,\sqrt{\sum (Y-\bar{Y})^2}}

Properties

  1. Range: 1r+1-1 \le r \le +1. r=+1r=+1 means perfect positive, r=1r=-1 perfect negative, and r=0r=0 no linear correlation.
  2. Sign: indicates the direction — positive (X,YX,Y move together), negative (move oppositely).
  3. Independent of origin and scale (unit-free): rr is unchanged if a constant is added to or each value is multiplied by a constant.
  4. Symmetric: rXY=rYXr_{XY} = r_{YX}.
  5. Geometric mean of regression coefficients: r=±bxybyxr = \pm\sqrt{b_{xy}\cdot b_{yx}}, with the sign of the regression coefficients.
  6. It measures only linear association; r=0r=0 does not imply the variables are unrelated (a non-linear relation may exist).
correlation
5short5 marks

What are regression coefficients? State their properties.

Regression Coefficients

In the two linear regression equations of YY on XX and XX on YY, the slopes are called the regression coefficients. They measure the average change in one variable for a unit change in the other.

  • Regression coefficient of YY on XX:   byx=rσyσx=Cov(X,Y)σx2\;b_{yx} = r\dfrac{\sigma_y}{\sigma_x} = \dfrac{\text{Cov}(X,Y)}{\sigma_x^2}
  • Regression coefficient of XX on YY:   bxy=rσxσy=Cov(X,Y)σy2\;b_{xy} = r\dfrac{\sigma_x}{\sigma_y} = \dfrac{\text{Cov}(X,Y)}{\sigma_y^2}

Properties

  1. Geometric mean gives correlation: r=±byxbxyr = \pm\sqrt{b_{yx}\cdot b_{xy}}, the sign being that of the regression coefficients.
  2. Same sign: both byxb_{yx} and bxyb_{xy} have the same sign, which is also the sign of rr.
  3. Product 1\le 1: since r2=byxbxy1r^2 = b_{yx}\,b_{xy} \le 1, the product of the two regression coefficients cannot exceed unity; hence if one is greater than 1 the other must be less than 1.
  4. Independent of origin but not of scale: adding a constant does not change them, but changing the scale does.
  5. The arithmetic mean of the two regression coefficients is greater than or equal to the correlation coefficient rr (when r>0r>0).
regression
6short5 marks

Explain the concept of sampling distribution and standard error.

Sampling Distribution

If all possible samples of a fixed size nn are drawn from a population and a statistic (e.g. the sample mean xˉ\bar{x}) is computed for each, the probability distribution of that statistic over all such samples is called its sampling distribution. For example, the sampling distribution of the mean has its own mean μxˉ=μ\mu_{\bar{x}} = \mu and describes how sample means vary from sample to sample. It forms the theoretical basis of statistical inference (estimation and testing of hypotheses).

Standard Error

The standard error (S.E.) is the standard deviation of the sampling distribution of a statistic. It measures the magnitude of the sampling fluctuation, i.e. how much the statistic is expected to vary from the true parameter due to chance.

For the sample mean (population s.d. σ\sigma, sample size nn):

S.E.(xˉ)=σn\text{S.E.}(\bar{x}) = \frac{\sigma}{\sqrt{n}}

For a sample proportion pp:   S.E.(p)=PQn\;\text{S.E.}(p) = \sqrt{\dfrac{PQ}{n}}.

Uses of standard error:

  • Smaller S.E. means a more precise/reliable estimate; it decreases as nn increases.
  • It is used to set up confidence intervals and to compute test statistics in hypothesis testing.
samplingdistribution
7short5 marks

Explain how to construct a confidence interval for a population mean.

Confidence Interval for a Population Mean

A confidence interval (CI) is a range of values, computed from a sample, that is expected to contain the true population mean μ\mu with a stated probability called the confidence level (1α)(1-\alpha) (e.g. 95%).

Case 1 — Population variance σ2\sigma^2 known (or large sample, n30n \ge 30): The statistic xˉμσ/n\dfrac{\bar{x}-\mu}{\sigma/\sqrt{n}} follows the standard normal distribution, so the 100(1α)%100(1-\alpha)\% CI is

xˉ±Zα/2σn\bar{x} \pm Z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}}

where Zα/2=1.96Z_{\alpha/2}=1.96 for 95% and 2.582.58 for 99%. (If σ\sigma is unknown in a large sample, replace it by the sample s.d. ss.)

Case 2 — σ\sigma unknown and small sample (n<30n<30): Use the tt-distribution with n1n-1 degrees of freedom:

xˉ±tα/2,n1sn\bar{x} \pm t_{\alpha/2,\,n-1}\,\frac{s}{\sqrt{n}}

Steps: (1) compute xˉ\bar{x} and the standard error; (2) choose the confidence level and find Zα/2Z_{\alpha/2} or tα/2t_{\alpha/2}; (3) form the limits xˉ±(critical value)×S.E.\bar{x} \pm (\text{critical value})\times \text{S.E.}

Interpretation: we are 95% confident that the interval contains μ\mu; over repeated sampling, 95% of such intervals would capture the true mean.

confidence-interval
8short5 marks

Explain the F-test for the equality of two population variances.

F-Test for Equality of Two Population Variances

The F-test is used to test whether two independent normal populations have equal variances. It compares the ratio of two sample variances.

Step 1 — Hypotheses

H0:σ12=σ22H1:σ12σ22H_0: \sigma_1^2 = \sigma_2^2 \qquad H_1: \sigma_1^2 \neq \sigma_2^2

Step 2 — Test statistic. From samples of sizes n1,n2n_1, n_2 with unbiased sample variances s12s_1^2 and s22s_2^2, where s2=(xxˉ)2n1s^2 = \dfrac{\sum(x-\bar{x})^2}{n-1}, the statistic is

F=s12s22,s12>s22F = \frac{s_1^2}{s_2^2}, \qquad s_1^2 > s_2^2

The larger variance is always placed in the numerator so that F1F \ge 1.

Step 3 — Degrees of freedom: numerator =n11= n_1 - 1, denominator =n21= n_2 - 1.

Step 4 — Decision. Compare FcalF_{cal} with the table value Fα,(n11,n21)F_{\alpha,(n_1-1,\,n_2-1)}.

  • If Fcal>FtabF_{cal} > F_{tab}: reject H0H_0 — the population variances differ significantly.
  • If FcalFtabF_{cal} \le F_{tab}: accept H0H_0 — variances may be regarded as equal.

Assumptions: both samples are random and independent and drawn from normal populations. (This equality-of-variance test is also a prerequisite for the t-test of two means.)

f-test
9short5 marks

Define index numbers and explain Laspeyres' and Paasche's price index methods.

Index Numbers

An index number is a statistical measure that expresses the relative change in the level of a variable (price, quantity, value, production, etc.) over time or place, compared with a chosen base period taken as 100. Price index numbers are widely used to measure inflation and changes in the cost of living.

Notation: p0,q0p_0, q_0 = price and quantity in the base year; p1,q1p_1, q_1 = price and quantity in the current year.

Laspeyres' Price Index

Uses base-year quantities (q0)(q_0) as weights:

P01L=p1q0p0q0×100P_{01}^{L} = \frac{\sum p_1 q_0}{\sum p_0 q_0} \times 100

Merit: requires only one set of weights (base-year), so easy to compute over time. Demerit: tends to overstate the price rise (consumers shift away from costlier goods).

Paasche's Price Index

Uses current-year quantities (q1)(q_1) as weights:

P01P=p1q1p0q1×100P_{01}^{P} = \frac{\sum p_1 q_1}{\sum p_0 q_1} \times 100

Merit: reflects the current consumption pattern. Demerit: current-year weights must be collected every period (costly); tends to understate the price rise.

Note: Fisher's ideal index is the geometric mean of the two: PF=PL×PPP^{F} = \sqrt{P^{L}\times P^{P}}.

index-numbers
10short5 marks

State and explain the addition and multiplication theorems of probability with examples.

Addition and Multiplication Theorems of Probability

1. Addition Theorem (probability of AA OR BB)

For any two events AA and BB:

P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)

If AA and BB are mutually exclusive (cannot occur together, AB=A \cap B = \varnothing):

P(AB)=P(A)+P(B)P(A \cup B) = P(A) + P(B)

Example: Drawing one card from a pack, P(King or Queen)=452+452=852=213P(\text{King or Queen}) = \dfrac{4}{52} + \dfrac{4}{52} = \dfrac{8}{52} = \dfrac{2}{13} (mutually exclusive). For a King or a Heart (not exclusive): 452+1352152=1652=413\dfrac{4}{52}+\dfrac{13}{52}-\dfrac{1}{52} = \dfrac{16}{52} = \dfrac{4}{13}.

2. Multiplication Theorem (probability of AA AND BB)

For any two events:

P(AB)=P(A)P(BA)P(A \cap B) = P(A)\cdot P(B\mid A)

where P(BA)P(B\mid A) is the conditional probability of BB given AA. If AA and BB are independent, P(BA)=P(B)P(B\mid A)=P(B), so:

P(AB)=P(A)P(B)P(A \cap B) = P(A)\cdot P(B)

Example: Two cards drawn one after another without replacement; probability both are aces:

P=452×351=1221P = \frac{4}{52}\times\frac{3}{51} = \frac{1}{221}

If tossing two fair coins (independent), P(both heads)=12×12=14P(\text{both heads}) = \dfrac12\times\dfrac12 = \dfrac14.

probability
11short5 marks

Explain the Poisson distribution with its mean and variance and state its applications.

Poisson Distribution

The Poisson distribution is a discrete probability distribution that gives the probability of a given number of independent events occurring in a fixed interval of time, space or volume, when these events occur with a known constant average rate λ\lambda and rarely (large nn, small pp). The probability mass function is

P(X=x)=eλλxx!,x=0,1,2,,    λ>0P(X = x) = \frac{e^{-\lambda}\,\lambda^{x}}{x!}, \qquad x = 0,1,2,\dots,\;\; \lambda > 0

where λ=np\lambda = np is the average number of occurrences. It is the limiting form of the binomial distribution as nn \to \infty, p0p \to 0 with np=λnp = \lambda finite.

Mean and Variance

A distinctive feature is that the mean and variance are equal:

Mean=λ,Variance=λ\text{Mean} = \lambda, \qquad \text{Variance} = \lambda

Applications

  • Number of telephone calls arriving at an exchange per minute.
  • Number of printing/typing errors per page of a book.
  • Number of accidents on a highway or defects per unit length of cloth/wire.
  • Number of radioactive decays per second; arrivals in queueing (servers, network packets).
  • Number of customers arriving at a counter per hour.
poisson
12short5 marks

Define a random variable. Differentiate between discrete and continuous random variables with examples.

Random Variable

A random variable is a real-valued function that assigns a numerical value to each outcome of a random experiment (sample space). It is usually denoted by capital letters X,Y,ZX, Y, Z. For example, in tossing two coins, the number of heads XX can take the values 0,1,20, 1, 2.

Random variables are of two types: discrete and continuous.

Discrete vs Continuous Random Variables

BasisDiscrete Random VariableContinuous Random Variable
Values takenCountable / isolated values (finite or countably infinite)Any value within an interval (uncountable)
Probability described byProbability mass function P(X=x)P(X=x)Probability density function f(x)f(x)
Probability of a single pointCan be non-zeroAlways zero; only P(aXb)P(a\le X\le b) is meaningful
Summation/IntegrationP(x)=1\sum P(x) = 1f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1
ExamplesNumber of heads in coin tosses, number of defective items, number of callsHeight, weight, temperature, time taken

Examples:

  • Discrete: the number of students present in a class (0, 1, 2, …).
  • Continuous: the height of a student (e.g. any value between 150 cm and 180 cm).
random-variable

Frequently asked questions

Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2081?
The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2081 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistics II (BSc CSIT, STA210) 2081 paper come with solutions?
Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2081 paper?
The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2081 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Statistics II (BSc CSIT, STA210) past paper free?
Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.