Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

What is hypothesis testing? Explain the procedure of testing of hypothesis including null and alternative hypotheses, level of significance, types of errors and the critical region.

Hypothesis Testing

Hypothesis testing is a statistical procedure that uses sample data to decide whether a claim (assumption) about a population parameter is supported by the evidence. It provides a rule for accepting or rejecting the claim with a controlled probability of error.

Procedure of Testing a Hypothesis

Step 1: Set up the Null and Alternative Hypotheses

  • Null hypothesis (H0H_0): A statement of no difference / no effect that is assumed true until evidence contradicts it, e.g. H0:μ=μ0H_0:\mu = \mu_0.
  • Alternative hypothesis (H1H_1): The claim accepted if H0H_0 is rejected. It may be:
    • Two-tailed: H1:μμ0H_1:\mu \neq \mu_0
    • One-tailed: H1:μ>μ0H_1:\mu > \mu_0 or H1:μ<μ0H_1:\mu < \mu_0

Step 2: Choose the Level of Significance (α\alpha)

The level of significance is the maximum probability of rejecting H0H_0 when it is actually true. Common values are α=0.05\alpha = 0.05 (5%) or 0.010.01 (1%). It is fixed before collecting data.

Step 3: Identify the Test Statistic

Select an appropriate statistic (e.g. zz, tt, χ2\chi^2, FF) whose sampling distribution under H0H_0 is known, e.g.

Z=xˉμ0σ/n.Z = \frac{\bar{x}-\mu_0}{\sigma/\sqrt{n}}.

Step 4: Determine the Critical Region (Rejection Region)

The critical region is the set of values of the test statistic for which H0H_0 is rejected. Its boundary is the critical value, fixed so that P(test statisticcritical regionH0)=αP(\text{test statistic} \in \text{critical region} \mid H_0) = \alpha.

Step 5: Compute the test statistic from the sample and compare with the critical value.

Step 6: Decision — Reject H0H_0 if the computed value falls in the critical region; otherwise do not reject H0H_0. State the conclusion in the context of the problem.

Types of Errors

H0H_0 TrueH0H_0 False
Reject H0H_0Type I error (α\alpha)Correct (power =1β=1-\beta)
Accept H0H_0Correct (1α)(1-\alpha)Type II error (β\beta)
  • Type I error: Rejecting a true H0H_0; probability =α= \alpha.
  • Type II error: Accepting a false H0H_0; probability =β= \beta.

Reducing one error (for fixed nn) tends to increase the other; increasing sample size reduces both.

hypothesis-testing
2long10 marks

Define correlation and regression. Explain the method of fitting two regression lines and the relationship between correlation coefficient and regression coefficients.

Correlation and Regression

Correlation measures the degree and direction of the linear association between two variables XX and YY. It tells whether the variables move together (positive), oppositely (negative), or are unrelated. Karl Pearson's coefficient is

r=Cov(X,Y)σxσy=(xxˉ)(yyˉ)(xxˉ)2(yyˉ)2,1r1.r = \frac{\operatorname{Cov}(X,Y)}{\sigma_x \sigma_y} = \frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2 \sum (y-\bar{y})^2}}, \quad -1 \le r \le 1.

Regression is the statistical method of estimating (predicting) the value of one (dependent) variable from the known value of another (independent) variable by fitting an average mathematical relationship between them.

Fitting the Two Regression Lines

There are two regression lines because either variable may be treated as dependent.

(a) Regression line of YY on XX (used to estimate YY from XX):

YYˉ=byx(XXˉ),byx=Cov(X,Y)σx2=rσyσx.Y - \bar{Y} = b_{yx}(X - \bar{X}), \qquad b_{yx} = \frac{\operatorname{Cov}(X,Y)}{\sigma_x^2} = r\frac{\sigma_y}{\sigma_x}.

The coefficient byxb_{yx} is obtained by minimising (YY^)2\sum (Y - \hat{Y})^2 (least squares).

(b) Regression line of XX on YY (used to estimate XX from YY):

XXˉ=bxy(YYˉ),bxy=Cov(X,Y)σy2=rσxσy.X - \bar{X} = b_{xy}(Y - \bar{Y}), \qquad b_{xy} = \frac{\operatorname{Cov}(X,Y)}{\sigma_y^2} = r\frac{\sigma_x}{\sigma_y}.

Both lines pass through the mean point (Xˉ,Yˉ)(\bar{X},\bar{Y}).

Relationship between rr and the Regression Coefficients

Multiplying the two coefficients:

byxbxy=(rσyσx)(rσxσy)=r2.b_{yx}\cdot b_{xy} = \left(r\frac{\sigma_y}{\sigma_x}\right)\left(r\frac{\sigma_x}{\sigma_y}\right) = r^2.

Hence the correlation coefficient is the geometric mean of the two regression coefficients:

r=±byxbxy.r = \pm\sqrt{b_{yx}\,b_{xy}}.

Key points: rr takes the same sign as the regression coefficients; since r21r^2 \le 1, both coefficients cannot exceed 1 simultaneously; if one coefficient >1>1 the other must be <1<1.

correlationregression
3long10 marks

What is sampling? Explain different methods of probability and non-probability sampling with their merits and demerits.

Sampling

Sampling is the process of selecting a representative subset (sample) from a population in order to draw inferences about the whole population, saving time, cost and effort compared with a complete census.

Sampling methods are broadly classified as probability and non-probability sampling.

A. Probability Sampling

Every unit has a known, non-zero chance of selection; results can be generalized with measurable error.

  1. Simple Random Sampling — every unit has an equal chance of selection (lottery / random numbers).
    • Merits: unbiased, simple, sampling error measurable.
    • Demerits: needs complete frame; units may be geographically scattered.
  2. Systematic Sampling — select every kthk^{th} unit after a random start (k=N/nk = N/n).
    • Merits: simple, fast, evenly spread.
    • Demerits: biased if there is a hidden periodicity in the list.
  3. Stratified Sampling — divide the population into homogeneous strata and sample from each.
    • Merits: high precision, ensures representation of subgroups.
    • Demerits: requires prior knowledge of strata; complex.
  4. Cluster / Multi-stage Sampling — divide into clusters, randomly select whole clusters (or sample within them in stages).
    • Merits: cheap, no full frame needed, good for wide areas.
    • Demerits: larger sampling error than other probability methods.

B. Non-Probability Sampling

Selection probability is unknown; based on judgement or convenience; generalization is limited.

  1. Convenience Sampling — units easiest to reach are selected. Merit: quick, cheap. Demerit: highly biased, not representative.
  2. Judgement (Purposive) Sampling — expert selects typical units. Merit: useful for small specialised studies. Demerit: subjective, bias-prone.
  3. Quota Sampling — fixed quotas filled for each category. Merit: convenient, ensures group coverage. Demerit: selection within quota is biased.
  4. Snowball Sampling — existing subjects refer further subjects. Merit: reaches hidden/rare populations. Demerit: strong selection bias.

Summary: Probability sampling gives unbiased, measurable results but is costlier; non-probability sampling is cheaper and faster but prone to bias.

sampling
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

Define mathematical expectation. State and prove its properties.

Mathematical Expectation

For a random variable XX, the mathematical expectation (expected value / mean) is

E(X)=xxp(x)(discrete),E(X)=xf(x)dx(continuous),E(X) = \sum_x x\,p(x) \quad (\text{discrete}), \qquad E(X)=\int_{-\infty}^{\infty} x f(x)\,dx \quad (\text{continuous}),

provided the sum/integral converges absolutely. It is the long-run average value of XX.

Properties (with proofs)

1. Expectation of a constant: E(c)=cE(c) = c. Proof: E(c)=cp(x)=cp(x)=c1=c.E(c)=\sum c\,p(x)=c\sum p(x)=c\cdot 1 = c.

2. Linearity (constant multiple): E(aX)=aE(X)E(aX)=aE(X). Proof: E(aX)=axp(x)=axp(x)=aE(X).E(aX)=\sum ax\,p(x)=a\sum x\,p(x)=aE(X).

3. Addition of a constant: E(aX+b)=aE(X)+bE(aX+b)=aE(X)+b. Proof: E(aX+b)=(ax+b)p(x)=axp(x)+bp(x)=aE(X)+b.E(aX+b)=\sum (ax+b)p(x)=a\sum x p(x)+b\sum p(x)=aE(X)+b.

4. Addition theorem: E(X+Y)=E(X)+E(Y)E(X+Y)=E(X)+E(Y) (always true). Proof: E(X+Y)=xy(x+y)p(x,y)=xyxp(x,y)+xyyp(x,y)=E(X)+E(Y).E(X+Y)=\sum_x\sum_y (x+y)p(x,y)=\sum_x\sum_y x\,p(x,y)+\sum_x\sum_y y\,p(x,y)=E(X)+E(Y).

5. Multiplication theorem (independence): If XX and YY are independent, E(XY)=E(X)E(Y)E(XY)=E(X)E(Y). Proof: For independence p(x,y)=p(x)p(y)p(x,y)=p(x)p(y), so E(XY)=xyxyp(x)p(y)=(xxp(x))(yyp(y))=E(X)E(Y).E(XY)=\sum_x\sum_y xy\,p(x)p(y)=\big(\sum_x x p(x)\big)\big(\sum_y y p(y)\big)=E(X)E(Y).

expectation
5short5 marks

Explain the t-test for testing the significance of the difference between two sample means.

t-test for the Difference between Two Sample Means

Used for small independent samples (n1,n2<30n_1, n_2 < 30) drawn from normal populations having a common but unknown variance, to test whether their means differ.

Hypotheses: H0:μ1=μ2H_0:\mu_1=\mu_2 vs H1:μ1μ2H_1:\mu_1\neq\mu_2 (or one-tailed).

Test statistic:

t=xˉ1xˉ2S1n1+1n2,t = \frac{\bar{x}_1 - \bar{x}_2}{S\sqrt{\dfrac{1}{n_1}+\dfrac{1}{n_2}}},

where the pooled estimate of the common variance is

S2=(x1xˉ1)2+(x2xˉ2)2n1+n22=n1s12+n2s22n1+n22.S^2 = \frac{\sum (x_{1}-\bar{x}_1)^2 + \sum (x_{2}-\bar{x}_2)^2}{n_1+n_2-2} = \frac{n_1 s_1^2 + n_2 s_2^2}{n_1+n_2-2}.

Degrees of freedom: df=n1+n22df = n_1 + n_2 - 2.

Decision rule: Compare tcal|t|_{cal} with the table value tα,dft_{\alpha, df}.

  • If tcalttab|t|_{cal} \le t_{tab}: do not reject H0H_0 (means not significantly different).
  • If tcal>ttab|t|_{cal} > t_{tab}: reject H0H_0 (means significantly different).

Assumptions: populations normal, samples independent and random, equal (homogeneous) population variances.

t-test
6short5 marks

Explain the z-test for a large sample test of a single mean with an example.

z-test for a Single Mean (Large Sample)

When the sample is large (n30n \ge 30), the sampling distribution of the mean is approximately normal, so a zz-test tests whether the sample mean differs from a hypothesised population mean μ0\mu_0.

Hypotheses: H0:μ=μ0H_0:\mu=\mu_0 vs H1:μμ0H_1:\mu\neq\mu_0.

Test statistic:

Z=xˉμ0σ/n,Z = \frac{\bar{x}-\mu_0}{\sigma/\sqrt{n}},

where σ\sigma is the population standard deviation (the sample s.d. ss is used if σ\sigma is unknown, valid for large nn).

Decision rule (5% level, two-tailed): Reject H0H_0 if Z>1.96|Z| > 1.96 (use 2.582.58 at 1%). For one-tailed tests use 1.6451.645.

Example

A machine is set to fill packets of mean weight μ0=500\mu_0 = 500 g. A sample of n=64n = 64 packets gives xˉ=496\bar{x}=496 g with σ=16\sigma = 16 g. Test at 5%.

Z=49650016/64=416/8=42=2.0.Z = \frac{496-500}{16/\sqrt{64}} = \frac{-4}{16/8} = \frac{-4}{2} = -2.0.

Since Z=2.0>1.96|Z| = 2.0 > 1.96, we reject H0H_0: the mean filling weight differs significantly from 500 g.

z-test
7short5 marks

Define Karl Pearson's coefficient of correlation and state its properties.

Karl Pearson's Coefficient of Correlation

It is a numerical measure of the degree and direction of the linear relationship between two variables XX and YY, defined as the ratio of their covariance to the product of their standard deviations:

r=Cov(X,Y)σxσy=(xxˉ)(yyˉ)(xxˉ)2(yyˉ)2.r = \frac{\operatorname{Cov}(X,Y)}{\sigma_x \sigma_y} = \frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2}\,\sqrt{\sum (y-\bar{y})^2}}.

Properties

  1. Range: 1r+1-1 \le r \le +1. r=+1r=+1 perfect positive, r=1r=-1 perfect negative, r=0r=0 no linear correlation.
  2. Independent of origin and scale (units): rr is unchanged when each variable is transformed by u=(xa)/hu=(x-a)/h, v=(yb)/kv=(y-b)/k.
  3. Symmetric: rxy=ryxr_{xy} = r_{yx}.
  4. It is the geometric mean of the two regression coefficients: r=±byxbxyr = \pm\sqrt{b_{yx}\,b_{xy}}.
  5. It is a pure (dimensionless) number, independent of the units of measurement.
  6. Measures only linear association; r=0r=0 does not imply the variables are unrelated (could be non-linear).
correlation
8short5 marks

What are regression coefficients? State their properties.

Regression Coefficients

The regression coefficient is the slope of a regression line; it measures the average change in the dependent variable for a unit change in the independent variable.

  • Regression coefficient of YY on XX: byx=Cov(X,Y)σx2=rσyσx.\displaystyle b_{yx} = \frac{\operatorname{Cov}(X,Y)}{\sigma_x^2} = r\frac{\sigma_y}{\sigma_x}.
  • Regression coefficient of XX on YY: bxy=Cov(X,Y)σy2=rσxσy.\displaystyle b_{xy} = \frac{\operatorname{Cov}(X,Y)}{\sigma_y^2} = r\frac{\sigma_x}{\sigma_y}.

Properties

  1. Geometric mean relation with rr: r=±byxbxyr = \pm\sqrt{b_{yx}\,b_{xy}}, i.e. byxbxy=r2b_{yx}\,b_{xy}=r^2.
  2. Same sign: both regression coefficients (and rr) have the same sign; if both are positive rr is positive, if both negative rr is negative.
  3. Both cannot exceed unity: since r21r^2 \le 1, the product byxbxy1b_{yx}b_{xy}\le 1, so if one coefficient is greater than 1 the other must be less than 1.
  4. Independent of change of origin but not of scale.
  5. Arithmetic mean of the two coefficients is greater than or equal to rr: 12(byx+bxy)r\tfrac{1}{2}(b_{yx}+b_{xy}) \ge r.
regression
9short5 marks

Explain the concept of sampling distribution and standard error.

Sampling Distribution and Standard Error

Sampling distribution: If all possible samples of a fixed size nn are drawn from a population and a statistic (e.g. the mean xˉ\bar{x}, proportion pp) is computed for each sample, the probability distribution of that statistic over all such samples is called its sampling distribution. For example, the sampling distribution of the mean has its own mean E(xˉ)=μE(\bar{x})=\mu and variance σ2/n\sigma^2/n, and by the Central Limit Theorem it is approximately normal for large nn.

Standard error (S.E.): The standard deviation of the sampling distribution of a statistic is called its standard error. It measures the variability of the statistic due to sampling and is used to set confidence limits and construct test statistics. For the sample mean:

S.E.(xˉ)=σn.\text{S.E.}(\bar{x}) = \frac{\sigma}{\sqrt{n}}.

For a sample proportion: S.E.(p)=PQn\text{S.E.}(p) = \sqrt{\dfrac{PQ}{n}}.

Importance / uses:

  • A smaller S.E. means the estimate is more reliable; S.E. decreases as nn increases.
  • It provides the denominator of test statistics, e.g. Z=(xˉμ)/S.E.(xˉ)Z = (\bar{x}-\mu)/\text{S.E.}(\bar{x}).
  • It is used to construct confidence intervals and to judge the precision of an estimate.
samplingdistribution
10short5 marks

Explain how to construct a confidence interval for a population mean.

Confidence Interval for a Population Mean

A confidence interval gives a range of values, computed from sample data, within which the unknown population mean μ\mu is expected to lie with a stated probability (confidence level 1α1-\alpha, e.g. 95%).

Construction

Point estimate: the sample mean xˉ\bar{x}.

General form: xˉ±(critical value)×S.E.(xˉ)\bar{x} \pm (\text{critical value}) \times \text{S.E.}(\bar{x}).

Case 1 — Large sample (n30n\ge 30) or known σ\sigma (use ZZ):

xˉ±zα/2σn.\bar{x} \pm z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}}.

For 95% confidence, zα/2=1.96z_{\alpha/2}=1.96; for 99%, zα/2=2.58z_{\alpha/2}=2.58.

Case 2 — Small sample (n<30n<30), σ\sigma unknown (use tt):

xˉ±tα/2,n1sn,df=n1.\bar{x} \pm t_{\alpha/2,\,n-1}\,\frac{s}{\sqrt{n}}, \qquad df = n-1.

Example: n=100, xˉ=50, σ=10n=100,\ \bar{x}=50,\ \sigma=10. 95% interval =50±1.96×10100=50±1.96=(48.04, 51.96)= 50 \pm 1.96\times\frac{10}{\sqrt{100}} = 50 \pm 1.96 = (48.04,\ 51.96).

Interpretation: We are 95% confident that the true population mean lies between the lower and upper limits; i.e. in repeated sampling, 95% of such intervals would contain μ\mu.

confidence-interval
11short5 marks

Explain the F-test for the equality of two population variances.

F-test for Equality of Two Population Variances

Used to test whether two independent normal populations have equal variances (also the basis of ANOVA and testing the validity of pooling variances in the t-test).

Hypotheses: H0:σ12=σ22H_0:\sigma_1^2=\sigma_2^2 vs H1:σ12σ22H_1:\sigma_1^2\neq\sigma_2^2.

Test statistic: the ratio of the two sample variances, with the larger variance in the numerator:

F=s12s22(s12>s22),F = \frac{s_1^2}{s_2^2}\quad (s_1^2 > s_2^2),

where the unbiased sample variances are

s12=(x1xˉ1)2n11,s22=(x2xˉ2)2n21.s_1^2 = \frac{\sum (x_1-\bar{x}_1)^2}{n_1-1}, \qquad s_2^2 = \frac{\sum (x_2-\bar{x}_2)^2}{n_2-1}.

Degrees of freedom: (ν1,ν2)=(n11, n21)(\nu_1, \nu_2) = (n_1-1,\ n_2-1), corresponding to numerator and denominator.

Decision rule: Compare FcalF_{cal} with the table value Fα(ν1,ν2)F_{\alpha}(\nu_1,\nu_2).

  • If FcalFtabF_{cal} \le F_{tab}: do not reject H0H_0 — variances are homogeneous.
  • If Fcal>FtabF_{cal} > F_{tab}: reject H0H_0 — variances differ significantly.

Assumptions: populations normally distributed; samples independent and random. Since the larger variance is on top, F1F \ge 1 and only the upper tail is used.

f-test
12short5 marks

Define index numbers and explain Laspeyres' and Paasche's price index methods.

Index Numbers

An index number is a statistical measure that shows the relative change in the value of a variable (or a group of variables such as prices, quantities or values) over time or place, with respect to a chosen base period taken as 100. It is a specialised average used to study, for example, changes in the cost of living or price level.

Let p0,q0p_0, q_0 be base-year price and quantity and p1,q1p_1, q_1 the current-year price and quantity.

Laspeyres' Price Index (base-year weights)

Uses base-year quantities (q0q_0) as weights:

P01L=p1q0p0q0×100.P_{01}^{L} = \frac{\sum p_1 q_0}{\sum p_0 q_0} \times 100.

It measures the cost of buying base-year quantities at current prices versus base prices. It tends to overstate the price rise (ignores substitution away from costlier goods).

Paasche's Price Index (current-year weights)

Uses current-year quantities (q1q_1) as weights:

P01P=p1q1p0q1×100.P_{01}^{P} = \frac{\sum p_1 q_1}{\sum p_0 q_1} \times 100.

It tends to understate the price rise. (Fisher's ideal index is the geometric mean of the two: PF=PL×PPP^F=\sqrt{P^L \times P^P}.)

Key difference: Laspeyres uses base-period weights (data needed only once), whereas Paasche uses current-period weights (must be recollected each period).

index-numbers

Frequently asked questions

Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2075?
The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2075 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistics II (BSc CSIT, STA210) 2075 paper come with solutions?
Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2075 paper?
The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2075 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Statistics II (BSc CSIT, STA210) past paper free?
Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.