Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

Define a probability distribution. Explain the binomial distribution with its mean and variance, and state the conditions under which it is applied.

Probability Distribution

A probability distribution of a random variable XX is a description that assigns to every possible value (or interval of values) of XX a probability of occurrence, such that the total probability is 11.

  • Discrete case: a probability mass function P(X=xi)=piP(X=x_i)=p_i with pi0p_i\ge 0 and ipi=1\sum_i p_i = 1.
  • Continuous case: a probability density function f(x)f(x) with f(x)0f(x)\ge 0 and f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1.

Binomial Distribution

The binomial distribution describes the number of successes XX in a fixed number nn of independent trials, where each trial has only two outcomes (success/failure) with constant success probability pp (and q=1pq = 1-p).

Its probability mass function is:

P(X=x)=(nx)pxqnx,x=0,1,2,,nP(X=x) = \binom{n}{x} p^{x} q^{\,n-x}, \qquad x = 0,1,2,\dots,n

Mean and Variance

Mean=E(X)=np\text{Mean} = E(X) = np Variance=Var(X)=npq,S.D.=npq\text{Variance} = \operatorname{Var}(X) = npq, \qquad \text{S.D.} = \sqrt{npq}

Since q<1q<1, the variance is always less than the mean for a binomial distribution.

Conditions for Application

  1. The number of trials nn is fixed and finite.
  2. Each trial has only two mutually exclusive outcomes (success or failure).
  3. The trials are independent of one another.
  4. The probability of success pp remains constant from trial to trial.

Example: Tossing a fair coin 1010 times and counting heads follows B(n=10,p=0.5)B(n=10,\,p=0.5).

probabilitydistribution
2long10 marks

Explain the normal distribution and its properties. The mean of a normal distribution is 50 and standard deviation is 10; find the probability that a value lies between 45 and 62.

Normal Distribution

A continuous random variable XX follows a normal distribution with mean μ\mu and standard deviation σ\sigma if its probability density function is:

f(x)=1σ2πe12(xμσ)2,<x<f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^{2}}, \qquad -\infty < x < \infty

Properties

  1. The curve is bell-shaped and symmetric about the mean μ\mu.
  2. Mean = Median = Mode =μ= \mu.
  3. Total area under the curve equals 11; the curve is asymptotic to the xx-axis.
  4. It is defined by two parameters: μ\mu (location) and σ\sigma (spread).
  5. Empirical rule: about 68%68\% of values lie within μ±σ\mu\pm\sigma, 95%95\% within μ±2σ\mu\pm 2\sigma, and 99.7%99.7\% within μ±3σ\mu\pm 3\sigma.
  6. Quartiles: Q1=μ0.6745σQ_1=\mu-0.6745\sigma, Q3=μ+0.6745σQ_3=\mu+0.6745\sigma; it has zero skewness and kurtosis 33.

Numerical: P(45<X<62)P(45 < X < 62)

Given μ=50\mu = 50, σ=10\sigma = 10. Standardize using Z=XμσZ = \dfrac{X-\mu}{\sigma}.

For X=45X = 45:

Z1=455010=0.5Z_1 = \frac{45-50}{10} = -0.5

For X=62X = 62:

Z2=625010=1.2Z_2 = \frac{62-50}{10} = 1.2

So we need P(0.5<Z<1.2)P(-0.5 < Z < 1.2). From the standard normal table:

  • P(0<Z<0.5)=0.1915P(0 < Z < 0.5) = 0.1915
  • P(0<Z<1.2)=0.3849P(0 < Z < 1.2) = 0.3849
P(0.5<Z<1.2)=P(0<Z<0.5)+P(0<Z<1.2)=0.1915+0.3849=0.5764P(-0.5 < Z < 1.2) = P(0<Z<0.5) + P(0<Z<1.2) = 0.1915 + 0.3849 = 0.5764

The probability that a value lies between 45 and 62 is 0.57640.5764 (about 57.64%57.64\%).

normal-distribution
3long10 marks

What is hypothesis testing? Explain the procedure of testing of hypothesis including null and alternative hypotheses, level of significance, types of errors and the critical region.

Hypothesis Testing

Hypothesis testing is a statistical procedure used to decide, on the basis of sample evidence, whether a stated assumption (hypothesis) about a population parameter should be accepted or rejected at a chosen level of significance.

Procedure

1. Set up the hypotheses

  • Null hypothesis (H0H_0): a statement of no difference / no effect, assumed true unless evidence contradicts it, e.g. H0:μ=μ0H_0:\mu = \mu_0.
  • Alternative hypothesis (H1H_1): the claim accepted if H0H_0 is rejected. It may be two-tailed (μμ0\mu \ne \mu_0) or one-tailed (μ>μ0\mu > \mu_0 or μ<μ0\mu < \mu_0).

2. Choose the level of significance (α\alpha) The probability of rejecting H0H_0 when it is actually true. Common values are α=0.05\alpha = 0.05 (5%5\%) or α=0.01\alpha = 0.01 (1%1\%).

3. Identify the types of error

DecisionH0H_0 TrueH0H_0 False
Reject H0H_0Type I error (α\alpha)Correct decision
Accept H0H_0Correct decisionType II error (β\beta)
  • Type I error: rejecting a true H0H_0 (probability α\alpha).
  • Type II error: accepting a false H0H_0 (probability β\beta). The quantity 1β1-\beta is the power of the test.

4. Select the test statistic appropriate to the problem (e.g. ZZ, tt, χ2\chi^2, FF) and compute its value from the sample.

5. Determine the critical region The critical (rejection) region is the set of values of the test statistic for which H0H_0 is rejected; its size equals α\alpha. The boundary value is the critical value. For a two-tailed test the region lies in both tails; for a one-tailed test in a single tail.

6. Make the decision If the computed test statistic falls in the critical region (i.e. calculated>tabulated value|\text{calculated}| > \text{tabulated value}), reject H0H_0; otherwise do not reject H0H_0. State the conclusion in terms of the original problem.

hypothesis-testing
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

State and explain the addition and multiplication theorems of probability with examples.

Addition Theorem of Probability

For any two events AA and BB:

P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)

If AA and BB are mutually exclusive (AB=A\cap B = \varnothing), this reduces to:

P(AB)=P(A)+P(B)P(A \cup B) = P(A) + P(B)

Example: Drawing one card from a pack, P(King or Queen)=452+452=852=213P(\text{King or Queen}) = \frac{4}{52} + \frac{4}{52} = \frac{8}{52} = \frac{2}{13} (the events are mutually exclusive).

Multiplication Theorem of Probability

For any two events AA and BB:

P(AB)=P(A)P(BA)=P(B)P(AB)P(A \cap B) = P(A)\,P(B \mid A) = P(B)\,P(A \mid B)

If AA and BB are independent, then P(BA)=P(B)P(B\mid A)=P(B) and:

P(AB)=P(A)P(B)P(A \cap B) = P(A)\,P(B)

Example: Tossing a fair coin and rolling a fair die, P(Head and 6)=12×16=112P(\text{Head and } 6) = \frac{1}{2}\times\frac{1}{6} = \frac{1}{12} (independent events).

probability
5short5 marks

Explain the Poisson distribution with its mean and variance and state its applications.

Poisson Distribution

The Poisson distribution gives the probability of a given number of independent events occurring in a fixed interval of time or space, when these events occur with a known constant average rate λ\lambda and independently of the time since the last event.

Its probability mass function is:

P(X=x)=eλλxx!,x=0,1,2,P(X=x) = \frac{e^{-\lambda}\,\lambda^{x}}{x!}, \qquad x = 0,1,2,\dots

where λ>0\lambda > 0 is the average number of occurrences. It is the limiting case of the binomial distribution when nn \to \infty, p0p \to 0 with np=λnp = \lambda finite.

Mean and Variance

Mean=E(X)=λ,Var(X)=λ\text{Mean} = E(X) = \lambda, \qquad \operatorname{Var}(X) = \lambda

A characteristic feature is that the mean equals the variance (=λ)(=\lambda).

Applications

  1. Number of telephone calls arriving at an exchange per minute.
  2. Number of printing/typing errors per page of a book.
  3. Number of defective items in a large batch (rare-event quality control).
  4. Number of customers arriving at a service counter per unit time.
  5. Number of radioactive particles emitted per second; modelling arrivals in queueing/network traffic.
poisson
6short5 marks

Define a random variable. Differentiate between discrete and continuous random variables with examples.

Random Variable

A random variable is a real-valued function XX that assigns a numerical value to each outcome (sample point) of a random experiment. It maps the sample space SS to the set of real numbers R\mathbb{R}.

Example: In tossing two coins, the number of heads X{0,1,2}X \in \{0,1,2\} is a random variable.

Discrete vs Continuous Random Variables

BasisDiscrete Random VariableContinuous Random Variable
ValuesTakes countable, isolated valuesTakes any value in an interval (uncountable)
ProbabilityDescribed by a probability mass function P(X=x)P(X=x)Described by a probability density function f(x)f(x); P(X=x)=0P(X=x)=0
Summation/IntegrationxP(X=x)=1\sum_x P(X=x)=1f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1
ExampleNumber of defective bulbs, number of heads in tossesHeight, weight, temperature, time

In short: a discrete random variable arises from counting, whereas a continuous random variable arises from measurement.

random-variable
7short5 marks

Define mathematical expectation. State and prove its properties.

Mathematical Expectation

The mathematical expectation (or mean) of a random variable XX is the long-run average value, weighted by probabilities.

  • Discrete: E(X)=xxP(X=x)E(X) = \sum_x x\,P(X=x)
  • Continuous: E(X)=xf(x)dxE(X) = \int_{-\infty}^{\infty} x\,f(x)\,dx

Properties (with proofs)

1. Expectation of a constant. E(c)=cE(c) = c.

Proof: E(c)=xcP(X=x)=cxP(X=x)=c1=c.E(c) = \sum_x c\,P(X=x) = c\sum_x P(X=x) = c\cdot 1 = c.

2. Linearity (constant multiplier). E(aX)=aE(X)E(aX) = a\,E(X).

Proof: E(aX)=xaxP(X=x)=axxP(X=x)=aE(X).E(aX) = \sum_x a x\,P(X=x) = a\sum_x x\,P(X=x) = a\,E(X).

3. Addition of a constant. E(aX+b)=aE(X)+bE(aX + b) = a\,E(X) + b.

Proof: E(aX+b)=x(ax+b)P(x)=axxP(x)+bxP(x)=aE(X)+b.E(aX+b)=\sum_x (ax+b)P(x)=a\sum_x xP(x)+b\sum_x P(x)=aE(X)+b.

4. Addition theorem. E(X+Y)=E(X)+E(Y)E(X + Y) = E(X) + E(Y) for any two random variables.

Proof (discrete): E(X+Y)=xy(x+y)P(x,y)=xxyP(x,y)+yyxP(x,y)=E(X)+E(Y).E(X+Y)=\sum_x\sum_y (x+y)P(x,y)=\sum_x x\sum_y P(x,y)+\sum_y y\sum_x P(x,y)=E(X)+E(Y).

5. Multiplication theorem (independence). If XX and YY are independent, E(XY)=E(X)E(Y)E(XY)=E(X)\,E(Y).

Proof: E(XY)=xyxyP(x)P(y)=(xxP(x))(yyP(y))=E(X)E(Y).E(XY)=\sum_x\sum_y xy\,P(x)P(y)=\Big(\sum_x xP(x)\Big)\Big(\sum_y yP(y)\Big)=E(X)E(Y).

expectation
8short5 marks

Explain the t-test for testing the significance of the difference between two sample means.

t-test for Difference of Two Sample Means

This test is used to determine whether the difference between the means of two independent small samples (n1,n2<30n_1, n_2 < 30) drawn from normal populations with equal (unknown) variances is statistically significant.

Hypotheses

  • H0:μ1=μ2H_0: \mu_1 = \mu_2 (the two population means are equal)
  • H1:μ1μ2H_1: \mu_1 \ne \mu_2 (two-tailed) or one-sided as required

Test Statistic

t=xˉ1xˉ2S1n1+1n2t = \frac{\bar{x}_1 - \bar{x}_2}{S\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}

where the pooled standard deviation is:

S2=(x1xˉ1)2+(x2xˉ2)2n1+n22=n1s12+n2s22n1+n22S^2 = \frac{\sum (x_1-\bar{x}_1)^2 + \sum (x_2-\bar{x}_2)^2}{n_1 + n_2 - 2} = \frac{n_1 s_1^2 + n_2 s_2^2}{n_1+n_2-2}

Degrees of Freedom

d.f.=n1+n22\text{d.f.} = n_1 + n_2 - 2

Decision Rule

Compare tcal|t_{\text{cal}}| with the tabulated value tαt_{\alpha} at the given degrees of freedom and level of significance α\alpha:

  • If tcal>ttab|t_{\text{cal}}| > t_{\text{tab}}, reject H0H_0 — the difference is significant.
  • If tcalttab|t_{\text{cal}}| \le t_{\text{tab}}, accept H0H_0 — the difference is not significant.

Assumptions: samples are random and independent, parent populations are normal, and the two population variances are equal.

t-test
9short5 marks

Explain the z-test for a large sample test of a single mean with an example.

Z-test for a Single Mean (Large Sample)

When the sample size is large (n30n \ge 30), the sampling distribution of the mean is approximately normal, and the z-test is used to test whether the sample mean xˉ\bar{x} differs significantly from a hypothesized population mean μ\mu.

Hypotheses

  • H0:μ=μ0H_0: \mu = \mu_0
  • H1:μμ0H_1: \mu \ne \mu_0 (two-tailed), at level of significance α\alpha.

Test Statistic

Z=xˉμσ/nZ = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}}

If the population standard deviation σ\sigma is unknown, the sample standard deviation ss is used in its place (valid for large nn).

Decision Rule

At α=0.05\alpha = 0.05 (two-tailed) the critical value is 1.961.96. If Zcal>1.96|Z_{\text{cal}}| > 1.96, reject H0H_0; otherwise accept H0H_0.

Example

A sample of n=64n = 64 items has mean xˉ=52\bar{x} = 52 from a population with μ=50\mu = 50 and σ=8\sigma = 8. Test at 5%5\% level.

Z=52508/64=28/8=21=2.0Z = \frac{52 - 50}{8/\sqrt{64}} = \frac{2}{8/8} = \frac{2}{1} = 2.0

Since Z=2.0>1.96|Z| = 2.0 > 1.96, we reject H0H_0: the sample mean differs significantly from the population mean at the 5%5\% level of significance.

z-test
10short5 marks

Define Karl Pearson's coefficient of correlation and state its properties.

Karl Pearson's Coefficient of Correlation

Karl Pearson's coefficient of correlation rr is a numerical measure of the degree and direction of linear relationship between two variables XX and YY. It is defined as the ratio of the covariance of XX and YY to the product of their standard deviations:

r=Cov(X,Y)σXσY=(xxˉ)(yyˉ)(xxˉ)2(yyˉ)2r = \frac{\operatorname{Cov}(X,Y)}{\sigma_X\,\sigma_Y} = \frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2}\,\sqrt{\sum (y-\bar{y})^2}}

Properties

  1. Range: 1r+1-1 \le r \le +1.
    • r=+1r = +1: perfect positive correlation; r=1r = -1: perfect negative correlation; r=0r = 0: no linear correlation.
  2. Independent of origin and scale: rr is unchanged if a constant is added to or each value is multiplied by a constant.
  3. Symmetric: rxy=ryxr_{xy} = r_{yx}.
  4. It is the geometric mean of the two regression coefficients: r=±bxybyxr = \pm\sqrt{b_{xy}\cdot b_{yx}}, taking the sign of the regression coefficients.
  5. It measures only the linear relationship; r=0r=0 does not imply the variables are independent (a nonlinear relation may exist).
correlation
11short5 marks

What are regression coefficients? State their properties.

Regression Coefficients

In linear regression between two variables XX and YY, the regression coefficient is the slope of a regression line — it measures the average change in the dependent variable for a unit change in the independent variable.

  • Regression coefficient of YY on XX:
byx=Cov(X,Y)σX2=rσYσXb_{yx} = \frac{\operatorname{Cov}(X,Y)}{\sigma_X^{2}} = r\frac{\sigma_Y}{\sigma_X}
  • Regression coefficient of XX on YY:
bxy=Cov(X,Y)σY2=rσXσYb_{xy} = \frac{\operatorname{Cov}(X,Y)}{\sigma_Y^{2}} = r\frac{\sigma_X}{\sigma_Y}

Properties

  1. Correlation as geometric mean: r=±byxbxyr = \pm\sqrt{b_{yx}\cdot b_{xy}}, so r2=byxbxyr^2 = b_{yx}\,b_{xy}.
  2. Same sign: both regression coefficients have the same sign as rr; if one is positive both are positive, and vice versa.
  3. Since r1|r|\le 1, the product byxbxy1b_{yx}\,b_{xy} \le 1 (both cannot exceed 11 simultaneously).
  4. They are independent of change of origin but not of scale.
  5. The arithmetic mean of the two regression coefficients is greater than or equal to rr, i.e. byx+bxy2r\dfrac{b_{yx}+b_{xy}}{2} \ge r.
regression
12short5 marks

Explain the concept of sampling distribution and standard error.

Sampling Distribution

A sampling distribution is the probability distribution of a sample statistic (such as the sample mean xˉ\bar{x} or proportion pp) obtained from all possible samples of the same size nn drawn from a given population. For example, if repeated random samples of size nn are taken and the mean of each is computed, the distribution of those means is the sampling distribution of the mean.

By the Central Limit Theorem, for large nn the sampling distribution of the mean is approximately normal with mean μ\mu (the population mean) and standard deviation σ/n\sigma/\sqrt{n}, regardless of the shape of the parent population.

Standard Error

The standard error (S.E.) is the standard deviation of the sampling distribution of a statistic. It measures the variability of the statistic from sample to sample.

  • S.E. of the mean: S.E.(xˉ)=σn\quad \text{S.E.}(\bar{x}) = \dfrac{\sigma}{\sqrt{n}}
  • S.E. of a proportion: S.E.(p)=PQn\quad \text{S.E.}(p) = \sqrt{\dfrac{PQ}{n}}

Significance: a smaller standard error indicates greater precision (estimates cluster tightly around the parameter). The S.E. decreases as the sample size nn increases, and it is used to construct confidence intervals and test statistics in tests of significance.

samplingdistribution

Frequently asked questions

Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2074?
The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2074 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistics II (BSc CSIT, STA210) 2074 paper come with solutions?
Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2074 paper?
The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2074 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Statistics II (BSc CSIT, STA210) past paper free?
Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.