Browse papers
A

Section A: Long Answer Questions

Attempt all / any as specified.

4 questions
1long12 marks

The following data represent the breaking strength (in kg) of 60 samples of a cable produced in a factory:

Breaking strength (kg)50-6060-7070-8080-9090-100100-110
No. of samples610181484

(a) Compute the arithmetic mean, median and mode of the distribution. (b) Calculate the standard deviation and the coefficient of variation, and comment on the consistency of the production process. (c) Find the Karl Pearson's coefficient of skewness and interpret the shape of the distribution.

Working table

Using class mid-points mm and N=f=60N=\sum f = 60:

Classmmfffmfmf(mxˉ)2f(m-\bar{x})^2
50-605563303408.0
60-7065106501780.6
70-8075181350200.0
80-9085141190614.7
90-1009587602244.4
100-11010544202685.2
Total60470010933.3

(a) Mean, Median, Mode

Mean: xˉ=fmN=470060=78.33\bar{x} = \dfrac{\sum fm}{N} = \dfrac{4700}{60} = 78.33 kg.

Median: N/2=30N/2 = 30 falls in the class 70-80 (cumulative frequency reaches 34). With L=70, cf=16, f=18, h=10L=70,\ cf=16,\ f=18,\ h=10:

M=L+N/2cffh=70+301618×10=77.78 kg.M = L + \frac{N/2 - cf}{f}\,h = 70 + \frac{30-16}{18}\times 10 = 77.78 \text{ kg}.

Mode: Modal class is 70-80 (highest frequency 18). With f1=18, f0=10, f2=14f_1=18,\ f_0=10,\ f_2=14:

Z=L+f1f02f1f0f2h=70+1810361014×10=76.67 kg.Z = L + \frac{f_1-f_0}{2f_1-f_0-f_2}\,h = 70 + \frac{18-10}{36-10-14}\times 10 = 76.67 \text{ kg}.

(b) Standard deviation and CV

σ=f(mxˉ)2N=10933.360=182.22=13.50 kg.\sigma = \sqrt{\frac{\sum f(m-\bar{x})^2}{N}} = \sqrt{\frac{10933.3}{60}} = \sqrt{182.22} = 13.50 \text{ kg}. CV=σxˉ×100=13.5078.33×100=17.23%.\text{CV} = \frac{\sigma}{\bar{x}}\times 100 = \frac{13.50}{78.33}\times 100 = 17.23\%.

Comment: A CV of about 17% indicates moderate relative variability. The breaking strengths are reasonably consistent, so the production process is fairly stable, though not highly uniform.

(c) Karl Pearson's coefficient of skewness

Sk=xˉZσ=78.3376.6713.50=+0.12.S_k = \frac{\bar{x}-Z}{\sigma} = \frac{78.33-76.67}{13.50} = +0.12.

(Equivalently, Sk=3(xˉM)σ=3(78.3377.78)13.50+0.12S_k = \dfrac{3(\bar{x}-M)}{\sigma} = \dfrac{3(78.33-77.78)}{13.50} \approx +0.12.)

Since SkS_k is small and positive, the distribution is slightly positively (right) skewed — nearly symmetrical with a mild tail towards higher breaking strengths.

descriptive-statisticsmeasures-of-dispersionskewness
2long12 marks

(a) State the axioms of probability. For any two events AA and BB, prove the addition theorem P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B). (b) Three machines M1M_1, M2M_2 and M3M_3 produce respectively 25%, 35% and 40% of the total output of a factory. The proportions of defective items produced by them are 5%, 4% and 2% respectively. An item is drawn at random from the total output and found to be defective. Using Bayes' theorem, find the probability that it was produced by machine M3M_3.

(a) Axioms of probability and the addition theorem

Axioms (Kolmogorov). For a sample space SS and any event AA:

  1. Non-negativity: P(A)0P(A) \geq 0.
  2. Normalization: P(S)=1P(S) = 1.
  3. Additivity: if A1,A2,A_1, A_2, \dots are mutually exclusive events, then P ⁣(iAi)=iP(Ai)P\!\left(\bigcup_i A_i\right) = \sum_i P(A_i).

Addition theorem. We can write the union of AA and BB as a union of disjoint pieces:

AB=A(BAc).A \cup B = A \cup (B \cap A^c).

Since AA and BAcB\cap A^c are mutually exclusive, by Axiom 3:

P(AB)=P(A)+P(BAc).(1)P(A\cup B) = P(A) + P(B\cap A^c). \quad (1)

Also, B=(BA)(BAc)B = (B\cap A) \cup (B\cap A^c) is a disjoint split of BB, so:

P(B)=P(BA)+P(BAc)    P(BAc)=P(B)P(AB).(2)P(B) = P(B\cap A) + P(B\cap A^c) \;\Rightarrow\; P(B\cap A^c) = P(B) - P(A\cap B). \quad (2)

Substituting (2) into (1):

P(AB)=P(A)+P(B)P(AB).\boxed{P(A\cup B) = P(A) + P(B) - P(A\cap B).}

(b) Bayes' theorem application

Let DD be the event that a drawn item is defective. Given:

P(M1)=0.25, P(M2)=0.35, P(M3)=0.40,P(M_1)=0.25,\ P(M_2)=0.35,\ P(M_3)=0.40, P(DM1)=0.05, P(DM2)=0.04, P(DM3)=0.02.P(D|M_1)=0.05,\ P(D|M_2)=0.04,\ P(D|M_3)=0.02.

Total probability of a defective:

P(D)=(0.25)(0.05)+(0.35)(0.04)+(0.40)(0.02)P(D) = (0.25)(0.05) + (0.35)(0.04) + (0.40)(0.02) =0.0125+0.0140+0.0080=0.0345.= 0.0125 + 0.0140 + 0.0080 = 0.0345.

Bayes' theorem:

P(M3D)=P(M3)P(DM3)P(D)=0.00800.0345=0.2319.P(M_3|D) = \frac{P(M_3)P(D|M_3)}{P(D)} = \frac{0.0080}{0.0345} = 0.2319.

Thus the probability the defective item was produced by machine M3M_3 is 0.232\approx 0.232 (23.2%).

probability-axiomsbayes-theoremconditional-probability
3long12 marks

(a) Define the normal distribution and state its important properties. Show that for a normal distribution the mean, median and mode coincide. (b) The lifetime of a certain type of electronic component is normally distributed with mean 1200 hours and standard deviation 150 hours. (i) What proportion of components last more than 1450 hours? (ii) What proportion of components have a lifetime between 1000 and 1300 hours? (iii) If the manufacturer wishes to guarantee a minimum lifetime such that only 5% of components fail before the guarantee period, what should the guarantee period be? (Standard normal tables are attached.)

(a) Normal distribution: definition, properties, coincidence of averages

A continuous random variable XX follows a normal distribution with mean μ\mu and variance σ2\sigma^2 if its probability density function is

f(x)=1σ2πe(xμ)22σ2,<x<.f(x) = \frac{1}{\sigma\sqrt{2\pi}}\,e^{-\frac{(x-\mu)^2}{2\sigma^2}}, \quad -\infty < x < \infty.

Important properties:

  • Bell-shaped, symmetric about x=μx=\mu.
  • Mean = Median = Mode = μ\mu.
  • Total area under the curve is 1; the curve is asymptotic to the xx-axis.
  • The xx-axis is approached but never touched; μ±σ\mu \pm \sigma are points of inflexion.
  • Empirical rule: about 68%, 95% and 99.7% of values lie within μ±σ\mu\pm\sigma, μ±2σ\mu\pm2\sigma, μ±3σ\mu\pm3\sigma.
  • Quartiles: Q1=μ0.6745σQ_1=\mu-0.6745\sigma, Q3=μ+0.6745σQ_3=\mu+0.6745\sigma; mean deviation =0.7979σ=0.7979\sigma.

Mean = Median = Mode. The curve is symmetric about x=μx=\mu, so μ\mu divides the area into two equal halves \Rightarrow Median =μ=\mu. Differentiating f(x)f(x) and setting f(x)=0f'(x)=0 gives (xμ)σ2f(x)=0x=μ-\frac{(x-\mu)}{\sigma^2}f(x)=0 \Rightarrow x=\mu, and f(μ)<0f''(\mu)<0, so f(x)f(x) is maximum at x=μx=\mu \Rightarrow Mode =μ=\mu. The mean of a symmetric distribution equals its centre of symmetry μ\mu. Hence mean = median = mode = μ\mu.

(b) Numerical part (μ=1200, σ=150\mu=1200,\ \sigma=150)

Standardize with z=xμσz = \dfrac{x-\mu}{\sigma}.

(i) P(X>1450)P(X>1450): z=14501200150=1.67z = \dfrac{1450-1200}{150} = 1.67.

P(X>1450)=P(Z>1.67)=0.50.4525=0.04754.75%.P(X>1450) = P(Z>1.67) = 0.5 - 0.4525 = 0.0475 \approx 4.75\%.

(ii) P(1000<X<1300)P(1000<X<1300): z1=10001200150=1.33z_1 = \dfrac{1000-1200}{150} = -1.33, z2=13001200150=0.67z_2 = \dfrac{1300-1200}{150} = 0.67.

P=P(1.33<Z<0.67)=0.4082+0.2486=0.656865.7%.P = P(-1.33<Z<0.67) = 0.4082 + 0.2486 = 0.6568 \approx 65.7\%.

(iii) Guarantee period gg such that P(X<g)=0.05P(X<g)=0.05. The lower 5% point is z=1.645z=-1.645:

g=μ+zσ=1200+(1.645)(150)=1200246.75=953.25 hours.g = \mu + z\sigma = 1200 + (-1.645)(150) = 1200 - 246.75 = 953.25 \text{ hours}.

The manufacturer should set the guarantee period at about 953 hours.

normal-distributioncontinuous-distributions
4long12 marks

The following data show the number of hours studied (XX) and the marks obtained (YY) by 8 students in an examination:

X (hours)2456891112
Y (marks)2130354050567078

(a) Compute the Karl Pearson's coefficient of correlation between XX and YY and interpret the result. (b) Obtain the two regression equations (the regression of YY on XX and of XX on YY). (c) Estimate the marks obtained by a student who studies for 10 hours, and show that the geometric mean of the two regression coefficients equals the correlation coefficient.

Sums (n = 8)

X=57, Y=380, X2=491, Y2=20786, XY=3187,\sum X = 57,\ \sum Y = 380,\ \sum X^2 = 491,\ \sum Y^2 = 20786,\ \sum XY = 3187, Xˉ=7.125,Yˉ=47.5.\bar{X} = 7.125,\quad \bar{Y} = 47.5.

(a) Karl Pearson's correlation coefficient

r=nXYXY[nX2(X)2][nY2(Y)2]r = \frac{n\sum XY - \sum X\sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} =8(3187)(57)(380)[8(491)572][8(20786)3802]=2549621660(39283249)(166288144400)= \frac{8(3187) - (57)(380)}{\sqrt{[8(491)-57^2][8(20786)-380^2]}} = \frac{25496 - 21660}{\sqrt{(3928-3249)(166288-144400)}} =3836679×21888=38363855.1=0.995.= \frac{3836}{\sqrt{679 \times 21888}} = \frac{3836}{3855.1} = 0.995.

There is a very strong positive linear correlation between hours studied and marks obtained.

(b) Regression equations

Regression coefficients:

bYX=nXYXYnX2(X)2=3836679=5.649,b_{YX} = \frac{n\sum XY - \sum X\sum Y}{n\sum X^2 - (\sum X)^2} = \frac{3836}{679} = 5.649, bXY=nXYXYnY2(Y)2=383621888=0.1753.b_{XY} = \frac{n\sum XY - \sum X\sum Y}{n\sum Y^2 - (\sum Y)^2} = \frac{3836}{21888} = 0.1753.

Regression of YY on XX: YYˉ=bYX(XXˉ)Y - \bar{Y} = b_{YX}(X-\bar{X})

Y=47.5+5.649(X7.125)Y=7.25+5.649X.Y = 47.5 + 5.649(X - 7.125) \Rightarrow \boxed{Y = 7.25 + 5.649X.}

Regression of XX on YY: XXˉ=bXY(YYˉ)X - \bar{X} = b_{XY}(Y-\bar{Y})

X=7.125+0.1753(Y47.5)X=1.20+0.1753Y.X = 7.125 + 0.1753(Y - 47.5) \Rightarrow \boxed{X = -1.20 + 0.1753Y.}

(c) Estimate at X = 10 and verification

Y^X=10=7.25+5.649(10)=63.7464 marks.\hat{Y}|_{X=10} = 7.25 + 5.649(10) = 63.74 \approx 64 \text{ marks}.

Verification: bYXbXY=5.649×0.1753=0.9903=0.995=r.\sqrt{b_{YX}\cdot b_{XY}} = \sqrt{5.649 \times 0.1753} = \sqrt{0.9903} = 0.995 = r.

Hence the geometric mean of the two regression coefficients equals the correlation coefficient (sign of rr = common sign of the coefficients, here positive).

correlationregressionleast-squares
B

Section B: Short Answer Questions

Attempt all / any as specified.

7 questions
5short6 marks

A discrete random variable XX has the following probability distribution:

X01234
P(X)k2k3k2kk

(a) Find the value of the constant kk. (b) Compute the expectation E(X)E(X) and the variance Var(X)\text{Var}(X). (c) Find P(X2)P(X \geq 2).

(a) Value of kk

Since P(X)=1\sum P(X) = 1:

k+2k+3k+2k+k=9k=1k=19.k + 2k + 3k + 2k + k = 9k = 1 \Rightarrow \boxed{k = \tfrac{1}{9}}.

So the distribution is P(0)=19, P(1)=29, P(2)=39, P(3)=29, P(4)=19.P(0)=\tfrac19,\ P(1)=\tfrac29,\ P(2)=\tfrac39,\ P(3)=\tfrac29,\ P(4)=\tfrac19.

(b) Expectation and variance

E(X)=xP(x)=0+2+6+6+49=189=2.E(X) = \sum xP(x) = \frac{0+2+6+6+4}{9} = \frac{18}{9} = 2. E(X2)=x2P(x)=0+2+12+18+169=489=5.333.E(X^2) = \sum x^2 P(x) = \frac{0+2+12+18+16}{9} = \frac{48}{9} = 5.333. Var(X)=E(X2)[E(X)]2=5.3334=1.333=43.\text{Var}(X) = E(X^2) - [E(X)]^2 = 5.333 - 4 = 1.333 = \tfrac{4}{3}.

(The distribution is symmetric about x=2x=2, consistent with E(X)=2E(X)=2.)

(c) P(X2)P(X \geq 2)

P(X2)=P(2)+P(3)+P(4)=3+2+19=69=230.667.P(X\geq 2) = P(2)+P(3)+P(4) = \frac{3+2+1}{9} = \frac{6}{9} = \frac{2}{3} \approx 0.667.
random-variablesexpectationvariance
6short6 marks

In a certain manufacturing process it is known that on average 4% of the items produced are defective. A random sample of 10 items is selected. Using the binomial distribution, find the probability that the sample contains (a) exactly 2 defective items, (b) at most 1 defective item, and (c) at least 1 defective item. Also state the mean and variance of the number of defectives in such a sample.

Let XX = number of defective items in a sample of n=10n=10, with p=0.04p=0.04 (so q=0.96q=0.96). Then XB(10,0.04)X \sim B(10, 0.04) and

P(X=x)=(10x)(0.04)x(0.96)10x.P(X=x) = \binom{10}{x}(0.04)^x(0.96)^{10-x}.

(a) Exactly 2 defective:

P(X=2)=(102)(0.04)2(0.96)8=45(0.0016)(0.7214)=0.0519.P(X=2) = \binom{10}{2}(0.04)^2(0.96)^8 = 45(0.0016)(0.7214) = 0.0519.

(b) At most 1 defective:

P(X1)=P(0)+P(1)=(0.96)10+10(0.04)(0.96)9P(X\leq 1) = P(0)+P(1) = (0.96)^{10} + 10(0.04)(0.96)^9 =0.6648+0.2770=0.9418.= 0.6648 + 0.2770 = 0.9418.

(c) At least 1 defective:

P(X1)=1P(X=0)=1(0.96)10=10.6648=0.3352.P(X\geq 1) = 1 - P(X=0) = 1 - (0.96)^{10} = 1 - 0.6648 = 0.3352.

Mean and variance:

Mean=np=10×0.04=0.4,Variance=npq=10×0.04×0.96=0.384.\text{Mean} = np = 10\times 0.04 = 0.4,\qquad \text{Variance} = npq = 10\times 0.04\times 0.96 = 0.384.
binomial-distributiondiscrete-distributions
7short6 marks

The number of telephone calls received at an exchange follows a Poisson distribution with an average of 3 calls per minute. (a) Find the probability that no call is received in a given minute. (b) Find the probability that more than 2 calls are received in a given minute. (c) Justify the conditions under which the Poisson distribution may be used as an approximation to the binomial distribution.

Let XX = number of calls per minute, XPoisson(λ=3)X \sim \text{Poisson}(\lambda = 3), with

P(X=x)=eλλxx!=e33xx!.P(X=x) = \frac{e^{-\lambda}\lambda^x}{x!} = \frac{e^{-3}3^x}{x!}.

(a) No call received:

P(X=0)=e3=0.0498.P(X=0) = e^{-3} = 0.0498.

(b) More than 2 calls:

P(X>2)=1[P(0)+P(1)+P(2)]=1e3[1+3+92]P(X>2) = 1 - [P(0)+P(1)+P(2)] = 1 - e^{-3}\left[1 + 3 + \tfrac{9}{2}\right] =1e3(8.5)=10.0498×8.5=10.4232=0.5768.= 1 - e^{-3}(8.5) = 1 - 0.0498\times 8.5 = 1 - 0.4232 = 0.5768.

(c) Poisson as a limit of the binomial. The binomial B(n,p)B(n,p) tends to a Poisson distribution with λ=np\lambda = np when:

  • the number of trials nn is large (nn \to \infty);
  • the probability of success pp is very small (p0p \to 0, i.e. the event is rare);
  • the product np=λnp = \lambda remains finite and constant (moderate).

Under these conditions (nx)pxqnxeλλxx!\binom{n}{x}p^x q^{n-x} \to \dfrac{e^{-\lambda}\lambda^x}{x!}. In practice the approximation is good when n20n \geq 20 and p0.05p \leq 0.05 (often stated as n50, np<5n\ge 50,\ np<5). A key feature is that mean = variance = λ\lambda for the Poisson.

poisson-distributiondiscrete-distributions
8short6 marks

A random sample of 100 light bulbs taken from a large consignment gave a mean life of 1570 hours with a standard deviation of 120 hours. (a) Construct a 95% confidence interval for the true mean life of the bulbs in the consignment. (b) Explain the difference between a point estimate and an interval estimate, and state the properties of a good estimator.

(a) 95% confidence interval for the mean

Given n=100, xˉ=1570, s=120n=100,\ \bar{x}=1570,\ s=120. Since nn is large, use the normal (z) interval with z0.025=1.96z_{0.025}=1.96. Standard error:

SE=sn=120100=12.SE = \frac{s}{\sqrt{n}} = \frac{120}{\sqrt{100}} = 12.

Confidence interval:

xˉ±zSE=1570±1.96(12)=1570±23.52.\bar{x} \pm z\cdot SE = 1570 \pm 1.96(12) = 1570 \pm 23.52. 1546.48 hours<μ<1593.52 hours.\boxed{1546.48 \text{ hours} < \mu < 1593.52 \text{ hours}.}

We are 95% confident that the true mean life of the bulbs lies between about 1546.5 and 1593.5 hours.

(b) Point vs interval estimate; properties of a good estimator

  • Point estimate: a single value of a sample statistic used to estimate the unknown population parameter (e.g. xˉ=1570\bar{x}=1570 for μ\mu). It gives no measure of uncertainty.
  • Interval estimate: a range of values, together with a confidence level, within which the parameter is expected to lie (e.g. 1546.51546.5 to 1593.51593.5 hours at 95%). It conveys the precision/reliability of the estimate.

Properties of a good estimator:

  1. UnbiasednessE(θ^)=θE(\hat{\theta}) = \theta.
  2. Consistencyθ^θ\hat{\theta} \to \theta as nn \to \infty.
  3. Efficiency — smallest variance among unbiased estimators.
  4. Sufficiency — uses all the information about θ\theta contained in the sample.
estimationconfidence-intervalsampling
9short6 marks

A company claims that the mean tensile strength of its steel wire is 580 N/mm². A random sample of 64 wires gave a mean strength of 568 N/mm² with a standard deviation of 40 N/mm². (a) At the 5% level of significance, test whether the data provide sufficient evidence against the company's claim. (b) Clearly state the null and alternative hypotheses, and define Type I and Type II errors in this context.

(a) Z-test for the mean

Given: claimed mean μ0=580\mu_0 = 580, n=64n=64, xˉ=568\bar{x}=568, s=40s=40.

Hypotheses:

H0:μ=580 N/mm2vsH1:μ580 N/mm2 (two-tailed).H_0: \mu = 580 \text{ N/mm}^2 \quad\text{vs}\quad H_1: \mu \neq 580 \text{ N/mm}^2 \ (\text{two-tailed}).

Test statistic (large sample zz-test):

z=xˉμ0s/n=56858040/64=125=2.40.z = \frac{\bar{x}-\mu_0}{s/\sqrt{n}} = \frac{568-580}{40/\sqrt{64}} = \frac{-12}{5} = -2.40.

Critical value: at 5% (two-tailed), z0.025=±1.96z_{0.025} = \pm 1.96.

Decision: z=2.40>1.96|z| = 2.40 > 1.96, so we reject H0H_0.

Conclusion: At the 5% level there is sufficient evidence against the company's claim; the true mean tensile strength differs significantly from (and appears lower than) 580 N/mm².

(b) Hypotheses and error types

  • H0H_0: μ=580\mu = 580 (the wire meets the claimed mean strength).
  • H1H_1: μ580\mu \neq 580 (the mean strength is not 580).
  • Type I error (α\alpha): rejecting H0H_0 when it is actually true — concluding the strength differs from 580 when in fact it is 580 (wrongly rejecting a good claim). Here α=0.05\alpha = 0.05.
  • Type II error (β\beta): failing to reject H0H_0 when it is false — concluding the wire meets 580 N/mm² when in reality it does not.
hypothesis-testingz-test
10short6 marks

A die is thrown 120 times and the following frequencies of the faces are observed:

Face123456
Frequency152218251624

Using the chi-square test of goodness of fit at the 5% level of significance, test whether the die is unbiased. State the degrees of freedom and your conclusion.

Chi-square goodness-of-fit test

Hypotheses: H0H_0: the die is unbiased (each face equally likely) vs H1H_1: the die is biased.

Under H0H_0 each face has probability 16\tfrac16, so the expected frequency for each is

E=120×16=20.E = 120 \times \tfrac16 = 20.
FaceOOEE(OE)2/E(O-E)^2/E
115201.25
222200.20
318200.20
425201.25
516200.80
624200.80
Total1201204.50

Test statistic:

χ2=(OE)2E=4.50.\chi^2 = \sum \frac{(O-E)^2}{E} = 4.50.

Degrees of freedom: df=k1=61=5.df = k - 1 = 6 - 1 = 5.

Critical value: χ0.05,52=11.07.\chi^2_{0.05,\,5} = 11.07.

Decision: Since calculated χ2=4.50<11.07\chi^2 = 4.50 < 11.07, we do not reject H0H_0.

Conclusion: At the 5% level of significance, the observed frequencies are consistent with a fair die — there is no significant evidence that the die is biased.

hypothesis-testingchi-square-testgoodness-of-fit
11short6 marks

Write short notes on any THREE of the following: (a) Moments, skewness and kurtosis. (b) Mutually exclusive and independent events with examples. (c) Probability mass function versus probability density function. (d) Central Limit Theorem and its significance in sampling.

(Any three of the following.)

(a) Moments, skewness and kurtosis

The rr-th central moment is μr=E[(XXˉ)r]\mu_r = E[(X-\bar{X})^r]. The first four describe the distribution: μ1=0\mu_1=0, μ2=σ2\mu_2=\sigma^2 (variance), μ3\mu_3 measures asymmetry, μ4\mu_4 measures peakedness. Skewness measures the lack of symmetry: β1=μ32/μ23\beta_1 = \mu_3^2/\mu_2^3 (or γ1=μ3/σ3\gamma_1=\mu_3/\sigma^3); positive \Rightarrow right tail, negative \Rightarrow left tail, zero \Rightarrow symmetric. Kurtosis measures peakedness relative to the normal: β2=μ4/μ22\beta_2 = \mu_4/\mu_2^2; β2=3\beta_2=3 (mesokurtic/normal), >3>3 leptokurtic (sharp peak), <3<3 platykurtic (flat).

(b) Mutually exclusive vs independent events

  • Mutually exclusive: events that cannot occur together, AB=A\cap B=\varnothing, so P(AB)=0P(A\cap B)=0 and P(AB)=P(A)+P(B)P(A\cup B)=P(A)+P(B). Example: getting a head and a tail on a single coin toss.
  • Independent: the occurrence of one does not affect the probability of the other, P(AB)=P(A)P(B)P(A\cap B)=P(A)P(B). Example: results of two separate coin tosses.
  • Note: two events with non-zero probabilities cannot be both mutually exclusive and independent (if mutually exclusive, knowing AA occurred forces P(BA)=0P(B)P(B|A)=0\ne P(B)).

(c) PMF vs PDF

  • Probability mass function (pmf): for a discrete random variable, p(x)=P(X=x)p(x)=P(X=x), with p(x)0p(x)\ge 0 and xp(x)=1\sum_x p(x)=1. Probabilities attach to individual points.
  • Probability density function (pdf): for a continuous random variable, f(x)0f(x)\ge 0 with f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1. Here P(X=x)=0P(X=x)=0; probability is an area: P(a<X<b)=abf(x)dxP(a<X<b)=\int_a^b f(x)\,dx.

(d) Central Limit Theorem (CLT)

For a population with mean μ\mu and finite variance σ2\sigma^2, the sampling distribution of the sample mean Xˉ\bar{X} of a random sample of size nn tends to a normal distribution N ⁣(μ,σ2/n)N\!\left(\mu, \sigma^2/n\right) as nn becomes large (typically n30n\ge 30), irrespective of the shape of the parent population. Significance: it justifies using normal-based (zz) methods for confidence intervals and hypothesis tests about means even for non-normal populations, and explains why many statistics are approximately normally distributed.

descriptive-statisticsprobability-axiomsrandom-variables

Frequently asked questions

Where can I find the BE Computer Engineering (IOE, TU) Probability and Statistics (IOE, SH 602 / ENSH 301) question paper 2079?
The full BE Computer Engineering (IOE, TU) Probability and Statistics (IOE, SH 602 / ENSH 301) 2079 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Probability and Statistics (IOE, SH 602 / ENSH 301) 2079 paper come with solutions?
Yes. Every question on this Probability and Statistics (IOE, SH 602 / ENSH 301) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BE Computer Engineering (IOE, TU) Probability and Statistics (IOE, SH 602 / ENSH 301) 2079 paper?
The BE Computer Engineering (IOE, TU) Probability and Statistics (IOE, SH 602 / ENSH 301) 2079 paper carries 80 full marks and is meant to be completed in 180 minutes, across 11 questions.
Is practising this Probability and Statistics (IOE, SH 602 / ENSH 301) past paper free?
Yes — reading and attempting this Probability and Statistics (IOE, SH 602 / ENSH 301) past paper on Kekkei is completely free.