Browse papers
A

Section A: Long Answer Questions

Attempt all / any as specified.

4 questions
1long14 marks

The following data represent the time (in milliseconds) taken by a web server to respond to 40 requests during a load test:

Response time (ms)10–2020–3030–4040–5050–6060–70
Number of requests4712953

(a) Compute the arithmetic mean, median and mode of the response times. (6)

(b) Compute the standard deviation and the coefficient of variation, and comment on the consistency of the server's response. (5)

(c) Calculate the coefficient of skewness (Karl Pearson's) and interpret the shape of the distribution. (3)

We use mid-points xx of each class; N=f=40N=\sum f = 40.

Classxxfffxfxc.f.f(xxˉ)2f(x-\bar x)^2
10–201546042162.25
20–30257175111225.00
30–40351242023126.75
40–5045940532414.56
50–60555275371404.69
60–70653195402144.19
Total4015307477.50

(a) Mean, Median, Mode

Mean: xˉ=fxN=153040=38.25\bar x = \dfrac{\sum fx}{N} = \dfrac{1530}{40} = 38.25 ms.

Median: N/2=20N/2 = 20 lies in class 30–40 (c.f. just exceeds 20). With L=30, F=11, fm=12, h=10L=30,\ F=11,\ f_m=12,\ h=10:

Median=L+N/2Ffmh=30+201112×10=37.5 ms.\text{Median} = L + \frac{N/2 - F}{f_m}\,h = 30 + \frac{20-11}{12}\times 10 = 37.5 \text{ ms}.

Mode: modal class is 30–40 (highest f=12f=12). With f1=12, f0=7, f2=9f_1=12,\ f_0=7,\ f_2=9:

Mode=L+f1f02f1f0f2h=30+1272479×10=30+6.25=36.25 ms.\text{Mode} = L + \frac{f_1-f_0}{2f_1-f_0-f_2}\,h = 30 + \frac{12-7}{24-7-9}\times 10 = 30 + 6.25 = 36.25 \text{ ms}.

(b) Standard deviation and coefficient of variation

σ=f(xxˉ)2N=7477.540=186.94=13.67 ms.\sigma = \sqrt{\frac{\sum f(x-\bar x)^2}{N}} = \sqrt{\frac{7477.5}{40}} = \sqrt{186.94} = 13.67 \text{ ms}. C.V.=σxˉ×100=13.6738.25×100=35.7%.\text{C.V.} = \frac{\sigma}{\bar x}\times 100 = \frac{13.67}{38.25}\times 100 = 35.7\%.

Comment: A C.V. of about 35.7%35.7\% is fairly high, so the server's response times show considerable relative variability — the response is not very consistent; latency fluctuates substantially across requests.

(c) Karl Pearson's coefficient of skewness

Sk=xˉModeσ=38.2536.2513.67=+0.146.S_k = \frac{\bar x - \text{Mode}}{\sigma} = \frac{38.25 - 36.25}{13.67} = +0.146.

Since Sk>0S_k > 0 (and Mean>Median>Mode\text{Mean} > \text{Median} > \text{Mode}), the distribution is slightly positively (right) skewed: most response times are moderate, with a thin tail of longer response times pulling the mean above the mode.

descriptive-statistics
2long14 marks

A software company procures identical microcontroller chips from three suppliers A, B and C, which supply 50%, 30% and 20% of the total stock respectively. From past experience it is known that 2%, 3% and 4% of the chips from suppliers A, B and C respectively are defective.

(a) State Bayes' theorem and the theorem of total probability, clearly explaining the meaning of prior and posterior probability. (4)

(b) A chip is selected at random from the combined stock. Find the probability that it is defective. (4)

(c) Given that a randomly chosen chip is found to be defective, find the probability that it was supplied by supplier C. (4)

(d) Which supplier is most likely to be the source of a defective chip? Justify your answer. (2)

Let A,B,CA,B,C be the events that a chip comes from suppliers A, B, C, and DD the event that the chip is defective.

P(A)=0.50, P(B)=0.30, P(C)=0.20,P(DA)=0.02, P(DB)=0.03, P(DC)=0.04.P(A)=0.50,\ P(B)=0.30,\ P(C)=0.20,\quad P(D|A)=0.02,\ P(D|B)=0.03,\ P(D|C)=0.04.

(a) Theorems and prior/posterior probability

Theorem of total probability: If A1,,AnA_1,\dots,A_n are mutually exclusive and exhaustive events with P(Ai)>0P(A_i)>0, then for any event DD:

P(D)=i=1nP(Ai)P(DAi).P(D)=\sum_{i=1}^{n} P(A_i)\,P(D|A_i).

Bayes' theorem: For any AkA_k,

P(AkD)=P(Ak)P(DAk)iP(Ai)P(DAi).P(A_k|D)=\frac{P(A_k)\,P(D|A_k)}{\sum_{i} P(A_i)\,P(D|A_i)}.
  • The prior probability P(Ak)P(A_k) is the probability assigned to a hypothesis before observing the evidence (here, the proportion of stock from each supplier).
  • The posterior probability P(AkD)P(A_k|D) is the revised probability of the hypothesis after the evidence DD (a defective chip) has been observed.

(b) Probability the chip is defective

P(D)=P(A)P(DA)+P(B)P(DB)+P(C)P(DC)P(D)=P(A)P(D|A)+P(B)P(D|B)+P(C)P(D|C) =0.50(0.02)+0.30(0.03)+0.20(0.04)=0.010+0.009+0.008=0.027.=0.50(0.02)+0.30(0.03)+0.20(0.04)=0.010+0.009+0.008=0.027.

So P(D)=0.027P(D)=0.027 (i.e. 2.7%2.7\%).

(c) Probability it came from supplier C, given defective

P(CD)=P(C)P(DC)P(D)=0.0080.027=0.296.P(C|D)=\frac{P(C)P(D|C)}{P(D)}=\frac{0.008}{0.027}=0.296.

(d) Most likely source of a defective chip

P(AD)=0.0100.027=0.370,P(BD)=0.0090.027=0.333,P(CD)=0.296.P(A|D)=\frac{0.010}{0.027}=0.370,\quad P(B|D)=\frac{0.009}{0.027}=0.333,\quad P(C|D)=0.296.

The largest posterior is P(AD)=0.370P(A|D)=0.370, so a defective chip is most likely to have come from supplier A. Although A has the lowest defect rate, it supplies half of all chips, so it contributes the most defective units in absolute terms.

bayes-theoremprobability-theory
3long14 marks

The following table shows the number of hours (X) eight students spent practising programming per week and their scores (Y) out of 100 in a coding assessment:

X (hours)2456891112
Y (score)4048556070728590

(a) Compute the Karl Pearson coefficient of correlation between X and Y and interpret it. (5)

(b) Fit the least-squares regression line of Y on X. (5)

(c) Estimate the expected score of a student who practises 10 hours per week, and explain the meaning of the coefficient of determination for this fit. (4)

n=8n=8. Computing sums:

X=57, Y=520, XY=4130, X2=491, Y2=35938.\sum X=57,\ \sum Y=520,\ \sum XY=4130,\ \sum X^2=491,\ \sum Y^2=35938. Xˉ=57/8=7.125,Yˉ=520/8=65.\bar X=57/8=7.125,\quad \bar Y=520/8=65.

(a) Karl Pearson coefficient of correlation

r=nXYXY[nX2(X)2][nY2(Y)2]r=\frac{n\sum XY-\sum X\sum Y}{\sqrt{[n\sum X^2-(\sum X)^2][n\sum Y^2-(\sum Y)^2]}} =8(4130)57(520)[8(491)572][8(35938)5202]=3304029640(39283249)(287504270400)=\frac{8(4130)-57(520)}{\sqrt{[8(491)-57^2][8(35938)-520^2]}}=\frac{33040-29640}{\sqrt{(3928-3249)(287504-270400)}} =3400679×17104=34003407.9=0.998.=\frac{3400}{\sqrt{679\times 17104}}=\frac{3400}{3407.9}=0.998.

Interpretation: r+0.998r\approx +0.998 indicates a very strong positive linear correlation — students who practise more hours score higher, almost perfectly linearly.

(b) Least-squares regression line of Y on X

bYX=nXYXYnX2(X)2=3400679=5.007.b_{YX}=\frac{n\sum XY-\sum X\sum Y}{n\sum X^2-(\sum X)^2}=\frac{3400}{679}=5.007. a=YˉbYXXˉ=655.007(7.125)=6535.68=29.32.a=\bar Y-b_{YX}\bar X=65-5.007(7.125)=65-35.68=29.32.

Regression line:

Y^=29.32+5.007X.\boxed{\hat Y = 29.32 + 5.007\,X.}

(c) Prediction and coefficient of determination

For X=10X=10 hours:

Y^=29.32+5.007(10)=29.32+50.07=79.4.\hat Y = 29.32 + 5.007(10) = 29.32 + 50.07 = 79.4.

The expected score is about 79 out of 100.

Coefficient of determination: r2=(0.998)2=0.995r^2 = (0.998)^2 = 0.995. This means about 99.5% of the variation in assessment scores is explained by the variation in practice hours through the fitted line; only 0.5%0.5\% is due to other factors. The very high r2r^2 confirms the line fits the data extremely well.

regression-correlation
4long14 marks

A manufacturer claims that the mean lifetime of its LED bulbs is at least 8000 hours. A quality-control engineer tests a random sample of 36 bulbs and finds a sample mean lifetime of 7820 hours with a sample standard deviation of 480 hours.

(a) Explain the difference between a Type I and a Type II error, and define the level of significance and the p-value of a test. (4)

(b) Formulate the null and alternative hypotheses and, at the 5% level of significance, test whether the manufacturer's claim is justified. (6)

(c) Construct a 95% confidence interval for the true mean lifetime of the bulbs and state whether it is consistent with your conclusion in part (b). (4)

Given: μ0=8000\mu_0=8000 h (claim: at least 8000), xˉ=7820\bar x=7820 h, s=480s=480 h, n=36n=36.

(a) Errors, level of significance, p-value

  • Type I error (α\alpha): rejecting the null hypothesis H0H_0 when it is actually true (a 'false alarm').
  • Type II error (β\beta): failing to reject H0H_0 when it is actually false (a 'missed detection').
  • Level of significance α\alpha: the maximum probability of committing a Type I error that we are willing to tolerate (here 0.050.05); it fixes the rejection region.
  • p-value: the probability, assuming H0H_0 is true, of obtaining a test statistic at least as extreme as the one observed. If p-value <α< \alpha, we reject H0H_0.

(b) Hypothesis test (5% level)

H0:μ8000(claim true)H1:μ<8000(left-tailed).H_0:\mu \ge 8000 \quad\text{(claim true)}\qquad H_1:\mu < 8000 \quad\text{(left-tailed)}.

Since n=36n=36 is large, use the zz-test (using ss for σ\sigma):

z=xˉμ0s/n=78208000480/36=18080=2.25.z=\frac{\bar x-\mu_0}{s/\sqrt n}=\frac{7820-8000}{480/\sqrt{36}}=\frac{-180}{80}=-2.25.

Critical value for a left-tailed test at α=0.05\alpha=0.05 is z0.05=1.645-z_{0.05}=-1.645.

Since z=2.25<1.645z=-2.25 < -1.645, the test statistic falls in the rejection region (equivalently p-value =P(Z<2.25)0.0122<0.05=P(Z<-2.25)\approx 0.0122 < 0.05).

Conclusion: Reject H0H_0. There is sufficient evidence at the 5% level that the true mean lifetime is less than 8000 hours, so the manufacturer's claim is not justified.

(c) 95% confidence interval

xˉ±z0.025sn=7820±1.96×80=7820±156.8.\bar x \pm z_{0.025}\,\frac{s}{\sqrt n}=7820 \pm 1.96\times 80 = 7820 \pm 156.8. CI=(7663.2, 7976.8) hours.\text{CI} = (7663.2,\ 7976.8) \text{ hours}.

The entire interval lies below 8000 hours; 8000 is not contained in it. This is consistent with part (b) — at the 5% level the data are incompatible with a mean of 8000 h or more, confirming the claim is not supported.

hypothesis-testingsampling-estimation
B

Section B: Short Answer Questions

Attempt all / any as specified.

8 questions
5short6 marks

Two cards are drawn one after another without replacement from a well-shuffled standard deck of 52 playing cards. Find the probability that (a) both cards are aces, (b) the first is a king and the second is a queen, and (c) at least one of the two cards is a spade. State clearly any addition or multiplication rule of probability you use.

Drawing without replacement, so the second draw's probability is conditional on the first; we use the multiplication rule P(E1E2)=P(E1)P(E2E1)P(E_1\cap E_2)=P(E_1)P(E_2|E_1), and for (c) the complement/addition idea.

(a) Both aces. There are 4 aces.

P(both aces)=452×351=122652=12210.00452.P(\text{both aces})=\frac{4}{52}\times\frac{3}{51}=\frac{12}{2652}=\frac{1}{221}\approx 0.00452.

(b) First king, then queen.

P=452×451=162652=46630.00603.P=\frac{4}{52}\times\frac{4}{51}=\frac{16}{2652}=\frac{4}{663}\approx 0.00603.

(c) At least one spade. Easiest via the complement (no spade in either draw). There are 39 non-spades.

P(no spade)=3952×3851=14822652=19340.559.P(\text{no spade})=\frac{39}{52}\times\frac{38}{51}=\frac{1482}{2652}=\frac{19}{34}\approx 0.559. P(at least one spade)=11934=15340.441.P(\text{at least one spade})=1-\frac{19}{34}=\frac{15}{34}\approx 0.441.
probability-theory
6short6 marks

In a digital communication channel, bits are transmitted independently and each bit is received in error with probability 0.01. A packet consists of 100 bits. Using a suitable approximation, find the probability that a packet contains (a) no error and (b) at most two errors. State the distribution you used and justify why the approximation is appropriate.

Each bit errs independently with p=0.01p=0.01, n=100n=100. The number of errors XX is Binomial(100,0.01)\text{Binomial}(100,0.01). Since nn is large and pp is small with np=1np=1 moderate, we approximate by the Poisson distribution with mean

λ=np=100×0.01=1.\lambda = np = 100\times 0.01 = 1.

The approximation is appropriate because n20n\ge 20 (large), p0.05p\le 0.05 (small/rare event), and np<10np<10, the standard conditions for the Poisson approximation to the binomial.

Poisson pmf: P(X=k)=eλλkk!P(X=k)=\dfrac{e^{-\lambda}\lambda^{k}}{k!} with λ=1\lambda=1.

(a) No error (k=0k=0):

P(X=0)=e1=0.3679.P(X=0)=e^{-1}=0.3679.

(b) At most two errors (k=0,1,2k=0,1,2):

P(X2)=e1 ⁣(1+1+12)=e1(2.5)=2.5×0.3679=0.9197.P(X\le 2)=e^{-1}\!\left(1+1+\tfrac{1}{2}\right)=e^{-1}(2.5)=2.5\times 0.3679=0.9197.

So about 36.8%36.8\% of packets are error-free and about 92.0%92.0\% contain at most two errors.

probability-distributions
7short6 marks

The lengths of bolts produced by a machine are normally distributed with mean 50 mm and standard deviation 1.5 mm. A bolt is acceptable if its length lies between 47.5 mm and 52.5 mm. (a) What percentage of bolts produced are acceptable? (b) If 2000 bolts are produced in a shift, how many are expected to be rejected? (Use standard normal tables.)

Let XX = bolt length, XN(μ=50, σ=1.5)X\sim N(\mu=50,\ \sigma=1.5). Standardise with Z=XμσZ=\dfrac{X-\mu}{\sigma}.

(a) Percentage acceptable (between 47.5 and 52.5 mm)

Z1=47.5501.5=1.67,Z2=52.5501.5=+1.67.Z_1=\frac{47.5-50}{1.5}=-1.67,\qquad Z_2=\frac{52.5-50}{1.5}=+1.67. P(47.5<X<52.5)=P(1.67<Z<1.67)=2Φ(1.67)1.P(47.5<X<52.5)=P(-1.67<Z<1.67)=2\,\Phi(1.67)-1.

From the standard normal table Φ(1.67)=0.9525\Phi(1.67)=0.9525, so

P=2(0.9525)1=0.9050.P=2(0.9525)-1=0.9050.

About 90.5%90.5\% of bolts are acceptable.

(b) Expected number rejected out of 2000

Fraction rejected =10.9050=0.0950=1-0.9050=0.0950.

Rejected=2000×0.0950190 bolts.\text{Rejected}=2000\times 0.0950 \approx 190 \text{ bolts.}

So roughly 190 bolts per shift are expected to be rejected.

probability-distributions
8short6 marks

A continuous random variable X has the probability density function

f(x)={kx(2x),0x20,otherwisef(x) = \begin{cases} kx(2-x), & 0 \le x \le 2 \\ 0, & \text{otherwise} \end{cases}

(a) Determine the value of the constant k. (b) Find the mean E(X) and the variance Var(X) of X. (c) Compute P(X > 1).

f(x)=kx(2x)f(x)=kx(2-x) for 0x20\le x\le 2, else 0.

(a) Value of k

A pdf integrates to 1:

02kx(2x)dx=k02(2xx2)dx=k[x2x33]02=k(483)=k43=1.\int_0^2 kx(2-x)\,dx = k\int_0^2 (2x-x^2)\,dx = k\Big[x^2-\tfrac{x^3}{3}\Big]_0^2 = k\Big(4-\tfrac{8}{3}\Big)=k\cdot\tfrac{4}{3}=1. k=34.\Rightarrow k=\frac{3}{4}.

(b) Mean and variance

Mean:

E(X)=02x34x(2x)dx=3402(2x2x3)dx=34[2x33x44]02=34(1634)=3443=1.E(X)=\int_0^2 x\cdot\tfrac34 x(2-x)\,dx=\tfrac34\int_0^2(2x^2-x^3)\,dx=\tfrac34\Big[\tfrac{2x^3}{3}-\tfrac{x^4}{4}\Big]_0^2=\tfrac34\Big(\tfrac{16}{3}-4\Big)=\tfrac34\cdot\tfrac43=1.

(By symmetry of x(2x)x(2-x) about x=1x=1, E(X)=1E(X)=1 as expected.)

E(X2)E(X^2):

E(X2)=3402(2x3x4)dx=34[x42x55]02=34(8325)=3485=65=1.2.E(X^2)=\tfrac34\int_0^2(2x^3-x^4)\,dx=\tfrac34\Big[\tfrac{x^4}{2}-\tfrac{x^5}{5}\Big]_0^2=\tfrac34\Big(8-\tfrac{32}{5}\Big)=\tfrac34\cdot\tfrac{8}{5}=\tfrac{6}{5}=1.2.

Variance:

Var(X)=E(X2)[E(X)]2=1.21=0.2=15.\text{Var}(X)=E(X^2)-[E(X)]^2=1.2-1=0.2=\tfrac15.

(c) P(X > 1)

By symmetry about x=1x=1, P(X>1)=12P(X>1)=\tfrac12. Verifying:

P(X>1)=3412(2xx2)dx=34[x2x33]12=34[(483)(113)]=3423=0.5.P(X>1)=\tfrac34\int_1^2(2x-x^2)\,dx=\tfrac34\Big[x^2-\tfrac{x^3}{3}\Big]_1^2=\tfrac34\Big[(4-\tfrac83)-(1-\tfrac13)\Big]=\tfrac34\cdot\tfrac23=0.5.
random-variables
9short6 marks

(a) Distinguish between a parameter and a statistic, and explain the terms 'sampling distribution' and 'standard error' with an example. (3)

(b) State the Central Limit Theorem and explain its significance in statistical inference for large samples. (3)

(a) Parameter vs statistic; sampling distribution; standard error

  • A parameter is a numerical characteristic of the population (e.g. population mean μ\mu, proportion pp); it is usually fixed and unknown.
  • A statistic is a numerical characteristic computed from a sample (e.g. sample mean xˉ\bar x, sample proportion p^\hat p); it varies from sample to sample and is used to estimate the parameter.
  • The sampling distribution is the probability distribution of a statistic over all possible samples of a given size nn from the population. Example: the distribution of xˉ\bar x from repeated samples of size 36.
  • The standard error is the standard deviation of that sampling distribution. For the mean, SE(xˉ)=σ/n\text{SE}(\bar x)=\sigma/\sqrt n; it measures the typical sampling fluctuation of the estimate and decreases as nn increases.

(b) Central Limit Theorem

Statement: If X1,X2,,XnX_1,X_2,\dots,X_n are independent, identically distributed random variables with mean μ\mu and finite variance σ2\sigma^2, then for large nn the sampling distribution of the sample mean Xˉ\bar X is approximately normal:

Xˉ    N ⁣(μ, σ2n),equivalentlyZ=Xˉμσ/nN(0,1).\bar X \;\approx\; N\!\left(\mu,\ \frac{\sigma^2}{n}\right),\quad\text{equivalently}\quad Z=\frac{\bar X-\mu}{\sigma/\sqrt n}\to N(0,1).

Significance: It holds regardless of the shape of the population distribution (as long as variance is finite). This justifies using normal-based zz-tests and confidence intervals for means and proportions in large samples, even when the underlying data are non-normal — making it the foundation of large-sample statistical inference.

sampling-estimation
10short6 marks

In a survey, 360 out of 600 randomly selected smartphone users said they prefer Android over iOS. Test, at the 1% level of significance, the hypothesis that 50% of all users prefer Android. State your hypotheses, the test statistic used, and your conclusion.

Large-sample test of a single proportion. n=600n=600, observed p^=360/600=0.60\hat p = 360/600 = 0.60, claimed p0=0.50p_0=0.50.

Hypotheses (two-tailed):

H0:p=0.50H1:p0.50.H_0: p = 0.50 \qquad H_1: p \ne 0.50.

Test statistic (standard normal zz):

z=p^p0p0(1p0)/n=0.600.500.5×0.5/600=0.100.0004167=0.100.02041=4.90.z=\frac{\hat p - p_0}{\sqrt{p_0(1-p_0)/n}}=\frac{0.60-0.50}{\sqrt{0.5\times 0.5/600}}=\frac{0.10}{\sqrt{0.0004167}}=\frac{0.10}{0.02041}=4.90.

Critical value: at α=0.01\alpha=0.01 two-tailed, z0.005=±2.576z_{0.005}=\pm 2.576.

Since z=4.90>2.576|z|=4.90 > 2.576, the statistic falls in the rejection region (p-value 106\approx 10^{-6}, far below 0.01).

Conclusion: Reject H0H_0. There is highly significant evidence at the 1% level that the proportion of users preferring Android is not 50% — in fact it is significantly greater than 50% (about 60%).

hypothesis-testing
11short6 marks

A die is rolled 120 times and the following frequencies of the faces are observed:

Face123456
Frequency152218242120

Using the chi-square goodness-of-fit test at the 5% level of significance, test whether the die is fair.

Chi-square goodness-of-fit test for a fair die.

Hypotheses: H0H_0: the die is fair (each face equally likely, pi=1/6p_i=1/6); H1H_1: the die is not fair.

Total rolls =120=120, so expected frequency Ei=120/6=20E_i = 120/6 = 20 for every face.

FaceOiO_iEiE_i(OiEi)2/Ei(O_i-E_i)^2/E_i
115201.25
222200.20
318200.20
424200.80
521200.05
620200.00
Total1201202.50

Test statistic:

χ2=(OiEi)2Ei=2.50.\chi^2=\sum \frac{(O_i-E_i)^2}{E_i}=2.50.

Degrees of freedom: df=k1=61=5df = k-1 = 6-1 = 5. Critical value χ0.05,52=11.07\chi^2_{0.05,5}=11.07.

Since χ2=2.50<11.07\chi^2 = 2.50 < 11.07, we do not reject H0H_0.

Conclusion: The observed frequencies are consistent with a uniform distribution; there is no significant evidence at the 5% level that the die is biased — the die may be regarded as fair.

hypothesis-testingprobability-distributions
12short6 marks

Write short notes on any TWO of the following: (a) Skewness and kurtosis as measures of the shape of a distribution; (b) Mathematical expectation and its properties for a discrete random variable; (c) Mutually exclusive events versus independent events, with one example of each.

(Answer any two.)

(a) Skewness and kurtosis

Skewness measures the asymmetry of a distribution about its mean.

  • Symmetric: skewness =0=0 (mean = median = mode).
  • Positive (right) skew: long right tail, mean > median > mode.
  • Negative (left) skew: long left tail, mean < median < mode. A common measure is Karl Pearson's Sk=(xˉmode)/σS_k=(\bar x-\text{mode})/\sigma.

Kurtosis measures the peakedness and tail-heaviness relative to the normal curve, via β2=μ4/μ22\beta_2=\mu_4/\mu_2^2.

  • β2=3\beta_2=3: mesokurtic (normal).
  • β2>3\beta_2>3: leptokurtic (sharp peak, heavy tails).
  • β2<3\beta_2<3: platykurtic (flat peak, light tails).

(b) Mathematical expectation (discrete RV)

For a discrete random variable XX with pmf P(X=xi)=piP(X=x_i)=p_i, the expectation (mean) is

E(X)=ixipi.E(X)=\sum_i x_i\,p_i.

It is the long-run average value of XX. Properties:

  1. E(c)=cE(c)=c for a constant cc.
  2. E(aX+b)=aE(X)+bE(aX+b)=a\,E(X)+b (linearity).
  3. E(X+Y)=E(X)+E(Y)E(X+Y)=E(X)+E(Y) (additivity, always).
  4. If X,YX,Y are independent, E(XY)=E(X)E(Y)E(XY)=E(X)E(Y).
  5. Var(X)=E(X2)[E(X)]2\text{Var}(X)=E(X^2)-[E(X)]^2.

(c) Mutually exclusive vs independent events

  • Mutually exclusive events cannot occur together: AB=A\cap B=\varnothing, so P(AB)=0P(A\cap B)=0 and P(AB)=P(A)+P(B)P(A\cup B)=P(A)+P(B). Example: rolling a 2 and rolling a 5 on a single die throw.
  • Independent events: the occurrence of one does not affect the other, so P(AB)=P(A)P(B)P(A\cap B)=P(A)P(B). Example: getting a head on a coin toss and a 6 on a die roll.

Note: two events with non-zero probabilities cannot be both mutually exclusive and independent — if mutually exclusive, knowing one occurred tells you the other did not.

descriptive-statisticsrandom-variables

Frequently asked questions

Where can I find the BE Computer Engineering (Pokhara University) Probability and Statistics (PU, MTH 216) question paper 2078?
The full BE Computer Engineering (Pokhara University) Probability and Statistics (PU, MTH 216) 2078 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Probability and Statistics (PU, MTH 216) 2078 paper come with solutions?
Yes. Every question on this Probability and Statistics (PU, MTH 216) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BE Computer Engineering (Pokhara University) Probability and Statistics (PU, MTH 216) 2078 paper?
The BE Computer Engineering (Pokhara University) Probability and Statistics (PU, MTH 216) 2078 paper carries 100 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Probability and Statistics (PU, MTH 216) past paper free?
Yes — reading and attempting this Probability and Statistics (PU, MTH 216) past paper on Kekkei is completely free.