Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

Define statistics. Explain the importance and limitations of statistics. Describe the various measures of central tendency (mean, median, mode) with their merits and demerits.

Statistics

Definition. Statistics is the branch of science that deals with the collection, organization, presentation, analysis and interpretation of numerical data to aid rational decision-making under uncertainty. In the plural sense it means the data themselves; in the singular sense it means the scientific methods used to handle such data.

Importance of Statistics

  • Planning and policy: Governments and businesses use statistical data for planning, budgeting and forecasting.
  • Decision-making under uncertainty: Provides tools (estimation, testing) to draw conclusions from samples.
  • Comparison: Averages and measures of dispersion allow comparison of different groups.
  • Relationship study: Correlation and regression reveal relationships between variables.
  • Forecasting: Time-series and trend analysis help predict future values.
  • Applications: Indispensable in economics, business, biology, computer science (data mining, ML), and research.

Limitations of Statistics

  • Studies only aggregates, not individuals.
  • Deals only with quantitative (or quantifiable) data; qualitative facts must be coded.
  • Results are true on average, not in every individual case.
  • Liable to misuse by unscrupulous persons; can be misleading if methods are wrong.
  • Requires expertise; conclusions are probabilistic, not certain.

Measures of Central Tendency

A central value that represents the whole data set.

1. Arithmetic Mean

xˉ=xin(ungrouped),xˉ=fixifi(grouped)\bar{x}=\frac{\sum x_i}{n}\quad\text{(ungrouped)},\qquad \bar{x}=\frac{\sum f_i x_i}{\sum f_i}\quad\text{(grouped)}

Merits: rigidly defined, based on all observations, suitable for algebraic treatment, least affected by sampling fluctuation. Demerits: highly affected by extreme values (outliers); cannot be found for open-end classes; may give an impossible value (e.g. 2.5 children).

2. Median

The middle value when data are arranged in order; positional average.

Median=L+N2cff×h(grouped)\text{Median}=L+\frac{\tfrac{N}{2}-cf}{f}\times h\quad\text{(grouped)}

Merits: not affected by extreme values; can be found for open-end classes; can be located graphically (ogive). Demerits: not based on all observations; needs arranging data; less suitable for further algebraic treatment.

3. Mode

The value that occurs most frequently.

Mode=L+f1f02f1f0f2×h(grouped)\text{Mode}=L+\frac{f_1-f_0}{2f_1-f_0-f_2}\times h\quad\text{(grouped)}

Merits: easy to understand; not affected by extreme values; the most typical value; useful for qualitative data. Demerits: ill-defined when data are multimodal or have no repetition; not based on all observations; not suitable for algebraic treatment.

Empirical relation: Mode=3Median2Mean\text{Mode}=3\,\text{Median}-2\,\text{Mean} for a moderately skewed distribution.

central-tendencybasics
2long10 marks

Define correlation. Calculate the Karl Pearson coefficient of correlation for a given bivariate data set and interpret the result. Distinguish between correlation and regression.

Correlation

Definition. Correlation is the statistical technique that measures the degree and direction of linear relationship between two quantitative variables XX and YY. If both increase together it is positive; if one increases while the other decreases it is negative; the value lies in [1,+1][-1,+1].

Karl Pearson's Coefficient of Correlation

r=nxyxynx2(x)2  ny2(y)2r=\frac{n\sum xy-\sum x\sum y}{\sqrt{n\sum x^2-(\sum x)^2}\;\sqrt{n\sum y^2-(\sum y)^2}}

Worked example

Let the bivariate data be:

XX12345
YY24545
xxyyx2x^2y2y^2xyxy
12142
244168
3592515
44161616
55252525
1520558666

Here n=5n=5, x=15, y=20, x2=55, y2=86, xy=66\sum x=15,\ \sum y=20,\ \sum x^2=55,\ \sum y^2=86,\ \sum xy=66.

r=5(66)15(20)5(55)152  5(86)202=330300275225  430400=305030r=\frac{5(66)-15(20)}{\sqrt{5(55)-15^2}\;\sqrt{5(86)-20^2}}=\frac{330-300}{\sqrt{275-225}\;\sqrt{430-400}}=\frac{30}{\sqrt{50}\sqrt{30}} r=301500=3038.730.775r=\frac{30}{\sqrt{1500}}=\frac{30}{38.73}\approx 0.775

Interpretation. r+0.78r\approx+0.78 indicates a fairly strong positive linear correlation: as XX increases, YY tends to increase. (With actual exam data, substitute the given values into the same formula.)

Correlation vs Regression

CorrelationRegression
Measures degree/strength of relationshipMeasures the nature/form of dependence (predicts one from another)
Symmetric: rxy=ryxr_{xy}=r_{yx}Asymmetric: byxbxyb_{yx}\ne b_{xy} in general
Value lies in [1,+1][-1,+1], a pure numberCoefficient has units; line y=a+bxy=a+bx
Does not imply cause and effectUsed for estimation/prediction of dependent variable
No distinction between dependent/independent variableClear dependent and independent variables

Relation: r=±byxbxyr=\pm\sqrt{b_{yx}\cdot b_{xy}}.

correlationregression
3long10 marks

Define probability. State and explain the addition and multiplication theorems of probability. State Bayes' theorem and solve a related problem.

Probability

Definition (classical). If a random experiment has nn equally likely, mutually exclusive and exhaustive outcomes, of which mm are favourable to event AA, then

P(A)=mn=favourable casestotal cases,0P(A)1.P(A)=\frac{m}{n}=\frac{\text{favourable cases}}{\text{total cases}},\qquad 0\le P(A)\le 1.

Addition Theorem

For any two events AA and BB:

P(AB)=P(A)+P(B)P(AB).P(A\cup B)=P(A)+P(B)-P(A\cap B).

If AA and BB are mutually exclusive (AB=A\cap B=\varnothing), then

P(AB)=P(A)+P(B).P(A\cup B)=P(A)+P(B).

It gives the probability that at least one of the events occurs.

Multiplication Theorem

For any two events:

P(AB)=P(A)P(BA)=P(B)P(AB).P(A\cap B)=P(A)\,P(B\mid A)=P(B)\,P(A\mid B).

If AA and BB are independent, P(BA)=P(B)P(B\mid A)=P(B), so

P(AB)=P(A)P(B).P(A\cap B)=P(A)\,P(B).

It gives the probability that both events occur simultaneously.

Bayes' Theorem

If E1,E2,,EkE_1,E_2,\dots,E_k are mutually exclusive and exhaustive events with P(Ei)>0P(E_i)>0, and AA is any event, then for each ii:

P(EiA)=P(Ei)P(AEi)j=1kP(Ej)P(AEj).P(E_i\mid A)=\frac{P(E_i)\,P(A\mid E_i)}{\sum_{j=1}^{k}P(E_j)\,P(A\mid E_j)}.

The P(Ei)P(E_i) are prior probabilities and P(EiA)P(E_i\mid A) are posterior probabilities.

Worked problem

Three machines E1,E2,E3E_1,E_2,E_3 produce 50%, 30% and 20% of the items, with defective rates 3%, 4% and 5% respectively. An item drawn at random is defective. Find the probability it came from machine E3E_3.

Given: P(E1)=0.5, P(E2)=0.3, P(E3)=0.2P(E_1)=0.5,\ P(E_2)=0.3,\ P(E_3)=0.2; P(AE1)=0.03, P(AE2)=0.04, P(AE3)=0.05P(A\mid E_1)=0.03,\ P(A\mid E_2)=0.04,\ P(A\mid E_3)=0.05.

Total probability of a defective item:

P(A)=0.5(0.03)+0.3(0.04)+0.2(0.05)=0.015+0.012+0.010=0.037.P(A)=0.5(0.03)+0.3(0.04)+0.2(0.05)=0.015+0.012+0.010=0.037.

By Bayes' theorem:

P(E3A)=0.2(0.05)0.037=0.0100.0370.270.P(E_3\mid A)=\frac{0.2(0.05)}{0.037}=\frac{0.010}{0.037}\approx 0.270.

Result: there is about a 27% chance that the defective item was produced by machine E3E_3.

probabilitybayes
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

Define mean, median and mode.

  • Mean (arithmetic mean): the sum of all observations divided by their number, xˉ=xin\bar{x}=\dfrac{\sum x_i}{n}. It is the most common average and uses every value.
  • Median: the middle value of the data arranged in ascending (or descending) order; it divides the data into two equal halves and is unaffected by extreme values.
  • Mode: the value that occurs most frequently in the data set; a distribution may be unimodal, bimodal or multimodal.
central-tendency
5short5 marks

What is the difference between primary and secondary data?

Primary data are data collected originally by the investigator for the first time for a specific purpose (e.g. through surveys, interviews, questionnaires, direct observation or experiments). They are original, more reliable and accurate but costly and time-consuming.

Secondary data are data that have already been collected by someone else and are used by the investigator second-hand (e.g. from published reports, government records, journals, websites). They are cheaper and quicker to obtain but may not exactly fit the purpose and need careful checking for accuracy.

BasisPrimary dataSecondary data
OriginalityOriginal, first-handSecond-hand
Collected byThe investigatorSomeone else
Cost/timeHighLow
ReliabilityHigher (if collected well)Depends on source
data
6short5 marks

Define standard deviation and variance.

Variance is the mean of the squared deviations of observations from their arithmetic mean. It measures how spread out the data are.

σ2=(xixˉ)2n(population);s2=(xixˉ)2n1 (sample).\sigma^2=\frac{\sum (x_i-\bar{x})^2}{n}\qquad(\text{population});\qquad s^2=\frac{\sum (x_i-\bar{x})^2}{n-1}\ (\text{sample}).

Standard deviation is the positive square root of the variance; it is the most reliable measure of dispersion and is expressed in the same units as the data.

σ=(xixˉ)2n=xi2nxˉ2.\sigma=\sqrt{\frac{\sum (x_i-\bar{x})^2}{n}}=\sqrt{\frac{\sum x_i^2}{n}-\bar{x}^2}.

A larger standard deviation/variance means greater scatter of the values about the mean.

dispersion
7short5 marks

State the classical definition of probability.

Classical (a priori) definition of probability. If a random experiment results in nn exhaustive, mutually exclusive and equally likely outcomes, of which mm are favourable to the occurrence of an event AA, then the probability of AA is

P(A)=mn=number of favourable outcomestotal number of outcomes.P(A)=\frac{m}{n}=\frac{\text{number of favourable outcomes}}{\text{total number of outcomes}}.

Here 0P(A)10\le P(A)\le 1; P(A)=0P(A)=0 for an impossible event and P(A)=1P(A)=1 for a certain event. The probability of non-occurrence is P(Aˉ)=1P(A)P(\bar{A})=1-P(A).

Limitation: it fails when outcomes are not equally likely or when nn is infinite.

probability
8short5 marks

What is a frequency distribution?

A frequency distribution is a tabular arrangement of data that shows how the observations are distributed among different values or class intervals, together with the number of times each value or class occurs (its frequency).

  • A discrete (ungrouped) frequency distribution lists individual values with their frequencies.
  • A continuous (grouped) frequency distribution groups data into class intervals (e.g. 0–10, 10–20) with corresponding frequencies.

It condenses raw data into a compact form, making patterns, the most common values and the overall shape of the data easy to study, and forms the basis for graphs such as histograms and for computing averages and dispersion.

Example:

Marks0–1010–2020–3030–40
No. of students (f)(f)51283
frequency-distribution
9short5 marks

Define correlation coefficient.

The correlation coefficient is a numerical measure of the degree and direction of the linear relationship between two variables XX and YY. Karl Pearson's coefficient is defined as

r=Cov(x,y)σxσy=(xxˉ)(yyˉ)(xxˉ)2  (yyˉ)2.r=\frac{\text{Cov}(x,y)}{\sigma_x\,\sigma_y}=\frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2}\;\sqrt{\sum (y-\bar{y})^2}}.

Properties:

  • 1r+1-1\le r\le +1.
  • r=+1r=+1: perfect positive correlation; r=1r=-1: perfect negative correlation; r=0r=0: no linear correlation.
  • It is a pure number, independent of units and of change of origin and scale.
correlation
10short5 marks

What is a histogram?

A histogram is a graphical representation of a continuous (grouped) frequency distribution using a set of adjacent rectangles. Each rectangle is drawn over a class interval on the XX-axis, and its area is proportional to the frequency of that class.

  • For equal class widths, the height of each bar equals the class frequency.
  • For unequal widths, the height is taken as the frequency density =frequencyclass width=\dfrac{\text{frequency}}{\text{class width}} so that area stays proportional to frequency.
  • The bars touch one another (no gaps), reflecting the continuity of data, which distinguishes a histogram from a bar diagram.

It is used to study the shape, central tendency and spread of a distribution, and the mode can be located graphically from it.

graphs
11short5 marks

Define mutually exclusive events.

Two (or more) events are said to be mutually exclusive (or disjoint) if the occurrence of one prevents the occurrence of the other in the same trial, i.e. they cannot happen simultaneously. In set terms their intersection is empty:

AB=P(AB)=0.A\cap B=\varnothing\quad\Rightarrow\quad P(A\cap B)=0.

For such events the addition rule simplifies to

P(AB)=P(A)+P(B).P(A\cup B)=P(A)+P(B).

Example: in a single toss of a coin, getting a head and getting a tail are mutually exclusive; in rolling a die, the events {even number} and {odd number} are mutually exclusive.

probability
12short5 marks

What is the coefficient of variation?

The coefficient of variation (CV) is a relative measure of dispersion that expresses the standard deviation as a percentage of the mean:

CV=σxˉ×100%.\text{CV}=\frac{\sigma}{\bar{x}}\times 100\%.

Because it is a unitless quantity, it is used to compare the variability (consistency) of two or more series that have different units or very different means.

  • A higher CV means greater variability and less consistency/uniformity.
  • A lower CV means less variability and more consistency/stability.

Example: comparing two batsmen, the one with the smaller CV of runs is the more consistent player.

dispersion

Frequently asked questions

Where can I find the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) question paper 2074?
The full BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistics I (BSc CSIT, STA164) 2074 paper come with solutions?
Yes. Every question on this Statistics I (BSc CSIT, STA164) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 paper?
The BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
Is practising this Statistics I (BSc CSIT, STA164) past paper free?
Yes — reading and attempting this Statistics I (BSc CSIT, STA164) past paper on Kekkei is completely free.