BSc CSIT (TU) Science Statistics I (BSc CSIT, STA164) Question Paper 2074 Nepal

Q: Where can I find the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) question paper 2074?

The full BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 (Regular (annual)) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Statistics I (BSc CSIT, STA164) 2074 paper come with solutions?

Yes. Every question on this Statistics I (BSc CSIT, STA164) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 paper?

The BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2074 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.

Q: Is practising this Statistics I (BSc CSIT, STA164) past paper free?

Yes — reading and attempting this Statistics I (BSc CSIT, STA164) past paper on Kekkei is completely free.

Question

1Long answer10 marks

Define statistics. Explain the importance and limitations of statistics. Describe the various measures of central tendency (mean, median, mode) with their merits and demerits.

central-tendencybasics

Answer 1

Statistics

Definition. Statistics is the branch of science that deals with the collection, organization, presentation, analysis and interpretation of numerical data to aid rational decision-making under uncertainty. In the plural sense it means the data themselves; in the singular sense it means the scientific methods used to handle such data.

Importance of Statistics

Planning and policy: Governments and businesses use statistical data for planning, budgeting and forecasting.
Decision-making under uncertainty: Provides tools (estimation, testing) to draw conclusions from samples.
Comparison: Averages and measures of dispersion allow comparison of different groups.
Relationship study: Correlation and regression reveal relationships between variables.
Forecasting: Time-series and trend analysis help predict future values.
Applications: Indispensable in economics, business, biology, computer science (data mining, ML), and research.

Limitations of Statistics

Studies only aggregates, not individuals.
Deals only with quantitative (or quantifiable) data; qualitative facts must be coded.
Results are true on average, not in every individual case.
Liable to misuse by unscrupulous persons; can be misleading if methods are wrong.
Requires expertise; conclusions are probabilistic, not certain.

Measures of Central Tendency

A central value that represents the whole data set.

1. Arithmetic Mean

\bar{x}=\frac{\sum x_i}{n}\quad\text{(ungrouped)},\qquad \bar{x}=\frac{\sum f_i x_i}{\sum f_i}\quad\text{(grouped)}

Merits: rigidly defined, based on all observations, suitable for algebraic treatment, least affected by sampling fluctuation. Demerits: highly affected by extreme values (outliers); cannot be found for open-end classes; may give an impossible value (e.g. 2.5 children).

2. Median

The middle value when data are arranged in order; positional average.

\text{Median}=L+\frac{\tfrac{N}{2}-cf}{f}\times h\quad\text{(grouped)}

Merits: not affected by extreme values; can be found for open-end classes; can be located graphically (ogive). Demerits: not based on all observations; needs arranging data; less suitable for further algebraic treatment.

3. Mode

The value that occurs most frequently.

\text{Mode}=L+\frac{f_1-f_0}{2f_1-f_0-f_2}\times h\quad\text{(grouped)}

Merits: easy to understand; not affected by extreme values; the most typical value; useful for qualitative data. Demerits: ill-defined when data are multimodal or have no repetition; not based on all observations; not suitable for algebraic treatment.

Empirical relation: $\text{Mode}=3\,\text{Median}-2\,\text{Mean}$ for a moderately skewed distribution.

Answer 2

Correlation

Definition. Correlation is the statistical technique that measures the degree and direction of linear relationship between two quantitative variables $X$ and $Y$ . If both increase together it is positive; if one increases while the other decreases it is negative; the value lies in $[-1,+1]$ .

Karl Pearson's Coefficient of Correlation

r=\frac{n\sum xy-\sum x\sum y}{\sqrt{n\sum x^2-(\sum x)^2}\;\sqrt{n\sum y^2-(\sum y)^2}}

Worked example

Let the bivariate data be:

$X$	1	2	3	4	5
$Y$	2	4	5	4	5

$x$	$y$	$x^2$	$y^2$	$xy$
1	2	1	4	2
2	4	4	16	8
3	5	9	25	15
4	4	16	16	16
5	5	25	25	25
15	20	55	86	66

Here $n=5$ , $\sum x=15,\ \sum y=20,\ \sum x^2=55,\ \sum y^2=86,\ \sum xy=66$ .

r=\frac{5(66)-15(20)}{\sqrt{5(55)-15^2}\;\sqrt{5(86)-20^2}}=\frac{330-300}{\sqrt{275-225}\;\sqrt{430-400}}=\frac{30}{\sqrt{50}\sqrt{30}}

r=\frac{30}{\sqrt{1500}}=\frac{30}{38.73}\approx 0.775

Interpretation. $r\approx+0.78$ indicates a fairly strong positive linear correlation: as $X$ increases, $Y$ tends to increase. (With actual exam data, substitute the given values into the same formula.)

Correlation vs Regression

Correlation	Regression
Measures degree/strength of relationship	Measures the nature/form of dependence (predicts one from another)
Symmetric: $r_{xy}=r_{yx}$	Asymmetric: $b_{yx}\ne b_{xy}$ in general
Value lies in $[-1,+1]$ , a pure number	Coefficient has units; line $y=a+bx$
Does not imply cause and effect	Used for estimation/prediction of dependent variable
No distinction between dependent/independent variable	Clear dependent and independent variables

Relation: $r=\pm\sqrt{b_{yx}\cdot b_{xy}}$ .

Answer 3

Probability

Definition (classical). If a random experiment has $n$ equally likely, mutually exclusive and exhaustive outcomes, of which $m$ are favourable to event $A$ , then

P(A)=\frac{m}{n}=\frac{\text{favourable cases}}{\text{total cases}},\qquad 0\le P(A)\le 1.

Addition Theorem

For any two events $A$ and $B$ :

P(A\cup B)=P(A)+P(B)-P(A\cap B).

If $A$ and $B$ are mutually exclusive ( $A\cap B=\varnothing$ ), then

P(A\cup B)=P(A)+P(B).

It gives the probability that at least one of the events occurs.

Multiplication Theorem

For any two events:

P(A\cap B)=P(A)\,P(B\mid A)=P(B)\,P(A\mid B).

If $A$ and $B$ are independent, $P(B\mid A)=P(B)$ , so

P(A\cap B)=P(A)\,P(B).

It gives the probability that both events occur simultaneously.

Bayes' Theorem

If $E_1,E_2,\dots,E_k$ are mutually exclusive and exhaustive events with $P(E_i)>0$ , and $A$ is any event, then for each $i$ :

P(E_i\mid A)=\frac{P(E_i)\,P(A\mid E_i)}{\sum_{j=1}^{k}P(E_j)\,P(A\mid E_j)}.

The $P(E_i)$ are prior probabilities and $P(E_i\mid A)$ are posterior probabilities.

Worked problem

Three machines $E_1,E_2,E_3$ produce 50%, 30% and 20% of the items, with defective rates 3%, 4% and 5% respectively. An item drawn at random is defective. Find the probability it came from machine $E_3$ .

Given: $P(E_1)=0.5,\ P(E_2)=0.3,\ P(E_3)=0.2$ ; $P(A\mid E_1)=0.03,\ P(A\mid E_2)=0.04,\ P(A\mid E_3)=0.05$ .

Total probability of a defective item:

P(A)=0.5(0.03)+0.3(0.04)+0.2(0.05)=0.015+0.012+0.010=0.037.

By Bayes' theorem:

P(E_3\mid A)=\frac{0.2(0.05)}{0.037}=\frac{0.010}{0.037}\approx 0.270.

Result: there is about a 27% chance that the defective item was produced by machine $E_3$ .

Answer 4

Mean (arithmetic mean): the sum of all observations divided by their number, $\bar{x}=\dfrac{\sum x_i}{n}$ . It is the most common average and uses every value.
Median: the middle value of the data arranged in ascending (or descending) order; it divides the data into two equal halves and is unaffected by extreme values.
Mode: the value that occurs most frequently in the data set; a distribution may be unimodal, bimodal or multimodal.

Answer 5

Primary data are data collected originally by the investigator for the first time for a specific purpose (e.g. through surveys, interviews, questionnaires, direct observation or experiments). They are original, more reliable and accurate but costly and time-consuming.

Secondary data are data that have already been collected by someone else and are used by the investigator second-hand (e.g. from published reports, government records, journals, websites). They are cheaper and quicker to obtain but may not exactly fit the purpose and need careful checking for accuracy.

Basis	Primary data	Secondary data
Originality	Original, first-hand	Second-hand
Collected by	The investigator	Someone else
Cost/time	High	Low
Reliability	Higher (if collected well)	Depends on source

Answer 6

Variance is the mean of the squared deviations of observations from their arithmetic mean. It measures how spread out the data are.

\sigma^2=\frac{\sum (x_i-\bar{x})^2}{n}\qquad(\text{population});\qquad s^2=\frac{\sum (x_i-\bar{x})^2}{n-1}\ (\text{sample}).

Standard deviation is the positive square root of the variance; it is the most reliable measure of dispersion and is expressed in the same units as the data.

\sigma=\sqrt{\frac{\sum (x_i-\bar{x})^2}{n}}=\sqrt{\frac{\sum x_i^2}{n}-\bar{x}^2}.

A larger standard deviation/variance means greater scatter of the values about the mean.

Answer 7

Classical (a priori) definition of probability. If a random experiment results in $n$ exhaustive, mutually exclusive and equally likely outcomes, of which $m$ are favourable to the occurrence of an event $A$ , then the probability of $A$ is

P(A)=\frac{m}{n}=\frac{\text{number of favourable outcomes}}{\text{total number of outcomes}}.

Here $0\le P(A)\le 1$ ; $P(A)=0$ for an impossible event and $P(A)=1$ for a certain event. The probability of non-occurrence is $P(\bar{A})=1-P(A)$ .

Limitation: it fails when outcomes are not equally likely or when $n$ is infinite.

Answer 8

A frequency distribution is a tabular arrangement of data that shows how the observations are distributed among different values or class intervals, together with the number of times each value or class occurs (its frequency).

A discrete (ungrouped) frequency distribution lists individual values with their frequencies.
A continuous (grouped) frequency distribution groups data into class intervals (e.g. 0–10, 10–20) with corresponding frequencies.

It condenses raw data into a compact form, making patterns, the most common values and the overall shape of the data easy to study, and forms the basis for graphs such as histograms and for computing averages and dispersion.

Example:

Marks	0–10	10–20	20–30	30–40
No. of students $(f)$	5	12	8	3

Answer 9

The correlation coefficient is a numerical measure of the degree and direction of the linear relationship between two variables $X$ and $Y$ . Karl Pearson's coefficient is defined as

r=\frac{\text{Cov}(x,y)}{\sigma_x\,\sigma_y}=\frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2}\;\sqrt{\sum (y-\bar{y})^2}}.

Properties:

$-1\le r\le +1$ .
$r=+1$ : perfect positive correlation; $r=-1$ : perfect negative correlation; $r=0$ : no linear correlation.
It is a pure number, independent of units and of change of origin and scale.

Answer 10

A histogram is a graphical representation of a continuous (grouped) frequency distribution using a set of adjacent rectangles. Each rectangle is drawn over a class interval on the $X$ -axis, and its area is proportional to the frequency of that class.

For equal class widths, the height of each bar equals the class frequency.
For unequal widths, the height is taken as the frequency density $=\dfrac{\text{frequency}}{\text{class width}}$ so that area stays proportional to frequency.
The bars touch one another (no gaps), reflecting the continuity of data, which distinguishes a histogram from a bar diagram.

It is used to study the shape, central tendency and spread of a distribution, and the mode can be located graphically from it.

Answer 11

Two (or more) events are said to be mutually exclusive (or disjoint) if the occurrence of one prevents the occurrence of the other in the same trial, i.e. they cannot happen simultaneously. In set terms their intersection is empty:

A\cap B=\varnothing\quad\Rightarrow\quad P(A\cap B)=0.

For such events the addition rule simplifies to

P(A\cup B)=P(A)+P(B).

Example: in a single toss of a coin, getting a head and getting a tail are mutually exclusive; in rolling a die, the events {even number} and {odd number} are mutually exclusive.

Answer 12

The coefficient of variation (CV) is a relative measure of dispersion that expresses the standard deviation as a percentage of the mean:

\text{CV}=\frac{\sigma}{\bar{x}}\times 100\%.

Because it is a unitless quantity, it is used to compare the variability (consistency) of two or more series that have different units or very different means.

A higher CV means greater variability and less consistency/uniformity.
A lower CV means less variability and more consistency/stability.

Example: comparing two batsmen, the one with the smaller CV of runs is the more consistent player.

Level	BSc CSIT (TU)
Stream	Science
Subject	Statistics I (BSc CSIT, STA164)
Year	2074 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions