BSc CSIT (TU) Science Statistics I (BSc CSIT, STA164) Question Paper 2075 Nepal

Q: Where can I find the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) question paper 2075?

The full BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 (Regular (annual)) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Statistics I (BSc CSIT, STA164) 2075 paper come with solutions?

Yes. Every question on this Statistics I (BSc CSIT, STA164) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 paper?

The BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.

Q: Is practising this Statistics I (BSc CSIT, STA164) past paper free?

Yes — reading and attempting this Statistics I (BSc CSIT, STA164) past paper on Kekkei is completely free.

Question

1Long answer10 marks

Define dispersion. Explain different measures of dispersion. Compute the standard deviation and coefficient of variation for a given frequency distribution and comment on consistency.

dispersionstandard-deviation

Answer 1

Dispersion

Dispersion (or variation) measures the extent to which individual observations in a data set are scattered about a central value (mean/median). A small dispersion means the data are clustered closely around the average; a large dispersion means they are spread out. Dispersion describes the consistency, reliability, and homogeneity of a series.

Measures of Dispersion

1. Absolute measures (expressed in the same units as the data):

Range $= L - S$ (largest value minus smallest value). Simplest but unstable.
Quartile Deviation (Semi-IQR) $= \dfrac{Q_3 - Q_1}{2}$ . Based on the middle 50% of data.
Mean Deviation $= \dfrac{\sum f|x-\bar{x}|}{N}$ . Average of absolute deviations from the mean (or median).
Standard Deviation $\sigma = \sqrt{\dfrac{\sum f(x-\bar{x})^2}{N}}$ . The most important and widely used measure.

2. Relative measures (unit-free, used to compare two series):

Coefficient of Range $= \dfrac{L-S}{L+S}$
Coefficient of Quartile Deviation $= \dfrac{Q_3-Q_1}{Q_3+Q_1}$
Coefficient of Variation $C.V. = \dfrac{\sigma}{\bar{x}}\times 100\%$

Worked Computation (illustrative frequency distribution)

Class	$f$	Mid $x$	$fx$	$fx^2$
0–10	5	5	25	125
10–20	8	15	120	1800
20–30	15	25	375	9375
30–40	7	35	245	8575
40–50	5	45	225	10125
Total	40		990	30000

Mean: $\bar{x} = \dfrac{\sum fx}{N} = \dfrac{990}{40} = 24.75$

Standard deviation:

\sigma = \sqrt{\frac{\sum fx^2}{N} - \left(\frac{\sum fx}{N}\right)^2} = \sqrt{\frac{30000}{40} - (24.75)^2}

= \sqrt{750 - 612.56} = \sqrt{137.44} = 11.72

Coefficient of variation:

C.V. = \frac{\sigma}{\bar{x}}\times 100 = \frac{11.72}{24.75}\times 100 = 47.36\%

Comment on Consistency

The series whose C.V. is smaller is more consistent / uniform / stable, while a higher C.V. indicates greater variability. (When comparing two distributions, compute C.V. for each and conclude that the one with the lower C.V. is more consistent.) Here, a C.V. of about $47\%$ indicates a fairly high degree of variability in the data.

Answer 2

Method of Least Squares

The method of least squares fits a line (or curve) to data so that the sum of squares of the vertical deviations of observed points from the fitted line is a minimum. For the line of regression of $Y$ on $X$ , written as

Y = a + bX,

we minimise $S = \sum (Y_i - a - bX_i)^2$ . Setting $\partial S/\partial a = 0$ and $\partial S/\partial b = 0$ gives the two normal equations:

\sum Y = na + b\sum X

\sum XY = a\sum X + b\sum X^2

Solving these gives:

b = \frac{n\sum XY - \sum X \sum Y}{n\sum X^2 - (\sum X)^2}, \qquad a = \bar{Y} - b\bar{X}

Here $b = b_{YX}$ is the regression coefficient of $Y$ on $X$ , the average change in $Y$ per unit change in $X$ .

Worked Example (fitting $Y$ on $X$ )

For data $X: 1,2,3,4,5$ and $Y: 2,4,5,4,6$ :

$X$	$Y$	$XY$	$X^2$
1	2	2	1
2	4	8	4
3	5	15	9
4	4	16	16
5	6	30	25
15	21	71	55

$n=5,\ \sum X=15,\ \sum Y=21,\ \sum XY=71,\ \sum X^2=55$

b = \frac{5(71) - (15)(21)}{5(55) - (15)^2} = \frac{355 - 315}{275 - 225} = \frac{40}{50} = 0.8

\bar{X} = 3,\quad \bar{Y} = 4.2,\qquad a = 4.2 - 0.8(3) = 1.8

Regression line: $\;Y = 1.8 + 0.8X$

Estimating Y for a given X

For example, at $X = 6$ :

\hat{Y} = 1.8 + 0.8(6) = 1.8 + 4.8 = 6.6

So the estimated value of $Y$ when $X=6$ is 6.6. (Substitute the actual $X$ asked in the question into the fitted line to obtain the estimate.)

Answer 3

Random Variable

A random variable is a real-valued function $X$ that assigns a numerical value to each outcome (sample point) of a random experiment, i.e. $X: S \to \mathbb{R}$ . It is discrete if it takes a finite or countably infinite set of values (e.g. number of heads in tosses) and continuous if it can take any value in an interval (e.g. height, time).

Probability Mass Function (PMF)

For a discrete random variable $X$ , the PMF is

p(x) = P(X = x)

satisfying (i) $p(x) \ge 0$ for all $x$ , and (ii) $\sum_x p(x) = 1$ . It gives the probability that $X$ equals each specific value.

Probability Density Function (PDF)

For a continuous random variable $X$ , the PDF $f(x)$ satisfies (i) $f(x) \ge 0$ , (ii) $\int_{-\infty}^{\infty} f(x)\,dx = 1$ , and the probability over an interval is

P(a \le X \le b) = \int_a^b f(x)\,dx.

For a continuous variable $P(X=x)=0$ ; only interval probabilities are meaningful.

Mean and Variance (worked example)

Let $X$ have the distribution:

$x$	0	1	2	3
$p(x)$	0.1	0.3	0.4	0.2

Mean (Expectation):

E(X) = \sum x\,p(x) = 0(0.1) + 1(0.3) + 2(0.4) + 3(0.2) = 0.3 + 0.8 + 0.6 = 1.7

$E(X^2)$ :

E(X^2) = \sum x^2 p(x) = 0 + 1(0.3) + 4(0.4) + 9(0.2) = 0.3 + 1.6 + 1.8 = 3.7

Variance:

Var(X) = E(X^2) - [E(X)]^2 = 3.7 - (1.7)^2 = 3.7 - 2.89 = 0.81

Standard deviation $= \sqrt{0.81} = 0.9$ .

For a continuous distribution the same idea is used with integrals: $E(X)=\int x f(x)\,dx$ and $Var(X)=\int x^2 f(x)\,dx - [E(X)]^2$ .

Answer 4

Dispersion is the degree to which the values of a data set are scattered or spread about a central (average) value. It indicates the consistency and homogeneity of the data.

Its measures are of two kinds:

Absolute measures (same units as data): Range, Quartile Deviation, Mean Deviation, Standard Deviation.
Relative measures (unit-free, for comparison): Coefficient of Range, Coefficient of Quartile Deviation, Coefficient of Mean Deviation, and Coefficient of Variation $\left(\dfrac{\sigma}{\bar{x}}\times 100\%\right)$ .

A smaller dispersion means more consistent/uniform data; a larger dispersion means more variability.

Answer 5

The range is the simplest absolute measure of dispersion, defined as the difference between the largest ( $L$ ) and smallest ( $S$ ) values in a data set:

\text{Range} = L - S.

For the data $12, 15, 20, 8, 25$ : largest $L = 25$ , smallest $S = 8$ .

\text{Range} = 25 - 8 = \mathbf{17}.

Answer 6

Regression coefficients are the slopes of the two lines of regression; each measures the average rate of change of one variable per unit change in the other.

Regression coefficient of $Y$ on $X$ : $\;b_{YX} = r\dfrac{\sigma_y}{\sigma_x} = \dfrac{\text{Cov}(x,y)}{\sigma_x^2}$ — change in $Y$ per unit change in $X$ .
Regression coefficient of $X$ on $Y$ : $\;b_{XY} = r\dfrac{\sigma_x}{\sigma_y} = \dfrac{\text{Cov}(x,y)}{\sigma_y^2}$ — change in $X$ per unit change in $Y$ .

Important properties:

$r = \pm\sqrt{b_{YX}\cdot b_{XY}}$ , so the correlation coefficient is the geometric mean of the two regression coefficients.
Both have the same sign as $r$ .
Their product cannot exceed 1: $b_{YX}\cdot b_{XY} \le 1$ .

Answer 7

The multiplication theorem of probability gives the probability of the joint occurrence of two events.

For dependent events $A$ and $B$ (general form):

P(A \cap B) = P(A)\cdot P(B\mid A) = P(B)\cdot P(A\mid B),

where $P(B\mid A)$ is the conditional probability of $B$ given that $A$ has occurred.

For independent events, $P(B\mid A) = P(B)$ , so the theorem reduces to:

P(A \cap B) = P(A)\cdot P(B).

This extends to $n$ events: $P(A_1\cap A_2\cap\dots\cap A_n) = P(A_1)P(A_2\mid A_1)\cdots P(A_n\mid A_1\cap\dots\cap A_{n-1})$ .

Answer 8

A random variable is a real-valued function $X$ that assigns a numerical value to every outcome of a random experiment, i.e. $X: S \to \mathbb{R}$ where $S$ is the sample space.

It is classified as:

Discrete random variable — takes a finite or countably infinite number of distinct values (e.g. number of heads when a coin is tossed three times: $0,1,2,3$ ).
Continuous random variable — can assume any value within an interval (e.g. height, weight, time).

Example: Tossing two coins, let $X$ = number of heads. Then $X$ takes values $0, 1, 2$ with probabilities $\tfrac14, \tfrac12, \tfrac14$ .

Answer 9

Correlation describes the direction and strength of the linear relationship between two variables.

Positive correlation: both variables move in the same direction — as one increases, the other also increases (and as one decreases, the other decreases). The correlation coefficient $r$ lies in $(0, +1]$ . Example: height and weight; income and expenditure.
Negative correlation: the two variables move in opposite directions — as one increases, the other decreases. Here $r$ lies in $[-1, 0)$ . Example: price and quantity demanded; speed and time taken for a fixed distance.

$r = +1$ is perfect positive and $r = -1$ is perfect negative correlation.

Answer 10

The median is the middle value of a data set when the observations are arranged in ascending (or descending) order.

Arrange the data $7, 3, 9, 5, 11$ in ascending order:

3,\ 5,\ 7,\ 9,\ 11.

There are $n = 5$ (odd) observations, so the median is the $\left(\dfrac{n+1}{2}\right)^{\text{th}} = 3^{\text{rd}}$ value.

\text{Median} = \mathbf{7}.

Answer 11

The probability density function (PDF) is the function $f(x)$ that describes the distribution of a continuous random variable $X$ . It satisfies:

$f(x) \ge 0$ for all $x$ (non-negative),
$\displaystyle\int_{-\infty}^{\infty} f(x)\,dx = 1$ (total area under the curve equals 1),
The probability that $X$ lies in an interval is the area under the curve:

P(a \le X \le b) = \int_a^b f(x)\,dx.

Note that for a continuous variable $P(X = x) = 0$ ; the PDF gives probabilities only over intervals, not at single points. The PDF is the derivative of the cumulative distribution function: $f(x) = \dfrac{d}{dx}F(x)$ .

Answer 12

A scatter diagram (scatter plot) is a graphical method of representing bivariate data, in which each pair of observations $(x_i, y_i)$ is plotted as a point on the $XY$ -plane (with $X$ on the horizontal axis and $Y$ on the vertical axis).

It gives a quick visual idea of the nature, direction, and degree of correlation between the two variables:

If points cluster around an upward-sloping line → positive correlation.
If points cluster around a downward-sloping line → negative correlation.
If points lie exactly on a line → perfect correlation ( $r = \pm 1$ ).
If points are scattered randomly with no pattern → little or no correlation.

It is simple to draw and is usually the first step before computing a numerical correlation coefficient.

Level	BSc CSIT (TU)
Stream	Science
Subject	Statistics I (BSc CSIT, STA164)
Year	2075 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions

BSc CSIT (TU) Science Statistics I (BSc CSIT, STA164) Question Paper 2075 Nepal

Section A: Long Answer Questions

Dispersion

Measures of Dispersion

Worked Computation (illustrative frequency distribution)

Comment on Consistency

Method of Least Squares

Worked Example (fitting $Y$ on $X$ )

Estimating Y for a given X

Random Variable

Probability Mass Function (PMF)

Probability Density Function (PDF)

Mean and Variance (worked example)

Section B: Short Answer Questions

Frequently asked questions

Section A: Long Answer Questions

Dispersion

Measures of Dispersion

Worked Computation (illustrative frequency distribution)

Comment on Consistency

Method of Least Squares

Worked Example (fitting YYY on XXX)

Estimating Y for a given X

Random Variable

Probability Mass Function (PMF)

Probability Density Function (PDF)

Mean and Variance (worked example)

Section B: Short Answer Questions

Frequently asked questions

Worked Example (fitting $Y$ on $X$ )