BSc CSIT (TU) Science Statistics II (BSc CSIT, STA210) Question Paper 2079 Nepal

Q: Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2079?

The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 (Regular (annual)) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: Does the Statistics II (BSc CSIT, STA210) 2079 paper come with solutions?

Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Q: How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 paper?

The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.

Q: Is practising this Statistics II (BSc CSIT, STA210) past paper free?

Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.

Question

1Long answer10 marks

Define a probability distribution. Explain the binomial distribution with its mean and variance, and state the conditions under which it is applied.

probabilitydistribution

Answer 1

Probability Distribution

A probability distribution is a mathematical function (or table) that assigns to each possible value of a random variable the probability of its occurrence. For a discrete random variable $X$ taking values $x_1, x_2, \dots$ it is described by the probability mass function $P(X = x_i) = p_i$ with $p_i \ge 0$ and $\sum_i p_i = 1$ ; for a continuous variable it is described by a probability density function $f(x)$ with $f(x) \ge 0$ and $\int_{-\infty}^{\infty} f(x)\,dx = 1$ .

Binomial Distribution

A discrete random variable $X$ follows a binomial distribution if it counts the number of successes in $n$ independent Bernoulli trials, each having success probability $p$ (and failure $q = 1 - p$ ). Its probability mass function is

P(X = x) = \binom{n}{x} p^{x} q^{\,n-x}, \qquad x = 0, 1, 2, \dots, n.

Mean and Variance

\text{Mean } \mu = E(X) = np, \qquad \text{Variance } \sigma^{2} = npq.

Since $q < 1$ , the variance $npq$ is always less than the mean $np$ , i.e. for the binomial distribution mean $>$ variance.

Outline of the mean: writing $X = \sum_{i=1}^{n} X_i$ where each $X_i$ is Bernoulli with $E(X_i)=p$ , by linearity $E(X)=\sum E(X_i)=np$ ; similarly $\operatorname{Var}(X_i)=pq$ and by independence $\operatorname{Var}(X)=npq$ .

Conditions for Application

The binomial distribution applies when:

The number of trials $n$ is fixed and finite.
Each trial has only two outcomes — success or failure (dichotomous).
The trials are independent of one another.
The probability of success $p$ remains constant from trial to trial.

Examples: number of heads in 10 tosses of a coin, number of defective items in a sample of fixed size, number of correct answers in a multiple-choice test.

Answer 2

Normal Distribution

A continuous random variable $X$ follows a normal distribution with mean $\mu$ and standard deviation $\sigma$ if its probability density function is

f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}, \qquad -\infty < x < \infty.

We write $X \sim N(\mu, \sigma^2)$ .

Properties

The curve is bell-shaped and symmetric about the mean $\mu$ .
Mean = Median = Mode = $\mu$ .
It is unimodal; the maximum of $f(x)$ occurs at $x = \mu$ .
The curve is asymptotic to the $x$ -axis on both sides.
Total area under the curve is 1, with half on each side of $\mu$ .
The points of inflection occur at $x = \mu \pm \sigma$ .
Empirical rule: about 68.27%, 95.45% and 99.73% of values lie within $\mu\pm\sigma$ , $\mu\pm 2\sigma$ and $\mu\pm 3\sigma$ respectively.
Quartile deviation $= 0.6745\sigma$ and mean deviation $= 0.7979\sigma$ .
The standard normal variate is $Z = \dfrac{X - \mu}{\sigma} \sim N(0,1)$ .

Numerical: $P(45 < X < 62)$

Given $\mu = 50$ , $\sigma = 10$ . Convert to $Z$ -scores:

Z_1 = \frac{45 - 50}{10} = -0.5, \qquad Z_2 = \frac{62 - 50}{10} = 1.2.

Using the standard normal table ( $\Phi$ = area from $0$ to $z$ ):

P(0 < Z < 0.5) = 0.1915, \qquad P(0 < Z < 1.2) = 0.3849.

Since the two points lie on opposite sides of the mean,

P(45 < X < 62) = P(-0.5 < Z < 1.2) = 0.1915 + 0.3849 = \boxed{0.5764}.

Hence the required probability is about 0.5764 (57.64%).

Answer 3

Hypothesis Testing

Hypothesis testing is a statistical procedure used to decide, on the basis of sample data, whether to accept or reject an assumption (hypothesis) made about a population parameter. It quantifies how strongly the sample evidence contradicts the assumption.

Key Concepts

1. Null Hypothesis ( $H_0$ ): A statement of no effect or no difference that is assumed true until evidence suggests otherwise, e.g. $H_0: \mu = \mu_0$ .

2. Alternative Hypothesis ( $H_1$ ): The statement accepted if $H_0$ is rejected. It may be:

Two-tailed: $H_1: \mu \ne \mu_0$
Right-tailed: $H_1: \mu > \mu_0$
Left-tailed: $H_1: \mu < \mu_0$

3. Level of Significance ( $\alpha$ ): The maximum probability of rejecting $H_0$ when it is actually true (probability of a Type I error). Common values are $0.05$ (5%) and $0.01$ (1%).

4. Types of Errors:

	$H_0$ True	$H_0$ False
Reject $H_0$	Type I error ( $\alpha$ )	Correct decision
Accept $H_0$	Correct decision	Type II error ( $\beta$ )

Type I error ( $\alpha$ ): rejecting a true $H_0$ .
Type II error ( $\beta$ ): accepting a false $H_0$ . The quantity $1 - \beta$ is the power of the test.

5. Critical Region (Rejection Region): The set of values of the test statistic for which $H_0$ is rejected. Its area equals $\alpha$ . The boundary value(s) separating it from the acceptance region are the critical value(s).

Procedure (Steps)

Set up the null hypothesis $H_0$ and alternative hypothesis $H_1$ .
Choose the level of significance $\alpha$ .
Select an appropriate test statistic (Z, t, $\chi^2$ , F) based on sample size and what is known.
Determine the critical value and the critical (rejection) region from statistical tables.
Compute the value of the test statistic from the sample data.
Decision: If the computed value falls in the critical region, reject $H_0$ ; otherwise accept (fail to reject) $H_0$ .
State the conclusion in the context of the problem.

Answer 4

Addition Theorem of Probability

For any two events $A$ and $B$ ,

P(A \cup B) = P(A) + P(B) - P(A \cap B).

If $A$ and $B$ are mutually exclusive ( $A \cap B = \varnothing$ ), this reduces to

P(A \cup B) = P(A) + P(B).

Example: Drawing one card from a pack, $P(\text{King or Queen}) = \frac{4}{52} + \frac{4}{52} = \frac{8}{52} = \frac{2}{13}$ (mutually exclusive). For $P(\text{King or Heart}) = \frac{4}{52} + \frac{13}{52} - \frac{1}{52} = \frac{16}{52} = \frac{4}{13}$ (not mutually exclusive).

Multiplication Theorem of Probability

For any two events $A$ and $B$ ,

P(A \cap B) = P(A)\,P(B \mid A) = P(B)\,P(A \mid B),

where $P(B \mid A)$ is the conditional probability of $B$ given $A$ . If $A$ and $B$ are independent, then $P(B\mid A) = P(B)$ and

P(A \cap B) = P(A)\,P(B).

Example: Two cards drawn one after another without replacement, $P(\text{both Kings}) = \frac{4}{52} \times \frac{3}{51} = \frac{1}{221}$ . Tossing two coins (independent), $P(\text{both heads}) = \frac{1}{2}\times\frac{1}{2} = \frac{1}{4}$ .

Answer 5

Poisson Distribution

The Poisson distribution is a discrete distribution that models the number of times a rare event occurs in a fixed interval of time, space or volume when occurrences are independent and at a constant average rate. A random variable $X$ follows a Poisson distribution with parameter $\lambda$ (the mean number of occurrences) if

P(X = x) = \frac{e^{-\lambda}\,\lambda^{x}}{x!}, \qquad x = 0, 1, 2, \dots; \; \lambda > 0.

It is the limiting form of the binomial distribution when $n \to \infty$ , $p \to 0$ , with $np = \lambda$ finite.

Mean and Variance

\text{Mean} = E(X) = \lambda, \qquad \text{Variance} = \operatorname{Var}(X) = \lambda.

A characteristic feature is that the mean equals the variance $(\mu = \sigma^2 = \lambda)$ .

Applications

Used for counting rare, independent events such as:

Number of telephone calls arriving at an exchange per minute.
Number of printing/typing errors per page of a book.
Number of accidents on a highway per day.
Number of defective items in a large batch.
Number of customers (or network packets/requests) arriving at a server per unit time.
Number of radioactive particles emitted per second.

Answer 6

Random Variable

A random variable is a real-valued function that assigns a numerical value to each outcome (sample point) of a random experiment. It is usually denoted by a capital letter $X$ , and a particular value by a small letter $x$ . For example, if a coin is tossed twice and $X$ denotes the number of heads, then $X$ takes the values $0, 1, 2$ .

Discrete vs Continuous Random Variables

Basis	Discrete Random Variable	Continuous Random Variable
Values	Takes a finite or countably infinite set of isolated values	Takes any value within an interval (uncountable)
Described by	Probability mass function $P(X=x)=p(x)$	Probability density function $f(x)$
Probability of a point	$P(X=x)$ can be non-zero	$P(X=x)=0$ ; only $P(a<X<b)$ is meaningful
Total probability	$\sum_x p(x) = 1$	$\int_{-\infty}^{\infty} f(x)\,dx = 1$
Example	Number of heads in 3 tosses; number of defective bulbs	Height, weight, temperature, time of a person/object

Examples

Discrete: number of students present in a class, number of cars passing a point in an hour.
Continuous: the exact weight of a person, the lifetime of an electric bulb, daily rainfall.

Answer 7

Mathematical Expectation

The mathematical expectation (or expected value) of a random variable is the long-run average value it takes, weighted by probabilities. For a discrete random variable $X$ with pmf $p(x)$ ,

E(X) = \sum_x x\,p(x),

and for a continuous random variable with pdf $f(x)$ ,

E(X) = \int_{-\infty}^{\infty} x\,f(x)\,dx.

Properties (with proofs)

1. Expectation of a constant. If $k$ is a constant, $E(k) = k$ .

Proof: $E(k) = \sum_x k\,p(x) = k\sum_x p(x) = k\cdot 1 = k.$

2. Multiplication by a constant. $E(kX) = k\,E(X)$ .

Proof: $E(kX) = \sum_x kx\,p(x) = k\sum_x x\,p(x) = k\,E(X).$

3. Addition (linearity). $E(X + Y) = E(X) + E(Y)$ for any random variables.

Proof (discrete): $E(X+Y) = \sum_x\sum_y (x+y)p(x,y) = \sum_x\sum_y x\,p(x,y) + \sum_x\sum_y y\,p(x,y) = E(X) + E(Y).$

Combining 2 and 3: $E(aX + b) = aE(X) + b$ .

4. Product for independent variables. If $X$ and $Y$ are independent, $E(XY) = E(X)\,E(Y)$ .

Proof: For independent variables $p(x,y) = p(x)p(y)$ , so $E(XY) = \sum_x\sum_y xy\,p(x)p(y) = \left(\sum_x x\,p(x)\right)\left(\sum_y y\,p(y)\right) = E(X)E(Y).$

Answer 8

t-test for Difference of Two Sample Means

This test checks whether two independent small samples (sizes $n_1, n_2 < 30$ ) drawn from normal populations with equal (unknown) variances have significantly different means.

Hypotheses: $H_0: \mu_1 = \mu_2$ (no difference) against $H_1: \mu_1 \ne \mu_2$ (two-tailed) or one-sided alternative.

Test statistic:

t = \frac{\bar{x}_1 - \bar{x}_2}{S\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}},

where the pooled standard deviation is

S^2 = \frac{\sum (x_1 - \bar{x}_1)^2 + \sum (x_2 - \bar{x}_2)^2}{n_1 + n_2 - 2} = \frac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2 - 2}.

The statistic follows the t-distribution with $\nu = n_1 + n_2 - 2$ degrees of freedom.

Decision rule: Compute $|t|$ and compare with the table value $t_{\alpha,\nu}$ at significance level $\alpha$ .

If $|t| \le t_{\alpha,\nu}$ : accept $H_0$ — the difference is not significant.
If $|t| > t_{\alpha,\nu}$ : reject $H_0$ — the difference is significant.

Assumptions: the parent populations are normal, the two samples are independent, and the population variances are equal (homogeneous).

Answer 9

z-test for a Single Mean (Large Sample)

When the sample size is large ( $n \ge 30$ ), the sampling distribution of the mean is approximately normal, so a z-test is used to test whether a sample mean $\bar{x}$ differs significantly from a hypothesised population mean $\mu$ .

Hypotheses: $H_0: \mu = \mu_0$ versus $H_1: \mu \ne \mu_0$ (two-tailed).

Test statistic:

Z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}},

where $\sigma$ is the population standard deviation (if unknown, the sample s.d. $s$ is used since $n$ is large). $Z$ follows the standard normal distribution.

Decision rule (5% level): reject $H_0$ if $|Z| > 1.96$ (for 1%, if $|Z| > 2.58$ ); otherwise accept $H_0$ .

Example

A sample of $n = 64$ items has mean $\bar{x} = 52$ , drawn from a population with mean $\mu_0 = 50$ and $\sigma = 8$ . Test at 5% whether the sample mean differs from 50.

Z = \frac{52 - 50}{8/\sqrt{64}} = \frac{2}{8/8} = \frac{2}{1} = 2.0.

Since $|Z| = 2.0 > 1.96$ , we reject $H_0$ : the sample mean differs significantly from 50 at the 5% level of significance.

Answer 10

Karl Pearson's Coefficient of Correlation

Correlation measures the degree and direction of the linear relationship between two variables $X$ and $Y$ . Karl Pearson's coefficient of correlation (product-moment correlation), denoted $r$ , is defined as

r = \frac{\operatorname{Cov}(X,Y)}{\sigma_X\,\sigma_Y} = \frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2}\,\sqrt{\sum (y-\bar{y})^2}}.

An equivalent computational form is

r = \frac{n\sum xy - \sum x \sum y}{\sqrt{n\sum x^2 - (\sum x)^2}\,\sqrt{n\sum y^2 - (\sum y)^2}}.

Properties

Range: $r$ always lies between $-1$ and $+1$ , i.e. $-1 \le r \le +1$ .
Interpretation: $r = +1$ perfect positive, $r = -1$ perfect negative, $r = 0$ no linear correlation.
Independent of origin and scale (units): $r$ is unchanged if each value is shifted or multiplied by a constant; it is a pure number.
Symmetric: $r_{XY} = r_{YX}$ .
It is the geometric mean of the two regression coefficients: $r = \pm\sqrt{b_{xy}\,b_{yx}}$ , taking the sign of the regression coefficients.
If $X$ and $Y$ are independent then $r = 0$ (the converse is not necessarily true).

Answer 11

Regression Coefficients

In linear regression between two variables $X$ and $Y$ , the regression coefficient is the slope of the line of regression — it measures the average change in the dependent variable per unit change in the independent variable.

Regression coefficient of $Y$ on $X$ :

b_{yx} = r\,\frac{\sigma_y}{\sigma_x} = \frac{\operatorname{Cov}(X,Y)}{\sigma_x^2}.

Regression coefficient of $X$ on $Y$ :

b_{xy} = r\,\frac{\sigma_x}{\sigma_y} = \frac{\operatorname{Cov}(X,Y)}{\sigma_y^2}.

Properties

Correlation as geometric mean: the correlation coefficient is the geometric mean of the two regression coefficients,

r = \pm\sqrt{b_{xy}\cdot b_{yx}}.

Same sign: both regression coefficients have the same sign, which is also the sign of $r$ . If one is positive both are positive, and so is $r$ .
Product $\le 1$ : since $-1 \le r \le 1$ , $\;b_{xy}\cdot b_{yx} = r^2 \le 1$ . Hence both coefficients cannot exceed unity simultaneously; if one is greater than 1, the other must be less than 1.
Independent of change of origin but not of scale.
The arithmetic mean of the two regression coefficients is greater than or equal to $r$ , i.e. $\dfrac{b_{xy}+b_{yx}}{2} \ge r$ .
The two regression lines intersect at the point $(\bar{x}, \bar{y})$ .

Answer 12

Sampling Distribution

When all possible samples of a fixed size $n$ are drawn from a population, a statistic (such as the sample mean $\bar{x}$ , proportion $p$ , or variance $s^2$ ) varies from sample to sample. The probability distribution of all possible values of that statistic is called its sampling distribution.

For example, the sampling distribution of the mean lists every possible $\bar{x}$ with its probability. By the Central Limit Theorem, for large $n$ the sampling distribution of $\bar{x}$ is approximately normal with mean $\mu$ and standard deviation $\sigma/\sqrt{n}$ , regardless of the shape of the parent population.

Standard Error (S.E.)

The standard error of a statistic is the standard deviation of its sampling distribution. It measures the variability of the statistic due to sampling and indicates how precisely the sample statistic estimates the population parameter.

Common standard errors:

Mean: $\;\text{S.E.}(\bar{x}) = \dfrac{\sigma}{\sqrt{n}}$
Proportion: $\;\text{S.E.}(p) = \sqrt{\dfrac{PQ}{n}}$ , where $Q = 1-P$ .

Uses: the standard error is used (i) to test the significance of a statistic (test statistic = (estimate − parameter)/S.E.), and (ii) to construct confidence intervals. A smaller standard error (larger $n$ ) means a more reliable estimate.

Level	BSc CSIT (TU)
Stream	Science
Subject	Statistics II (BSc CSIT, STA210)
Year	2079 BS
Exam session	Regular (annual)
Full marks	60
Time allowed	180 minutes
Questions	12, all with step-by-step solutions

BSc CSIT (TU) Science Statistics II (BSc CSIT, STA210) Question Paper 2079 Nepal

Section A: Long Answer Questions

Probability Distribution

Binomial Distribution

Conditions for Application

Normal Distribution

Properties

Numerical: $P(45 < X < 62)$

Hypothesis Testing

Key Concepts

Procedure (Steps)

Section B: Short Answer Questions

Addition Theorem of Probability

Multiplication Theorem of Probability

Poisson Distribution

Mean and Variance

Applications

Random Variable

Discrete vs Continuous Random Variables

Mathematical Expectation

Properties (with proofs)

t-test for Difference of Two Sample Means

z-test for a Single Mean (Large Sample)

Example

Karl Pearson's Coefficient of Correlation

Properties

Regression Coefficients

Properties

Sampling Distribution

Standard Error (S.E.)

Frequently asked questions

Section A: Long Answer Questions

Probability Distribution

Binomial Distribution

Conditions for Application

Normal Distribution

Properties

Numerical: P(45<X<62)P(45 < X < 62)P(45<X<62)

Hypothesis Testing

Key Concepts

Procedure (Steps)

Section B: Short Answer Questions

Addition Theorem of Probability

Multiplication Theorem of Probability

Poisson Distribution

Mean and Variance

Applications

Random Variable

Discrete vs Continuous Random Variables

Mathematical Expectation

Properties (with proofs)

t-test for Difference of Two Sample Means

z-test for a Single Mean (Large Sample)

Example

Karl Pearson's Coefficient of Correlation

Properties

Regression Coefficients

Properties

Sampling Distribution

Standard Error (S.E.)

Frequently asked questions

Numerical: $P(45 < X < 62)$