BSc CSIT (TU) Science Statistics II (BSc CSIT, STA210) Question Paper 2079 Nepal
This is the official BSc CSIT (TU) (Science stream) Statistics II (BSc CSIT, STA210) question paper for 2079, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Statistics II (BSc CSIT, STA210) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Statistics II (BSc CSIT, STA210) exam or solving previous years' question papers, this 2079 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Define a probability distribution. Explain the binomial distribution with its mean and variance, and state the conditions under which it is applied.
Probability Distribution
A probability distribution is a mathematical function (or table) that assigns to each possible value of a random variable the probability of its occurrence. For a discrete random variable taking values it is described by the probability mass function with and ; for a continuous variable it is described by a probability density function with and .
Binomial Distribution
A discrete random variable follows a binomial distribution if it counts the number of successes in independent Bernoulli trials, each having success probability (and failure ). Its probability mass function is
Mean and Variance
Since , the variance is always less than the mean , i.e. for the binomial distribution mean variance.
Outline of the mean: writing where each is Bernoulli with , by linearity ; similarly and by independence .
Conditions for Application
The binomial distribution applies when:
- The number of trials is fixed and finite.
- Each trial has only two outcomes — success or failure (dichotomous).
- The trials are independent of one another.
- The probability of success remains constant from trial to trial.
Examples: number of heads in 10 tosses of a coin, number of defective items in a sample of fixed size, number of correct answers in a multiple-choice test.
Explain the normal distribution and its properties. The mean of a normal distribution is 50 and standard deviation is 10; find the probability that a value lies between 45 and 62.
Normal Distribution
A continuous random variable follows a normal distribution with mean and standard deviation if its probability density function is
We write .
Properties
- The curve is bell-shaped and symmetric about the mean .
- Mean = Median = Mode = .
- It is unimodal; the maximum of occurs at .
- The curve is asymptotic to the -axis on both sides.
- Total area under the curve is 1, with half on each side of .
- The points of inflection occur at .
- Empirical rule: about 68.27%, 95.45% and 99.73% of values lie within , and respectively.
- Quartile deviation and mean deviation .
- The standard normal variate is .
Numerical:
Given , . Convert to -scores:
Using the standard normal table ( = area from to ):
Since the two points lie on opposite sides of the mean,
Hence the required probability is about 0.5764 (57.64%).
What is hypothesis testing? Explain the procedure of testing of hypothesis including null and alternative hypotheses, level of significance, types of errors and the critical region.
Hypothesis Testing
Hypothesis testing is a statistical procedure used to decide, on the basis of sample data, whether to accept or reject an assumption (hypothesis) made about a population parameter. It quantifies how strongly the sample evidence contradicts the assumption.
Key Concepts
1. Null Hypothesis (): A statement of no effect or no difference that is assumed true until evidence suggests otherwise, e.g. .
2. Alternative Hypothesis (): The statement accepted if is rejected. It may be:
- Two-tailed:
- Right-tailed:
- Left-tailed:
3. Level of Significance (): The maximum probability of rejecting when it is actually true (probability of a Type I error). Common values are (5%) and (1%).
4. Types of Errors:
| True | False | |
|---|---|---|
| Reject | Type I error () | Correct decision |
| Accept | Correct decision | Type II error () |
- Type I error (): rejecting a true .
- Type II error (): accepting a false . The quantity is the power of the test.
5. Critical Region (Rejection Region): The set of values of the test statistic for which is rejected. Its area equals . The boundary value(s) separating it from the acceptance region are the critical value(s).
Procedure (Steps)
- Set up the null hypothesis and alternative hypothesis .
- Choose the level of significance .
- Select an appropriate test statistic (Z, t, , F) based on sample size and what is known.
- Determine the critical value and the critical (rejection) region from statistical tables.
- Compute the value of the test statistic from the sample data.
- Decision: If the computed value falls in the critical region, reject ; otherwise accept (fail to reject) .
- State the conclusion in the context of the problem.
Section B: Short Answer Questions
Attempt any EIGHT questions.
State and explain the addition and multiplication theorems of probability with examples.
Addition Theorem of Probability
For any two events and ,
If and are mutually exclusive (), this reduces to
Example: Drawing one card from a pack, (mutually exclusive). For (not mutually exclusive).
Multiplication Theorem of Probability
For any two events and ,
where is the conditional probability of given . If and are independent, then and
Example: Two cards drawn one after another without replacement, . Tossing two coins (independent), .
Explain the Poisson distribution with its mean and variance and state its applications.
Poisson Distribution
The Poisson distribution is a discrete distribution that models the number of times a rare event occurs in a fixed interval of time, space or volume when occurrences are independent and at a constant average rate. A random variable follows a Poisson distribution with parameter (the mean number of occurrences) if
It is the limiting form of the binomial distribution when , , with finite.
Mean and Variance
A characteristic feature is that the mean equals the variance .
Applications
Used for counting rare, independent events such as:
- Number of telephone calls arriving at an exchange per minute.
- Number of printing/typing errors per page of a book.
- Number of accidents on a highway per day.
- Number of defective items in a large batch.
- Number of customers (or network packets/requests) arriving at a server per unit time.
- Number of radioactive particles emitted per second.
Define a random variable. Differentiate between discrete and continuous random variables with examples.
Random Variable
A random variable is a real-valued function that assigns a numerical value to each outcome (sample point) of a random experiment. It is usually denoted by a capital letter , and a particular value by a small letter . For example, if a coin is tossed twice and denotes the number of heads, then takes the values .
Discrete vs Continuous Random Variables
| Basis | Discrete Random Variable | Continuous Random Variable |
|---|---|---|
| Values | Takes a finite or countably infinite set of isolated values | Takes any value within an interval (uncountable) |
| Described by | Probability mass function | Probability density function |
| Probability of a point | can be non-zero | ; only is meaningful |
| Total probability | ||
| Example | Number of heads in 3 tosses; number of defective bulbs | Height, weight, temperature, time of a person/object |
Examples
- Discrete: number of students present in a class, number of cars passing a point in an hour.
- Continuous: the exact weight of a person, the lifetime of an electric bulb, daily rainfall.
Define mathematical expectation. State and prove its properties.
Mathematical Expectation
The mathematical expectation (or expected value) of a random variable is the long-run average value it takes, weighted by probabilities. For a discrete random variable with pmf ,
and for a continuous random variable with pdf ,
Properties (with proofs)
1. Expectation of a constant. If is a constant, .
Proof:
2. Multiplication by a constant. .
Proof:
3. Addition (linearity). for any random variables.
Proof (discrete):
Combining 2 and 3: .
4. Product for independent variables. If and are independent, .
Proof: For independent variables , so
Explain the t-test for testing the significance of the difference between two sample means.
t-test for Difference of Two Sample Means
This test checks whether two independent small samples (sizes ) drawn from normal populations with equal (unknown) variances have significantly different means.
Hypotheses: (no difference) against (two-tailed) or one-sided alternative.
Test statistic:
where the pooled standard deviation is
The statistic follows the t-distribution with degrees of freedom.
Decision rule: Compute and compare with the table value at significance level .
- If : accept — the difference is not significant.
- If : reject — the difference is significant.
Assumptions: the parent populations are normal, the two samples are independent, and the population variances are equal (homogeneous).
Explain the z-test for a large sample test of a single mean with an example.
z-test for a Single Mean (Large Sample)
When the sample size is large (), the sampling distribution of the mean is approximately normal, so a z-test is used to test whether a sample mean differs significantly from a hypothesised population mean .
Hypotheses: versus (two-tailed).
Test statistic:
where is the population standard deviation (if unknown, the sample s.d. is used since is large). follows the standard normal distribution.
Decision rule (5% level): reject if (for 1%, if ); otherwise accept .
Example
A sample of items has mean , drawn from a population with mean and . Test at 5% whether the sample mean differs from 50.
Since , we reject : the sample mean differs significantly from 50 at the 5% level of significance.
Define Karl Pearson's coefficient of correlation and state its properties.
Karl Pearson's Coefficient of Correlation
Correlation measures the degree and direction of the linear relationship between two variables and . Karl Pearson's coefficient of correlation (product-moment correlation), denoted , is defined as
An equivalent computational form is
Properties
- Range: always lies between and , i.e. .
- Interpretation: perfect positive, perfect negative, no linear correlation.
- Independent of origin and scale (units): is unchanged if each value is shifted or multiplied by a constant; it is a pure number.
- Symmetric: .
- It is the geometric mean of the two regression coefficients: , taking the sign of the regression coefficients.
- If and are independent then (the converse is not necessarily true).
What are regression coefficients? State their properties.
Regression Coefficients
In linear regression between two variables and , the regression coefficient is the slope of the line of regression — it measures the average change in the dependent variable per unit change in the independent variable.
- Regression coefficient of on :
- Regression coefficient of on :
Properties
- Correlation as geometric mean: the correlation coefficient is the geometric mean of the two regression coefficients,
- Same sign: both regression coefficients have the same sign, which is also the sign of . If one is positive both are positive, and so is .
- Product : since , . Hence both coefficients cannot exceed unity simultaneously; if one is greater than 1, the other must be less than 1.
- Independent of change of origin but not of scale.
- The arithmetic mean of the two regression coefficients is greater than or equal to , i.e. .
- The two regression lines intersect at the point .
Explain the concept of sampling distribution and standard error.
Sampling Distribution
When all possible samples of a fixed size are drawn from a population, a statistic (such as the sample mean , proportion , or variance ) varies from sample to sample. The probability distribution of all possible values of that statistic is called its sampling distribution.
For example, the sampling distribution of the mean lists every possible with its probability. By the Central Limit Theorem, for large the sampling distribution of is approximately normal with mean and standard deviation , regardless of the shape of the parent population.
Standard Error (S.E.)
The standard error of a statistic is the standard deviation of its sampling distribution. It measures the variability of the statistic due to sampling and indicates how precisely the sample statistic estimates the population parameter.
Common standard errors:
- Mean:
- Proportion: , where .
Uses: the standard error is used (i) to test the significance of a statistic (test statistic = (estimate − parameter)/S.E.), and (ii) to construct confidence intervals. A smaller standard error (larger ) means a more reliable estimate.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2079?
- The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Statistics II (BSc CSIT, STA210) 2079 paper come with solutions?
- Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 paper?
- The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2079 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Statistics II (BSc CSIT, STA210) past paper free?
- Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.