BSc CSIT (TU) Science Statistics II (BSc CSIT, STA210) Question Paper 2080 Nepal
This is the official BSc CSIT (TU) (Science stream) Statistics II (BSc CSIT, STA210) question paper for 2080, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Statistics II (BSc CSIT, STA210) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Statistics II (BSc CSIT, STA210) exam or solving previous years' question papers, this 2080 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
What is hypothesis testing? Explain the procedure of testing of hypothesis including null and alternative hypotheses, level of significance, types of errors and the critical region.
Hypothesis Testing
Hypothesis testing is a statistical procedure used to decide, on the basis of sample data, whether to accept or reject a claim (hypothesis) made about a population parameter. It provides an objective rule for choosing between two competing statements about the population.
1. Null and Alternative Hypotheses
- Null hypothesis (): A statement of no difference or no effect, assumed true until evidence contradicts it. e.g. .
- Alternative hypothesis (): The statement accepted if is rejected. It may be:
- Two-tailed:
- One-tailed: or
2. Level of Significance ()
The maximum probability of rejecting when it is actually true. Common values are (5%) and (1%). It fixes the size of the critical region.
3. Types of Errors
| True | False | |
|---|---|---|
| Reject | Type I error () | Correct decision (power ) |
| Accept | Correct decision | Type II error () |
- Type I error: Rejecting a true ; probability .
- Type II error: Accepting a false ; probability .
4. Critical Region
The critical (rejection) region is the set of values of the test statistic for which is rejected. Its area equals . The complementary region is the acceptance region. The boundary value is the critical value (e.g. for a two-tailed -test at 5%).
5. General Procedure
- Set up and .
- Choose the level of significance .
- Select an appropriate test statistic (z, t, , F) and find its sampling distribution under .
- Determine the critical region / critical value.
- Compute the test statistic from sample data.
- Decision: If the computed value falls in the critical region, reject ; otherwise do not reject . State the conclusion in context.
Define correlation and regression. Explain the method of fitting two regression lines and the relationship between correlation coefficient and regression coefficients.
Correlation and Regression
Correlation measures the degree and direction of the linear relationship between two variables and . It is dimensionless and lies between and .
Regression measures the average functional dependence of one variable on another and is used for prediction. It gives the equation of the line that best estimates one variable from the other.
Fitting the Two Regression Lines (Least Squares)
The two lines are obtained by minimising the sum of squared deviations.
(a) Regression of on (predict from ):
(b) Regression of on (predict from ):
Both lines pass through the mean point , where they intersect.
Relationship between and the Regression Coefficients
The correlation coefficient is the geometric mean of the two regression coefficients:
Key points:
- The sign of equals the (common) sign of and .
- Since , both coefficients cannot exceed 1 in magnitude simultaneously.
- If , both regression coefficients are 0 and the lines are perpendicular; if the two lines coincide.
What is sampling? Explain different methods of probability and non-probability sampling with their merits and demerits.
Sampling
Sampling is the process of selecting a representative subset (sample) from a population so that conclusions about the whole population can be drawn from the sample, saving time, cost and effort.
A. Probability (Random) Sampling
Every unit has a known, non-zero chance of selection.
- Simple Random Sampling — every unit has an equal chance (lottery / random numbers).
- Merits: unbiased, simple, easy to analyse.
- Demerits: needs complete sampling frame; may miss small subgroups.
- Stratified Random Sampling — population divided into homogeneous strata, samples drawn from each.
- Merits: high precision, ensures representation of all groups.
- Demerits: requires prior knowledge of strata; complex.
- Systematic Sampling — select every unit after a random start ().
- Merits: simple, quick, spreads sample evenly.
- Demerits: biased if there is periodicity in the list.
- Cluster / Multistage Sampling — population divided into clusters; whole clusters selected.
- Merits: economical for wide geographic areas, no full frame needed.
- Demerits: higher sampling error if clusters are heterogeneous within.
B. Non-Probability Sampling
Selection probability is unknown; relies on judgement/convenience.
- Convenience Sampling — units that are easiest to reach.
- Merits: fast, cheap. Demerits: highly biased, not generalisable.
- Judgement (Purposive) Sampling — expert chooses typical units.
- Merits: useful for small specialised studies. Demerits: subjective, biased.
- Quota Sampling — fixed quotas filled by interviewer choice.
- Merits: quick, ensures group representation. Demerits: selection bias.
- Snowball Sampling — existing subjects recruit others.
- Merits: good for hidden/rare populations. Demerits: strong bias, no frame.
Conclusion: Probability methods allow estimation of sampling error and valid inference; non-probability methods are cheaper and faster but cannot reliably generalise to the population.
Section B: Short Answer Questions
Attempt any EIGHT questions.
Define mathematical expectation. State and prove its properties.
Mathematical Expectation
The mathematical expectation (mean) of a random variable is the long-run average value, weighted by probabilities.
Properties (with proof)
1. Expectation of a constant. . Proof:
2. Constant multiplier. . Proof:
3. Linearity (addition). . Proof:
4. Addition theorem. (always). Proof:
5. Multiplication theorem. If and are independent, . Proof: For independence , so
Explain the t-test for testing the significance of the difference between two sample means.
t-Test for Difference of Two Sample Means
Used for small independent samples () from two normal populations with unknown but equal variances.
Hypotheses
Test Statistic
where the pooled standard deviation is
Degrees of Freedom
Decision Rule
Compare with the table value :
- If → do not reject (means not significantly different).
- If → reject (significant difference).
Assumptions: samples independent, drawn from normal populations, with equal population variances.
Explain the z-test for a large sample test of a single mean with an example.
z-Test for a Single Mean (Large Sample)
Used when (or population known) to test whether the sample mean differs from a stated population mean .
Hypotheses
Test Statistic
If is unknown, the sample s.d. is used. Critical value at 5% (two-tailed) is .
Decision Rule
Reject if (5%); otherwise do not reject.
Example
A sample of bulbs has mean life hr with hr. Test whether the mean life is hr.
- , .
- → reject .
Conclusion: The mean life differs significantly from 1600 hours at the 5% level.
Define Karl Pearson's coefficient of correlation and state its properties.
Karl Pearson's Coefficient of Correlation
It is a numerical measure of the degree and direction of the linear relationship between two variables and , defined as the ratio of their covariance to the product of their standard deviations:
or equivalently
Properties
- Range: .
- Sign: → positive (direct) correlation; → negative (inverse); → no linear correlation.
- Perfect correlation: means all points lie exactly on a straight line.
- Independent of origin and scale: is unchanged by linear transformations , .
- Symmetry: .
- Geometric mean of regression coefficients: .
- It is a pure number (unit-free).
What are regression coefficients? State their properties.
Regression Coefficients
The regression coefficient is the slope of a regression line — it gives the average change in the dependent variable per unit change in the independent variable.
- Regression of on :
- Regression of on :
Properties
- Correlation as geometric mean: .
- Same sign: both and have the same sign, which is also the sign of .
- Product condition: , so both coefficients cannot exceed unity at the same time (if one , the other ).
- Independent of origin but not of scale: changing scale changes the coefficients.
- Arithmetic mean property: the arithmetic mean of the two regression coefficients is , i.e. .
Explain the concept of sampling distribution and standard error.
Sampling Distribution and Standard Error
Sampling Distribution
If all possible samples of a fixed size are drawn from a population and a statistic (e.g. the sample mean , proportion , or variance ) is computed for each, the probability distribution of that statistic over all such samples is called its sampling distribution.
Example: the sampling distribution of has mean and, by the Central Limit Theorem, is approximately normal for large .
Standard Error (S.E.)
The standard error is the standard deviation of the sampling distribution of a statistic. It measures the precision / variability of the estimate:
Uses of Standard Error
- It measures the reliability of a sample estimate (smaller S.E. = more precise).
- It forms the denominator of test statistics ().
- It is used to construct confidence intervals.
- S.E. decreases as the sample size increases (proportional to ).
Explain how to construct a confidence interval for a population mean.
Confidence Interval for a Population Mean
A confidence interval (CI) is a range of values, computed from a sample, that is likely to contain the unknown population mean with a stated probability called the confidence level , e.g. 95%.
Case 1: Large Sample () or known
Using the normal distribution:
For 95% confidence, ; for 99%, . If is unknown, replace it with the sample s.d. .
Case 2: Small Sample (), unknown
Using the -distribution with degrees of freedom:
Steps
- Compute sample mean and standard error (or ).
- Choose confidence level and find or .
- Compute the margin of error
- The interval is .
Interpretation: We are 95% confident that the population mean lies within this interval; a wider level or smaller gives a wider interval.
Explain the F-test for the equality of two population variances.
F-Test for Equality of Two Population Variances
Used to test whether two independent normal populations have equal variances, based on their sample variances.
Hypotheses
Test Statistic
where the unbiased sample variances are
Degrees of Freedom
(numerator), (denominator).
Decision Rule
Compare with the table value :
- If → do not reject (variances equal).
- If → reject (variances differ significantly).
Assumptions: both samples are independent and drawn from normal populations. (The F-test is also the basis of ANOVA.)
Define index numbers and explain Laspeyres' and Paasche's price index methods.
Index Numbers
An index number is a statistical measure that expresses the relative change in a variable (price, quantity, value) of a group of related items between a current period and a fixed base period, usually expressed as a percentage. Base period value .
Notation: = base-period price and quantity; = current-period price and quantity.
Laspeyres' Price Index (base-year weights)
Uses base-year quantities as weights:
Advantage: requires only base-year quantities, easy to update. Limitation: tends to over-estimate the rise in prices (ignores consumers switching away from costlier goods).
Paasche's Price Index (current-year weights)
Uses current-year quantities as weights:
Advantage: reflects current consumption pattern. Limitation: needs fresh current-year quantities each period; tends to under-estimate the price rise.
Note: Fisher's Ideal Index is the geometric mean of the two: .
Frequently asked questions
- Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2080?
- The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2080 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Statistics II (BSc CSIT, STA210) 2080 paper come with solutions?
- Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2080 paper?
- The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2080 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Statistics II (BSc CSIT, STA210) past paper free?
- Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.