BSc CSIT (TU) Science Statistics II (BSc CSIT, STA210) Question Paper 2078 Nepal
This is the official BSc CSIT (TU) (Science stream) Statistics II (BSc CSIT, STA210) question paper for 2078, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Statistics II (BSc CSIT, STA210) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Statistics II (BSc CSIT, STA210) exam or solving previous years' question papers, this 2078 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain the theory of estimation. Differentiate between point estimation and interval estimation and explain the properties of a good estimator.
Theory of Estimation
Estimation is the branch of statistical inference concerned with inferring the value of an unknown population parameter (e.g. mean , variance , proportion ) from a sample statistic computed from sample data. The sample statistic used for this purpose is called an estimator, and a particular numerical value it takes is an estimate.
There are two broad approaches:
Point Estimation vs Interval Estimation
| Basis | Point Estimation | Interval Estimation |
|---|---|---|
| Meaning | Gives a single value as the estimate of the parameter | Gives a range (interval) within which the parameter is expected to lie |
| Form | (e.g. for ) | , e.g. |
| Probability statement | No probability/confidence attached | Associated with a confidence level , e.g. 95% |
| Reliability | Does not indicate precision or error | Indicates precision via interval width and margin of error |
| Example | estimates | lies in with 95% confidence |
Properties of a Good Estimator
An estimator of a parameter is considered good if it possesses the following properties:
-
Unbiasedness: is unbiased if its expected value equals the parameter, i.e. . For example, the sample mean is an unbiased estimator of .
-
Consistency: is consistent if it converges in probability to as the sample size increases, i.e. as . Larger samples give estimates closer to the true value.
-
Efficiency: Among all unbiased estimators, the one with the minimum variance is the most efficient. If and are both unbiased and , then is more efficient.
-
Sufficiency: An estimator is sufficient if it utilizes all the information in the sample relevant to the parameter, so that no other statistic can add further information about .
A good estimator should ideally be unbiased, consistent, efficient and sufficient.
Explain the chi-square test. Describe its applications in testing goodness of fit and independence of attributes.
The Chi-Square () Test
The chi-square test is a non-parametric test based on the chi-square distribution, used to test hypotheses about categorical (attribute) data by comparing observed frequencies with expected frequencies .
The test statistic is:
which follows a distribution with appropriate degrees of freedom. A large value of indicates a large discrepancy between observed and expected frequencies, leading to rejection of .
Conditions: observations independent, sample reasonably large, total frequency , and each expected frequency (otherwise pool cells).
Application 1: Test of Goodness of Fit
Used to test whether an observed frequency distribution fits a theoretical/expected distribution (e.g. uniform, Binomial, Poisson, Normal).
- : The observed data agree with the assumed theoretical distribution.
- : The observed data do not fit the assumed distribution.
- Compute expected frequencies from the theoretical distribution.
- Statistic: .
- Degrees of freedom , where = number of classes and = number of parameters estimated from the data.
- If at level , reject (poor fit).
Application 2: Test of Independence of Attributes
Used with a contingency table to test whether two attributes (e.g. gender and preference) are independent or associated.
- : The two attributes are independent.
- : The two attributes are associated (dependent).
- For an table, expected frequency of a cell:
- Statistic: .
- Degrees of freedom .
- If , reject and conclude the attributes are associated.
Other Uses
- Test for a specified population variance .
- Test of homogeneity (whether several populations have the same distribution).
Define a probability distribution. Explain the binomial distribution with its mean and variance, and state the conditions under which it is applied.
Probability Distribution
A probability distribution is a description that assigns a probability to each possible value of a random variable. For a discrete random variable taking values , the function is the probability mass function, satisfying:
For a continuous random variable, it is described by a probability density function with and .
Binomial Distribution
A discrete random variable follows a binomial distribution if it represents the number of successes in independent trials, each with probability of success (and failure ). Its probability mass function is:
where .
Mean and Variance
- Mean:
- Variance:
- Standard deviation:
Since , the variance is always less than the mean .
Conditions (Assumptions) for Application
The binomial distribution applies when:
- The experiment consists of a fixed number of trials.
- Each trial has only two mutually exclusive outcomes — success and failure.
- The trials are independent of one another.
- The probability of success remains constant from trial to trial.
Example
Tossing a fair coin 5 times and counting the number of heads, where , .
Section B: Short Answer Questions
Attempt any EIGHT questions.
Explain how to construct a confidence interval for a population mean.
Confidence Interval for a Population Mean
A confidence interval (CI) for the population mean is a range of values, computed from sample data, that is expected to contain with a stated confidence level , e.g. 95%.
Case 1: known (or large sample, ) — use Z
Case 2: unknown and small sample () — use t
where is the sample standard deviation and degrees of freedom .
Steps to Construct
- Compute the sample mean (and if unknown).
- Choose the confidence level and find the critical value (e.g. for 95%) or .
- Compute the standard error (or ).
- Compute the margin of error .
- The interval is .
Interpretation
For a 95% CI, if many samples were drawn and an interval computed for each, about 95% of those intervals would contain the true mean .
Explain the F-test for the equality of two population variances.
F-Test for Equality of Two Population Variances
The F-test is used to test whether two independent normal populations have equal variances. It is based on the ratio of two independent sample variances, which follows the F-distribution.
Hypotheses
- (the two population variances are equal)
- (they are unequal)
Test Statistic
From two independent random samples of sizes and with sample variances and :
The larger variance is placed in the numerator so that . The unbiased sample variance is
Degrees of Freedom
(numerator), (denominator).
Decision Rule
Compare with the tabulated :
- If , accept — variances are equal.
- If , reject — variances differ significantly.
Assumptions
Both populations are normal, and the two samples are independent and drawn randomly.
Define index numbers and explain Laspeyres' and Paasche's price index methods.
Index Numbers
An index number is a statistical measure that expresses the relative change in a variable or a group of related variables (such as prices, quantities, or value) over time, place, or other characteristic, with respect to a chosen base period (taken as 100). It is often called an economic barometer.
A price index measures the average change in the prices of a basket of commodities between the base period (0) and the current period (1).
Laspeyres' Price Index
Uses base-year quantities as weights:
It answers: what is the cost now of the base-year basket compared with its base-year cost? It tends to overstate price rises because it ignores substitution away from goods that became dearer.
Paasche's Price Index
Uses current-year quantities as weights:
It answers: what would the current basket have cost in the base year versus now? It tends to understate price rises.
Comparison
| Basis | Laspeyres | Paasche |
|---|---|---|
| Weights | Base-year quantities | Current-year quantities |
| Bias | Upward (overestimates) | Downward (underestimates) |
| Data needed | Only base-year quantities | Current-year quantities each period |
Fisher's ideal index is the geometric mean of the two: .
State and explain the addition and multiplication theorems of probability with examples.
Addition Theorem of Probability
Gives the probability of the union of events (occurrence of at least one event).
For any two events and :
If and are mutually exclusive (cannot occur together, ):
Example: A card is drawn from 52 cards. , , .
Multiplication Theorem of Probability
Gives the probability of the joint occurrence (intersection) of events.
For any two events and :
where is the conditional probability of given .
If and are independent (occurrence of one does not affect the other):
Example: Two cards are drawn one after another without replacement.
If drawn with replacement (independent): .
Explain the Poisson distribution with its mean and variance and state its applications.
Poisson Distribution
The Poisson distribution is a discrete probability distribution that gives the probability of a given number of independent events occurring in a fixed interval of time, space, area, or volume, when these events occur at a constant average rate . It is the limiting case of the binomial distribution when , , with finite.
The probability mass function is:
where is the average number of occurrences and .
Mean and Variance
A characteristic property is that the mean and variance are equal:
Conditions
- Events occur independently.
- The average rate is constant.
- is large, is small (rare events).
Applications
Used to model the number of rare events, such as:
- Number of telephone calls received at an exchange per minute.
- Number of printing/typing errors per page of a book.
- Number of accidents on a highway per day.
- Number of defective items in a large batch.
- Number of customers/packets arriving at a server per unit time (queueing/network traffic).
Define a random variable. Differentiate between discrete and continuous random variables with examples.
Random Variable
A random variable is a real-valued function that assigns a numerical value to each outcome (sample point) of a random experiment. It maps the sample space to the set of real numbers, i.e. .
Example: In tossing two coins, if = number of heads, then takes values .
Discrete vs Continuous Random Variables
| Basis | Discrete Random Variable | Continuous Random Variable |
|---|---|---|
| Values | Takes countable (finite or countably infinite) isolated values | Takes any value within an interval (uncountable) |
| Probability | Described by a probability mass function | Described by a probability density function ; |
| Total probability | ||
| Probability of a range | Sum over values | |
| Examples | Number of heads in tosses, number of defective items, number of accidents | Height, weight, temperature, time, length |
Examples
- Discrete: Number of students present in a class; number of calls per hour.
- Continuous: The height of a person (e.g. 165.3 cm); the time taken to run a race.
Define mathematical expectation. State and prove its properties.
Mathematical Expectation
The mathematical expectation (or expected value) of a random variable is the long-run average value it takes, weighted by probabilities. It is a measure of the central tendency of a probability distribution.
For a discrete random variable with p.m.f. :
For a continuous random variable with density :
Properties (with Proof)
1. Expectation of a constant: . Proof:
2. Constant multiplier: . Proof:
3. Addition (linearity): , for any random variables . Proof (discrete):
4. Linear combination: (from properties 1–3).
5. Multiplication for independent variables: If and are independent, then . Proof: For independence , so
Explain the t-test for testing the significance of the difference between two sample means.
t-Test for Difference Between Two Sample Means
This test checks whether the means of two independent small samples () drawn from two normal populations differ significantly, when the population variances are unknown but assumed equal.
Hypotheses
- (no significant difference in population means)
- (two-tailed)
Test Statistic
where the pooled standard deviation is
Degrees of Freedom
Decision Rule
Compare with the tabulated value :
- If , accept — means do not differ significantly.
- If , reject — the difference is significant.
Assumptions
The two samples are independent and drawn from normal populations with equal (but unknown) variances.
Explain the z-test for a large sample test of a single mean with an example.
Z-Test for a Single Mean (Large Sample)
For a large sample (), the Z-test is used to test whether the sample mean differs significantly from a specified population mean . By the Central Limit Theorem, is approximately normally distributed.
Hypotheses
- (two-tailed)
Test Statistic
If the population standard deviation is unknown, the sample standard deviation is used (valid for large ).
Decision Rule (5% level)
Compare with the critical value :
- If , accept .
- If , reject .
Example
A sample of items has mean with . Test whether the population mean is at the 5% level.
Since , we reject and conclude that the population mean differs significantly from 50.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) question paper 2078?
- The full BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Statistics II (BSc CSIT, STA210) 2078 paper come with solutions?
- Yes. Every question on this Statistics II (BSc CSIT, STA210) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 paper?
- The BSc CSIT (TU) Statistics II (BSc CSIT, STA210) 2078 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Statistics II (BSc CSIT, STA210) past paper free?
- Yes — reading and attempting this Statistics II (BSc CSIT, STA210) past paper on Kekkei is completely free.