BSc CSIT (TU) Science Statistics I (BSc CSIT, STA164) Question Paper 2075 Nepal
This is the official BSc CSIT (TU) (Science stream) Statistics I (BSc CSIT, STA164) question paper for 2075, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Statistics I (BSc CSIT, STA164) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Statistics I (BSc CSIT, STA164) exam or solving previous years' question papers, this 2075 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Define dispersion. Explain different measures of dispersion. Compute the standard deviation and coefficient of variation for a given frequency distribution and comment on consistency.
Dispersion
Dispersion (or variation) measures the extent to which individual observations in a data set are scattered about a central value (mean/median). A small dispersion means the data are clustered closely around the average; a large dispersion means they are spread out. Dispersion describes the consistency, reliability, and homogeneity of a series.
Measures of Dispersion
1. Absolute measures (expressed in the same units as the data):
- Range (largest value minus smallest value). Simplest but unstable.
- Quartile Deviation (Semi-IQR) . Based on the middle 50% of data.
- Mean Deviation . Average of absolute deviations from the mean (or median).
- Standard Deviation . The most important and widely used measure.
2. Relative measures (unit-free, used to compare two series):
- Coefficient of Range
- Coefficient of Quartile Deviation
- Coefficient of Variation
Worked Computation (illustrative frequency distribution)
| Class | Mid | |||
|---|---|---|---|---|
| 0–10 | 5 | 5 | 25 | 125 |
| 10–20 | 8 | 15 | 120 | 1800 |
| 20–30 | 15 | 25 | 375 | 9375 |
| 30–40 | 7 | 35 | 245 | 8575 |
| 40–50 | 5 | 45 | 225 | 10125 |
| Total | 40 | 990 | 30000 |
Mean:
Standard deviation:
Coefficient of variation:
Comment on Consistency
The series whose C.V. is smaller is more consistent / uniform / stable, while a higher C.V. indicates greater variability. (When comparing two distributions, compute C.V. for each and conclude that the one with the lower C.V. is more consistent.) Here, a C.V. of about indicates a fairly high degree of variability in the data.
Explain the method of least squares. Fit a straight line of regression of Y on X to the given data and estimate the value of Y for a given X.
Method of Least Squares
The method of least squares fits a line (or curve) to data so that the sum of squares of the vertical deviations of observed points from the fitted line is a minimum. For the line of regression of on , written as
we minimise . Setting and gives the two normal equations:
Solving these gives:
Here is the regression coefficient of on , the average change in per unit change in .
Worked Example (fitting on )
For data and :
| 1 | 2 | 2 | 1 |
| 2 | 4 | 8 | 4 |
| 3 | 5 | 15 | 9 |
| 4 | 4 | 16 | 16 |
| 5 | 6 | 30 | 25 |
| 15 | 21 | 71 | 55 |
Regression line:
Estimating Y for a given X
For example, at :
So the estimated value of when is 6.6. (Substitute the actual asked in the question into the fitted line to obtain the estimate.)
Define a random variable. Explain probability mass function and probability density function. Find the mean and variance of a given probability distribution.
Random Variable
A random variable is a real-valued function that assigns a numerical value to each outcome (sample point) of a random experiment, i.e. . It is discrete if it takes a finite or countably infinite set of values (e.g. number of heads in tosses) and continuous if it can take any value in an interval (e.g. height, time).
Probability Mass Function (PMF)
For a discrete random variable , the PMF is
satisfying (i) for all , and (ii) . It gives the probability that equals each specific value.
Probability Density Function (PDF)
For a continuous random variable , the PDF satisfies (i) , (ii) , and the probability over an interval is
For a continuous variable ; only interval probabilities are meaningful.
Mean and Variance (worked example)
Let have the distribution:
| 0 | 1 | 2 | 3 | |
|---|---|---|---|---|
| 0.1 | 0.3 | 0.4 | 0.2 |
Mean (Expectation):
:
Variance:
Standard deviation .
For a continuous distribution the same idea is used with integrals: and .
Section B: Short Answer Questions
Attempt any EIGHT questions.
Define dispersion and its measures.
Dispersion is the degree to which the values of a data set are scattered or spread about a central (average) value. It indicates the consistency and homogeneity of the data.
Its measures are of two kinds:
- Absolute measures (same units as data): Range, Quartile Deviation, Mean Deviation, Standard Deviation.
- Relative measures (unit-free, for comparison): Coefficient of Range, Coefficient of Quartile Deviation, Coefficient of Mean Deviation, and Coefficient of Variation .
A smaller dispersion means more consistent/uniform data; a larger dispersion means more variability.
What is the range of a data set? Find the range of 12, 15, 20, 8, 25.
The range is the simplest absolute measure of dispersion, defined as the difference between the largest () and smallest () values in a data set:
For the data : largest , smallest .
Define regression coefficients.
Regression coefficients are the slopes of the two lines of regression; each measures the average rate of change of one variable per unit change in the other.
- Regression coefficient of on : — change in per unit change in .
- Regression coefficient of on : — change in per unit change in .
Important properties:
- , so the correlation coefficient is the geometric mean of the two regression coefficients.
- Both have the same sign as .
- Their product cannot exceed 1: .
State the multiplication theorem of probability.
The multiplication theorem of probability gives the probability of the joint occurrence of two events.
For dependent events and (general form):
where is the conditional probability of given that has occurred.
For independent events, , so the theorem reduces to:
This extends to events: .
What is a random variable?
A random variable is a real-valued function that assigns a numerical value to every outcome of a random experiment, i.e. where is the sample space.
It is classified as:
- Discrete random variable — takes a finite or countably infinite number of distinct values (e.g. number of heads when a coin is tossed three times: ).
- Continuous random variable — can assume any value within an interval (e.g. height, weight, time).
Example: Tossing two coins, let = number of heads. Then takes values with probabilities .
Define positive and negative correlation.
Correlation describes the direction and strength of the linear relationship between two variables.
-
Positive correlation: both variables move in the same direction — as one increases, the other also increases (and as one decreases, the other decreases). The correlation coefficient lies in . Example: height and weight; income and expenditure.
-
Negative correlation: the two variables move in opposite directions — as one increases, the other decreases. Here lies in . Example: price and quantity demanded; speed and time taken for a fixed distance.
is perfect positive and is perfect negative correlation.
What is the median of 7, 3, 9, 5, 11?
The median is the middle value of a data set when the observations are arranged in ascending (or descending) order.
Arrange the data in ascending order:
There are (odd) observations, so the median is the value.
Define the probability density function.
The probability density function (PDF) is the function that describes the distribution of a continuous random variable . It satisfies:
- for all (non-negative),
- (total area under the curve equals 1),
- The probability that lies in an interval is the area under the curve:
Note that for a continuous variable ; the PDF gives probabilities only over intervals, not at single points. The PDF is the derivative of the cumulative distribution function: .
What is meant by a scatter diagram?
A scatter diagram (scatter plot) is a graphical method of representing bivariate data, in which each pair of observations is plotted as a point on the -plane (with on the horizontal axis and on the vertical axis).
It gives a quick visual idea of the nature, direction, and degree of correlation between the two variables:
- If points cluster around an upward-sloping line → positive correlation.
- If points cluster around a downward-sloping line → negative correlation.
- If points lie exactly on a line → perfect correlation ().
- If points are scattered randomly with no pattern → little or no correlation.
It is simple to draw and is usually the first step before computing a numerical correlation coefficient.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) question paper 2075?
- The full BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Statistics I (BSc CSIT, STA164) 2075 paper come with solutions?
- Yes. Every question on this Statistics I (BSc CSIT, STA164) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 paper?
- The BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2075 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Statistics I (BSc CSIT, STA164) past paper free?
- Yes — reading and attempting this Statistics I (BSc CSIT, STA164) past paper on Kekkei is completely free.