BSc CSIT (TU) Science Statistics I (BSc CSIT, STA164) Question Paper 2078 Nepal
This is the official BSc CSIT (TU) (Science stream) Statistics I (BSc CSIT, STA164) question paper for 2078, as set in the regular annual examination. It carries 60 full marks and a time allowance of 180 minutes, across 12 questions. On Kekkei you can attempt this Statistics I (BSc CSIT, STA164) past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your BSc CSIT (TU) Statistics I (BSc CSIT, STA164) exam or solving previous years' question papers, this 2078 paper is a great way to practise under real exam conditions.
Section A: Long Answer Questions
Attempt any TWO questions.
Explain the measures of central tendency. Compute mean, median and mode for the given grouped frequency distribution and establish the empirical relationship among them.
Measures of Central Tendency
A measure of central tendency is a single value that represents the centre or typical value of a data set. The three principal measures are:
- Mean (): the arithmetic average of all observations.
- Median (): the middle value when data are arranged in order.
- Mode (): the value that occurs most frequently.
Formulae for Grouped Data
Mean: , where is the mid-value of each class.
Median: , where = lower boundary of the median class, , = cumulative frequency before the median class, = frequency of the median class, = class width.
Mode: , where = frequency of the modal class, and are the frequencies of the preceding and following classes.
Worked Example
Consider the distribution:
| Class | 0–10 | 10–20 | 20–30 | 30–40 | 40–50 |
|---|---|---|---|---|---|
| 5 | 8 | 15 | 9 | 3 |
Here .
Mean: mid-values .
Median: . Cumulative frequencies: . The median class is 20–30 (, , , ).
Mode: modal class is 20–30 (, , , , ).
Empirical Relationship
For a moderately skewed (asymmetrical) distribution:
Check: , which agrees closely with the computed mode. The relation shows that the median always lies between the mean and the mode (about one-third of the way from the mean to the mode).
Define the Poisson distribution. State its properties and applications. Given the mean number of accidents per day is 2, find the probability of 0, 1 and at least 2 accidents on a given day.
Poisson Distribution
A discrete random variable follows a Poisson distribution with parameter if it gives the probability of a given number of independent rare events occurring in a fixed interval of time or space, when these events occur at a constant average rate. Its probability mass function is
where is the mean (average) number of occurrences.
Properties
- It is a discrete distribution defined for .
- Mean = Variance = (a characteristic property).
- It has a single parameter .
- It is the limiting form of the binomial distribution when , , with finite.
- It is positively skewed; skewness and kurtosis , both decreasing as increases.
- The sum of independent Poisson variates is also Poisson (additive property).
Applications
- Number of accidents, telephone calls, or customer arrivals per unit time.
- Number of printing errors per page or defects per unit length.
- Number of radioactive decay emissions; arrivals in queueing theory.
Numerical Solution
Given mean , so .
Results: , , .
Define regression. Derive the two regression equations (Y on X and X on Y). Explain the properties of the regression coefficients.
Regression
Regression is a statistical method that estimates the average relationship between two (or more) variables and is used to predict the value of a dependent variable from a known value of an independent variable. The line that gives the best estimate is the line of regression, fitted by the principle of least squares (minimising the sum of squared deviations of observed points from the line).
Derivation of the Regression Line of Y on X
Let the line be . By least squares we minimise . Setting the partial derivatives to zero gives the normal equations:
Solving for gives the regression coefficient of on :
The line of regression of Y on X is therefore
Derivation of the Regression Line of X on Y
By symmetry, taking and minimising gives
The line of regression of X on Y is
Properties of the Regression Coefficients
- The correlation coefficient is the geometric mean of the two regression coefficients: .
- Both regression coefficients have the same sign, which is also the sign of .
- If one regression coefficient is greater than 1, the other must be less than 1 (since their product ).
- The AM of the two coefficients is greater than or equal to : .
- Regression coefficients are independent of the change of origin but not of scale.
- The two regression lines intersect at the means .
Section B: Short Answer Questions
Attempt any EIGHT questions.
Define harmonic mean.
The harmonic mean (HM) of observations is the reciprocal of the arithmetic mean of the reciprocals of the values:
For a frequency distribution, , where . It is the appropriate average for rates and ratios (e.g. average speed over equal distances, price per unit) and gives more weight to smaller values.
State the empirical relation between mean, median and mode.
For a moderately skewed (asymmetrical) distribution, the empirical relationship among the three measures of central tendency is
Equivalently, . The median always lies between the mean and the mode, dividing the distance in the ratio . For a perfectly symmetrical distribution, Mean = Median = Mode.
What are the properties of the Poisson distribution?
Properties of the Poisson Distribution
- It is a discrete distribution with pmf ,
- Mean = Variance = .
- It has only one parameter, (= in the binomial limit).
- It is the limiting case of the binomial distribution when , with fixed.
- It is positively skewed (); the skewness decreases as increases, approaching normality for large .
- Additive property: the sum of independent Poisson variates with parameters is Poisson with parameter .
Define the line of regression.
A line of regression is the straight line that gives the best estimate (in the least-squares sense) of one variable for a given value of the other, by minimising the sum of squared deviations of the observed points from the line. For two variables and there are two such lines:
- Regression of Y on X: , with , used to estimate from .
- Regression of X on Y: , with , used to estimate from .
Both lines pass through the point of means and coincide only when .
What is conditional probability?
Conditional probability is the probability of an event occurring given that another event has already occurred (with ). It is defined as
It restricts the sample space to those outcomes in which occurs. From this, the multiplication rule follows: . If and are independent, then .
Example: drawing two cards without replacement, .
Define mean deviation.
Mean deviation (MD) is a measure of dispersion equal to the arithmetic mean of the absolute deviations of the observations from a central value (usually the mean or median).
About the mean: (for grouped data, ).
About the median: .
Absolute values are used so that positive and negative deviations do not cancel. Mean deviation is least when taken about the median. The coefficient of mean deviation = .
What is a probability mass function?
A probability mass function (pmf) gives the probability that a discrete random variable takes each of its possible values. If takes values , the pmf is
and it must satisfy two conditions:
- for all (non-negativity), and
- (total probability is one).
Example: for a fair die, for . The pmf is the discrete counterpart of the probability density function used for continuous variables.
State the properties of a good measure of central tendency.
Properties of a Good Measure of Central Tendency
According to Yule and Kendall, an ideal average should:
- Be rigidly (clearly) defined by a mathematical formula.
- Be based on all the observations of the data.
- Be easy to understand and simple to compute.
- Be least affected by sampling fluctuations (sampling stability).
- Be suitable for further algebraic / mathematical treatment.
- Not be unduly affected by extreme values (outliers).
The arithmetic mean satisfies most of these except the last (it is affected by extreme values).
Find the mode of 2, 3, 3, 5, 7, 3, 8.
The mode is the value that occurs most frequently. Arranging the data by frequency:
- occurs three times,
- every other value occurs once.
Since has the highest frequency, the mode = 3.
Frequently asked questions
- Where can I find the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) question paper 2078?
- The full BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2078 (regular) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
- Does the Statistics I (BSc CSIT, STA164) 2078 paper come with solutions?
- Yes. Every question on this Statistics I (BSc CSIT, STA164) past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
- How many marks is the BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2078 paper?
- The BSc CSIT (TU) Statistics I (BSc CSIT, STA164) 2078 paper carries 60 full marks and is meant to be completed in 180 minutes, across 12 questions.
- Is practising this Statistics I (BSc CSIT, STA164) past paper free?
- Yes — reading and attempting this Statistics I (BSc CSIT, STA164) past paper on Kekkei is completely free.