Does the Statistical Computing with R 2081 paper come with solutions?

Yes. Every question on this Statistical Computing with R past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.

Is practising this Statistical Computing with R past paper free?

Yes — reading and attempting this Statistical Computing with R past paper on Kekkei is completely free.

Master in Data Science (SMS, TU) Statistical Computing with R Question Paper 2081 Nepal

Q: Where can I find the Master in Data Science (SMS, TU) Statistical Computing with R question paper 2081?

The full Master in Data Science (SMS, TU) Statistical Computing with R 2081 (Sa) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.

Q: How many marks is the Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper?

The Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper carries 45 full marks and is meant to be completed in 120 minutes, across 10 questions.

Master in Data Science (SMS, TU)Sa

45marks

120min

10questions

This is the official Master in Data Science (SMS, TU) Statistical Computing with R question paper for 2081, as set in the Sa examination. It carries 45 full marks and a time allowance of 120 minutes, across 10 questions. On Kekkei you can attempt this Statistical Computing with R past paper online with a timer, get instant AI feedback and step-by-step solutions, and track the topics where you lose marks — completely free. Whether you are revising for your Master in Data Science (SMS, TU) Statistical Computing with R exam or solving previous years' question papers, this 2081 paper is a great way to practise under real exam conditions.

Level	Master in Data Science (SMS, TU)
Subject	Statistical Computing with R
Year	2081 BS
Exam session	Sa
Full marks	45
Time allowed	120 minutes
Questions	10, all with step-by-step solutions

Group A

5 questions·3 marks each

1Short answer3 marks

Describe data visualization with focus on:

a) Concept of grammar of graphics with Wilkinson's approach

b) Layers in grammar of graphics with ggplot package's approach

c) Statistical transformations in grammar of graphics

data-visualizationggplotgrammar-of-graphics

2Short answer3 marks

Describe followings for checking fit of the multiple linear regression model:

a) Outliers

b) Cook's distance

c) Leverage

linear-regressionmodel-diagnostics

3Short answer3 marks

Describe supervised learning classification regression model with focus on:

a) Model fit indices

b) Confusion matrix with an example

c) Prediction accuracy with ROC curve

classificationconfusion-matrixroc

4Short answer3 marks

Describe following with example on it use:

a) Poisson regression

b) Zero-inflated Poisson regression

c) Negative binomial regression

poisson-regressioncount-models

5Short answer3 marks

Describe supervised linear regression model with focus on:

a) Cross-validation

b) K-fold cross-validation

c) Repeated k-fold cross-validation

cross-validationlinear-regression

Group B

5 questions·6 marks each

6Long answer6 marks

Do the following in R Studio using ggplot2 package with R script to knit PDF output:

a) Create a dataset with following variables: age (10-99 years), sex (male/female), educational levels (No education/Primary/Secondary/Beyond secondary), socio-economic status (Low, Middle, High) and body mass index (14 – 38) with random 200 cases of each variable. Your roll number must be used to set the random seed.

b) Create scatter plot of age and body mass index variables using ggplot2 package and interpret the result carefully.

c) Create classes of body mass index variable as: <18, 18-24, 25-30, 30+ and show it as pie chart using ggplot2 package and interpret it carefully

d) Create histogram of age variable with bin size of 15 using the ggplot2 package and interpret it carefully

ggplot2data-visualization

7Long answer6 marks

Do the following in R Studio using "airquality" data set of R with R script to knit PDF output:

a) Perform goodness-of-fit test on Temp variable to check if it follows normal distribution or not

b) Perform goodness-of-fit test on Temp variable by Month variable to check if the variances of mpg are equal or not on am variable categories

c) Discuss which independent sample test must be used to compare "Temp" variable by "Month" variable categories based on the results obtained above

d) Perform the best independent sample statistical test for this data now and interpret the results carefully

goodness-of-fithypothesis-testingairquality

8Long answer6 marks

Do the following in R Studio using "Arrests" dataset of car package with R script to knit PDF output:

a) Divide the Arrests data into train and test datasets with 80:20 random splits

b) Fit a supervised logistic regression and naïve Bayes classification models on train data with "released" as dependent variable and colour, age, sex, employed and citizen as independent variable

c) Predict the released variable in the test datasets of these models and interpret the result carefully

d) Compare and decide which classification model is better for this data

logistic-regressionnaive-bayesclassification

9Long answer6 marks

Do as follows using in-built "iris" dataset with R script to knit PDF output:

a) Create a "flower scale" of first four variables of iris dataset using the Principal Component Analysis

b) Compute the eigenvalues and interpret the PCA result carefully using Kaiser's criteria

c) Show the Scree plot and decide on the number of components to retain with careful interpretation

d) Revise the flower scale with 3 components using VARIMAX rotation and interpret the result carefully

Do as follows using given dataset of 10 US cities in R studio with R script:

City	Atlanta	Chicago	Denver	Houston	Los Angeles	Miami	New York	San Francisco	Seattle	Washington D.C
Atlanta	0	587	1212	701	1936	604	748	2139	2182	543
Chicago	587	0	920	940	1745	1188	713	1858	1737	597
Denver	1212	920	0	879	831	1726	1631	949	1021	1494
Houston	701	940	879	0	1374	968	1420	1645	1891	1220
Los Angeles	1936	1745	831	1374	0	2339	2451	347	959	2300
Miami	604	1188	1726	968	2339	0	1092	2594	2734	923
New York	748	713	1631	1420	2451	1092	0	2571	2408	205
San Francisco	2139	1858	949	1645	347	2594	2571	0	678	2442
Seattle	2182	1737	1021	1891	959	2734	2408	678	0	2329
Washington D.C	543	597	1494	1220	2300	923	205	2442	2329	0

a) Get dissimilarity distance as city dissimilarity object

b) Fit a classical multidimensional model using the city dissimilarity object

c) Get the summary of the model and interpret it carefully

d) Get the bi-plot of the model and interpret it carefully

pcamultidimensional-scalingiris

10Long answer6 marks

Use the first four variables of the "iris" data and do as follows with R Script to knit PDF output:

a) Fit a hierarchical clustering model using single linkage and get the dendogram for this model

b) Fit a hierarchical clustering model using complete linkage and get the dendogram for this model

c) Fit a hierarchical clustering model using average linkage and get the dendogram for this model

d) Find the best hierarchical clustering model for this data and locate the number of clusters for it

Use the first four variables of "iris" data file into R Studio and do as follows with R script to knit PDF output:

a) Fit a k-means clustering model in the data with k=2 and k=3

b) Plot the clusters formed with k=3 in the single graph and interpret them carefully

c) Add cluster centers for the plot of clusters formed with k=3 and interpret it carefully

d) Compare the k=3 clusters with Species variable using confusion matrix and interpret the result carefully

hierarchical-clusteringk-meansiris

Frequently asked questions

Where can I find the Master in Data Science (SMS, TU) Statistical Computing with R question paper 2081?: The full Master in Data Science (SMS, TU) Statistical Computing with R 2081 (Sa) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistical Computing with R 2081 paper come with solutions?: Yes. Every question on this Statistical Computing with R past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper?: The Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper carries 45 full marks and is meant to be completed in 120 minutes, across 10 questions.
Is practising this Statistical Computing with R past paper free?: Yes — reading and attempting this Statistical Computing with R past paper on Kekkei is completely free.