Browse papers
LevelMaster in Data Science (SMS, TU)
SubjectStatistical Computing with R
Year2081 BS
Exam sessionBoard
Full marks45
Time allowed120 minutes
Questions10, all with step-by-step solutions
A

Group A

5 questions·3 marks each
1Short answer3 marks

Explain the following concepts with R codes:

a) Numeric variable and its type with example b) Categorical variable and its type with example

r-variablesdata-types
2Short answer3 marks

Explain the following concept with focus on R software:

a) Manipulating row and column of data frame in dplyr package with an example b) Extract, Transform and Load in dplyr package with an example

dplyrdata-frameetl
3Short answer3 marks

Explain the following concepts with focus on R software:

a) Boxplot with five number summaries with example b) Boxplot with outliers with example

boxplotoutliers
4Short answer3 marks

Explain the following concept with focus on R software:

a) Leverage in linear regression with example b) Multicollinearity in logistic regression with example

regressionleveragemulticollinearity
5Short answer3 marks

Explain the following concepts with focus on R software:

a) Biplot from principal component analysis b) Biplot from classical multidimensional scaling

pcamdsbiplot
B

Group B

5 questions·6 marks each
6Long answer6 marks

Load the "igraph" package in R studio and do the basic SNA as follows with R script and HTML output:

a) Define "g1" as graph object with ("R", "S", "S", "T", "T", "R", "R", "T", "U", "S") as its elements b) Plot "g1" with node color as green, node size as 30, link color as red and link size as 5 and interpret it c) Get degree, closeness and betweenness of g1 and interpret them carefully d) Get hub and communities of this data and interpret them carefully

OR

Do the following in R Studio using "airquality" dataset with R script:

a) Replace missing values of "Ozone" variable with median of this variable as corrected Ozone b) Get the histogram of the corrected Ozone variable using base R plot and interpret it carefully c) Get the boxplot of Wind variable using based R plot and interpret it carefully d) Get the Wind variable outliers using median and interquartile range and compare them with boxplot outlier values with justification

igraphsnaairquality
7Long answer6 marks

Do as follows in R Studio and do as follows with R script and HTML outputs:

a) Open R and go to Help and Manuals in PDF and open "An Introduction to R" file b) Import this pdf file in R using "pdftools" package c) Perform pre-processing and create 'corpus' afterwards d) Find the most frequent terms, create its bar diagram and interpret carefully

OR

Do the following in R Studio using "airquality" dataset with R script:

a) Get the boxplot of Temp variable using ggplot2 package and interpret it carefully b) Create class intervals of Temp variable using dplyr package and show it as frequency distribution c) Get pie chart of Temp variable class intervals using ggplot2 package and interpret it carefully d) Get scatter plot of corrected Temp and Wind variables using ggplot2 package and interpret it carefully

text-miningggplot2airquality
8Long answer6 marks

Do the following in R Studio using "airquality" dataset with R script:

a) Perform Shapiro-Wilk test on "Wind" variable to check if it follows normal distribution or not b) Perform Bartlett test on "Wind" variable by "Month" variable to check if the variances of Wind are equal or not on Month variable categories c) Fit 1-way ANOVA to compare "Wind" variable by "Month" variable and interpret the result carefully d) Fit the TukeyHSD post-hoc test with 95% confidence interval and interpret the result carefully

anovahypothesis-testingairquality
9Long answer6 marks

Do the following in R Studio using "USArrests" dataset with R script:

a) Divide the mtcars data into train and test datasets with 70:30 random splits b) Fit a supervised linear regression model and KNN regression model on train data with "Urban population – UrbanPop" as dependent variable and all other variables as independent variable c) Predict the UrbanPop variable in the test datasets using these two models and interpret results carefully d) Compare the fit indices (R-square, MSE, RMSE) of the two predicted models and choose the best model

regressionknnmodel-evaluation
10Long answer6 marks

Use the first four variables "iris" data and do as follows in the R Studio with R Script:

a) Fit a hierarchical clustering model using average linkage and get the dendogram for this model b) Get the best value of number of clusters to form (k) using the fitted model above c) Fit the k-means clustering with the best value of k identified above and interpret it carefully d) Compare k-means result with the last variable of this data usig confusion matrix and interpret the result carefully

clusteringk-meansiris

Frequently asked questions

Where can I find the Master in Data Science (SMS, TU) Statistical Computing with R question paper 2081?
The full Master in Data Science (SMS, TU) Statistical Computing with R 2081 (Board) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Statistical Computing with R 2081 paper come with solutions?
Yes. Every question on this Statistical Computing with R past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper?
The Master in Data Science (SMS, TU) Statistical Computing with R 2081 paper carries 45 full marks and is meant to be completed in 120 minutes, across 10 questions.
Is practising this Statistical Computing with R past paper free?
Yes — reading and attempting this Statistical Computing with R past paper on Kekkei is completely free.