Browse papers
LevelMaster in Data Science (SMS, TU)
SubjectFundamentals of Data Science
Year2080 BS
Exam sessionSa
Full marks45
Time allowed120 minutes
Questions10, all with step-by-step solutions
A

Group A

5 questions·3 marks each
1Short answer3 marks

Elaborate on agility (agile implementation) of CRISP-DM.

crisp-dmagile
2Short answer3 marks

Explain why data preprocessing is a major job of a data scientist.

data-preprocessing
3Short answer3 marks

Why is logistic regression a regression despite its use for classification tasks? Illustrate the sigmoid curve along with its mathematical function.

logistic-regressionsigmoid
4Short answer3 marks

Define entropy. How is it used with decision tree?

entropydecision-tree
5Short answer3 marks

Explain why data ethics is crucial in data science.

data-ethics
B

Group B

5 questions·6 marks each
6Long answer6 marks

Explain the OSEMN framework for data science project implementation with any suitable example of your own.

OR

Discuss the scope and limitations for data analysis of election data. What responsibilities should be fulfilled by a data scientist for such projects. Discuss with one example of your own.

osemndata-science-project
7Long answer6 marks

Discuss the problems caused by data quality during the training of machine learning algorithms. Do you recommend data first approach or model first approach for any of the data science problems? Explain with reason.

data-qualitymachine-learning
8Long answer6 marks

Apply Naïve bayes algorithm to decide if Married Female with salary of 42000 is probable to have illness or not based on data given below:

Marital StatusGenderIncomeIllness
MarriedMale40000Yes
UnmarriedMale35000No
MarriedMale60000Yes
MarriedFemale61000Yes
UnmarriedFemale36000Yes
MarriedFemale47500No
UnmarriedFemale32000No

OR

Explain the forward propagation and backward propagation of neural networks.

naive-bayesneural-networks
9Long answer6 marks

What do you mean by Random Forests? Why is random forest commonly used for feature selection despite being a machine learning model? Is random forest prone to over fitting? Why or why not?

random-forestfeature-selection
10Long answer6 marks

A healthcare organization developed a machine learning model to predict patients' risk of developing certain medical conditions based on their electronic health records (EHR). The model was trained using historical patient data, including diagnoses, treatments, and outcomes. However, it was later discovered that the model exhibited bias against patients from lower socioeconomic backgrounds. The training data disproportionately represented patients from wealthier neighborhoods who had better access to healthcare services and resources. As a result, the model erroneously associated higher socioeconomic status with lower health risks, leading to underestimating the risk of certain conditions for patients from disadvantaged backgrounds.

Explain how you can address the biases in such a situation. Be specific and suggest remedies to avoid biases.

biasfairnesshealthcare

Frequently asked questions

Where can I find the Master in Data Science (SMS, TU) Fundamentals of Data Science question paper 2080?
The full Master in Data Science (SMS, TU) Fundamentals of Data Science 2080 (Sa) question paper is available free on Kekkei. You can read every question online and attempt the paper under timed exam conditions.
Does the Fundamentals of Data Science 2080 paper come with solutions?
Yes. Every question on this Fundamentals of Data Science past paper includes a step-by-step solution, plus instant AI feedback when you attempt it on Kekkei.
How many marks is the Master in Data Science (SMS, TU) Fundamentals of Data Science 2080 paper?
The Master in Data Science (SMS, TU) Fundamentals of Data Science 2080 paper carries 45 full marks and is meant to be completed in 120 minutes, across 10 questions.
Is practising this Fundamentals of Data Science past paper free?
Yes — reading and attempting this Fundamentals of Data Science past paper on Kekkei is completely free.