Probability Engine · MDS 501

Fundamentals of Data Science: what's likely to come

Every real question from 13 past papers across 5 years (board + internal assessments each year) mapped to its official syllabus unit. Each prediction shows its receipt: the actual years it appeared.

13
Papers analyzed
2078-2082 · 5 yrs, multiple sittings
6
Syllabus units
from the official course
6
Very likely units
high-probability topics
5
Units = 80% of marks
study these first

13 papers across 5 years

This program sits several exams each year: one official board exam plus internal assessments. Every sitting is analysed.

20783 sittings
first assessmentfirst reassessmentsecond assessment
20791 sitting
board
20803 sittings
boardfirst assessmentsecond assessment
20814 sittings
boardfirst assessmentsecond assessment
20822 sittings
boardsecond assessment
Which exams to include?Showing: All exams
01The ranking

Topic predictions, ordered by what to study first

Every syllabus unit scored by how often it appears, its mark-weight, and its trend. See the exact questions behind each unit in the Explore-by-unit section below.

#Syllabus unitProbabilityAppearedAvg marksSyllabus weightExam vs syllabusTrendQuestions
1U4Machine LearningVery likely100%24.617%8 lecture hrsBalancedexam 21% · syllabus 17%Steady3 recurring25 total
2U2Data MungingVery likely100%22.817%8 lecture hrsBalancedexam 19% · syllabus 17%Steady3 recurring23 total
3U3Data Analysis TechniqueVery likely100%22.821%10 lecture hrsBalancedexam 19% · syllabus 21%Steady1 recurring23 total
4U1Introduction to Data ScienceVery likely100%22.221%10 lecture hrsBalancedexam 19% · syllabus 21%Steady3 recurring21 total
5U6Ethical Issues in Data ScienceVery likely80%17.28%4 lecture hrsBalancedexam 12% · syllabus 8%Rising1 recurring14 total
6U5Introduction to Big DataVery likely100%10.817%8 lecture hrsUnder-examinedexam 9% · syllabus 17%Steady2 recurring9 total
02Drill down

Explore by unit: every question, ranked

Pick a syllabus unit and walk its questions from most-important to asked-once. The fastest way to revise one topic end to end.

U4Machine Learning
25 questions · 28 appearances · 24.6 avg marks
03High yield

Most-asked questions across all years

The questions that come back exam after exam, grouped across years and ranked by how often they're asked. Open one to read its real past answer.

Lowest priority: asked only once (102)

  • U1

    You are a data scientist. Take a data science project and complete it using the CRISP-DM approach. Explain what should be done in every step. [1+5]

    OR

    What is data driven decision making and how does data science assist data driven decision making? Explain OSEMN lifecycle for your data science project. [3 + 3]

    2082
  • U1

    You are a data scientist. Take a data science project with title and complete it using the TDSP approach. Explain what should be done in every step detailly.

    OR

    Explain CRISP-DM with its steps and compare and contrast it with OSEMN framework.

    2082
  • U2

    What is data munging? Explain what are the different issues in real world data along with the steps needed for handling those steps detailly.

    2082
  • U3

    What is feature selection? Explain filters and wrappers method for feature selection. [1 + 5]

    2082
  • U3

    Explain linear regression, logistic regression and decision trees for fitting model in detail.

    2082
  • U4

    The following table represents a dataset of 10 objects with attributes Color, Type, Origin and the "class", whether the customer who bought was satisfied or not.

    S. NoColorTypeOriginSatisfied?
    1RedCasualDomesticYes
    2RedCasualDomesticNo
    3RedCasualDomesticYes
    4YellowCasualDomesticNo
    5YellowCasualImportedYes
    6YellowCasualImportedYes
    7YellowFormalImportedNo
    8YellowFormalImportedYes
    9YellowFormalDomesticNo
    10RedFormalImportedNo
    11RedCasualImportedYes

    Use ID3 algorithm to find the attribute with maximum information gain.

    OR

    What is multi-layer neural network? How is learning done in neural networks? Explain backpropagation algorithm. [1 + 1 + 4]

    2082
  • U4

    Explain node and weights in neural networks. Consider following Neural Network and compute its output considering sigmoid as activation function for all layers. Weights of synaptic links are provided above each link.

    Input: X1=2X1 = 2, X2=3X2 = 3. Feedforward net: input nodes feed hidden nodes 1 and 2, which feed nodes 3 and 4, which feed output node 5. Weights: X1X1 \to node 1 = 0.8, X1X1 \to node 2 = 0.4, X2X2 \to node 1 = 1, X2X2 \to node 2 = 0.6, node 1 \to node 3 = 1.2, node 1 \to node 4 = 0.4, node 2 \to node 3 = 0.7, node 2 \to node 4 = 0.5, node 3 \to node 5 = 1.5, node 4 \to node 5 = 0.5.

    OR

    What is multi-layer neural network? How is learning done in neural networks? Explain backpropagation algorithm.

    2082
  • U6

    What is cognitive bias? Explain any two cognitive biases. Explain techniques for addressing bias.

    2082
  • U1

    Define and explain the TDSP lifecycle in data science.

    OR

    A leading retail chain in Nepal wants to use data science to enhance its customer experience and optimize inventory management. They have data from customer transactions, online browsing behavior, and social media interactions.

    Briefly explain how data science can be applied in the retail industry to improve customer experience and optimize inventory management. Provide specific examples of data science techniques that could be used in this context.

    2081
  • U1

    Elaborate on TDSP (Team Data Science Process) as a framework for the data science lifecycle.

    OR

    Discuss CRISP-DM (Cross-Industry Standard Process for Data Mining) as an agile approach to the data science lifecycle.

    2081
  • U1

    What is Data Science? Explain CRISP-DM approach for Data Science. [1+5]

    OR

    What is Data Science Lifecycle? Explain TDSP approach for Data Science.

    2081
  • U2

    Describe the common data quality issues with tabular data and their mitigation techniques with appropriate examples.

    2081
04The strategy

Study smart and sit a probable paper

How far a few high-priority topics take you, and a full mock paper built from the most likely questions, mirroring the real exam structure.

Study smart, not hard

Study the units in priority order. Each bar shows the share of total marks you'd have covered by then. The top 5 units alone cover ~80% of marks.

1Machine Learning21%
2+ Data Munging41%
3+ Data Analysis Technique60%
4+ Introduction to Data Science79%
5+ Ethical Issues in Data Science91%
← study up to here for ~80% of marks
6+ Introduction to Big Data100%

Most Probable Paper

Mirrors the real structure · 45 marks · based on 5 past papers

Group A
  1. 1.

    List and explain in short three main limitations of Data Science.

    [3 marks]
    Introduction to Data ScienceVery likelyfrom 2079 paper →

    This question has recurred in 2 of 5 years; including the board exam 1× (2079); and its topic (Introduction to Data Science) appears in 100% of years.

  2. 2.

    What do you mean by backpropagation? Conceptually, how do they differ from forward propagation in neural network?

    [3 marks]
    Machine LearningVery likelyfrom 2080 paper →

    Asked once (2080); including the board exam 1× (2080); and its topic (Machine Learning) appears in 100% of years.

  3. 3.

    List the differences between supervised and unsupervised machine learning methods including examples.

    [3 marks]
    Machine LearningVery likelyfrom 2079 paper →

    Asked once (2079); including the board exam 1× (2079); and its topic (Machine Learning) appears in 100% of years.

  4. 4.

    Define and briefly describe neural networks. List a few example use-cases where neural networks can be used.

    [3 marks]
    Machine LearningVery likelyfrom 2079 paper →

    Asked once (2079); including the board exam 1× (2079); and its topic (Machine Learning) appears in 100% of years.

  5. 5.

    What are decision trees? What does impurity of a node mean in context of decision tree?

    [3 marks]
    Machine LearningVery likelyfrom 2079 paper →

    Asked once (2079); including the board exam 1× (2079); and its topic (Machine Learning) appears in 100% of years.

Group B
  1. 1.

    Apply map-reduce to the following set of data:

    Data, Science, Engineering

    Engineering, Data, Analytics

    Analytics, Intelligence, Science

    OR

    What is Hadoop? Explain the different components of Hadoop.

    [6 marks]
    Introduction to Big DataVery likelyfrom 2080 paper →

    This question has recurred in 2 of 5 years; including the board exam 1× (2080); and its topic (Introduction to Big Data) appears in 100% of years.

  2. 2.

    Explain node and weights in neural networks. Consider following Neural Network and compute its output considering sigmoid as activation function for all layers. Weights of synaptic links are provided above each link.

    Input: X1=2X1 = 2, X2=3X2 = 3. Feedforward net: input nodes feed hidden nodes 1 and 2, which feed nodes 3 and 4, which feed output node 5. Weights: X1X1 \to node 1 = 0.8, X1X1 \to node 2 = 0.4, X2X2 \to node 1 = 1, X2X2 \to node 2 = 0.6, node 1 \to node 3 = 1.2, node 1 \to node 4 = 0.4, node 2 \to node 3 = 0.7, node 2 \to node 4 = 0.5, node 3 \to node 5 = 1.5, node 4 \to node 5 = 0.5.

    OR

    What is multi-layer neural network? How is learning done in neural networks? Explain backpropagation algorithm.

    [6 marks]
    Machine LearningVery likelyfrom 2082 paper →

    Asked once (2082); including the board exam 1× (2082); and its topic (Machine Learning) appears in 100% of years.

  3. 3.

    How does machine learning differs from traditional learning? Explain the various type of machine learning techniques.

    [6 marks]
    Machine LearningVery likelyfrom 2081 paper →

    Asked once (2081); including the board exam 1× (2081); and its topic (Machine Learning) appears in 100% of years.

  4. 4.

    Elaborate the concept behind Naïve Bayes algorithm for classification task.

    [6 marks]
    Machine LearningVery likelyfrom 2080 paper →

    Asked once (2080); including the board exam 1× (2080); and its topic (Machine Learning) appears in 100% of years.

  5. 5.

    What is data munging? Explain what are the different issues in real world data along with the steps needed for handling those steps detailly.

    [6 marks]
    Data MungingVery likelyfrom 2082 paper →

    Asked once (2082); including the board exam 1× (2082); and its topic (Data Munging) appears in 100% of years.

Topics are the official MDS 501 syllabus units. Predictions are data-driven probabilities computed from 13 past papers (2078-2082) by mapping each real question to its syllabus unit. They indicate what has historically been likely, not guaranteed questions. Always study the full syllabus.