Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

What is a data warehouse? Explain the three-tier architecture of a data warehouse in detail with a neat diagram.

data-warehousearchitecture
2long10 marks

Explain the K-means algorithm. Cluster the given set of points into two clusters and show all iterations until convergence.

clusteringkmeans
3long10 marks

What is classification? Explain the K-Nearest Neighbour (KNN) algorithm and classify a new instance for a given dataset.

classificationknn
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

Differentiate between operational database and data warehouse.

data-warehouse
5short5 marks

Explain the OLAP operations with examples.

olap
6short5 marks

What is market basket analysis?

association-rules
7short5 marks

Explain the candidate generation step in Apriori.

apriori
8short5 marks

What is overfitting in classification? How can it be avoided?

classification
9short5 marks

Explain the silhouette coefficient for cluster evaluation.

clusteringevaluation
10short5 marks

What is data discretization?

preprocessing
11short5 marks

Differentiate between star and snowflake schema.

schema
12short5 marks

Write short notes on spatial data mining.

spatial-mining