Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

Define data warehousing. Explain the data warehouse design process and the components of a data warehouse with a diagram.

data-warehousedesign
2long10 marks

What is cluster analysis? Explain the DBSCAN algorithm. Given a dataset with Eps = 2.5 and MinPts = 3, identify core points, border points, and outliers.

clusteringdbscan
3long10 marks

Explain the Apriori principle. Using the Apriori algorithm, generate frequent itemsets and association rules for a given transaction dataset.

association-rulesapriori
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

What is data transformation? List its techniques.

preprocessing
5short5 marks

Differentiate between ROLAP and MOLAP.

olap
6short5 marks

Explain the bottom-up and top-down approaches of building a data warehouse.

data-warehouse
7short5 marks

What is a confusion matrix? How is it used to evaluate a classifier?

classificationevaluation
8short5 marks

Explain the concept of data cube.

olap
9short5 marks

What is outlier analysis? Why is it important?

outlier
10short5 marks

Define entropy and information gain.

decision-tree
11short5 marks

Explain the partitioning method of clustering.

clustering
12short5 marks

Write short notes on web mining.

web-mining