Browse papers
A

Section A: Long Answer Questions

Attempt any TWO questions.

3 questions·10 marks each
1long10 marks

Explain the data warehouse architecture and the ETL process in detail. Discuss the role of metadata in a data warehouse.

data-warehouseetl
2long10 marks

Explain the FP-growth algorithm. Construct the FP-tree for a given transaction dataset and mine the frequent patterns.

association-rulesfp-growth
3long10 marks

What is cluster analysis? Compare partitioning, hierarchical, and density-based clustering methods with examples.

clustering
B

Section B: Short Answer Questions

Attempt any EIGHT questions.

9 questions·5 marks each
4short5 marks

What are the characteristics of a data warehouse (subject-oriented, integrated, time-variant, non-volatile)?

data-warehouse
5short5 marks

Explain the concept of data cube and aggregation.

olap
6short5 marks

What is a frequent itemset? Explain with an example.

association-rules
7short5 marks

Explain the Gini index as a splitting criterion.

decision-tree
8short5 marks

What is the agglomerative approach in hierarchical clustering?

clustering
9short5 marks

Explain the issues in data quality.

preprocessing
10short5 marks

Differentiate between eager and lazy learners.

classification
11short5 marks

What is the support count and minimum support threshold?

association-rules
12short5 marks

Write short notes on temporal data mining.

temporal-mining