LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034
M.C.A. DEGREE EXAMINATION - COMPUTER APPLICATIONS
FOURTH SEMESTER – APRIL 2008
DC 28
CA 4805 - DATA MINING

Date : 25/04/2008
Time : 9:00 - 12:00
Dept. No.
Max. : 100 Marks

PART-A
Answer ALL questions (10 X 2 = 20)

1. What is Data Mining?
2. What are the different functionalities of data mining?
3. How is data transformation achieved?
4. Why do we do data discretization?
5. Why are decision tree classifiers so popular?
6. Define a cluster.
7. What is multimedia data mining?
8. Why is graph mining so important?
9. How is security preserved in data mining?
10. What is a spatial database?

PART-B
Answer ALL questions (5 X 8 = 40)

11(a) Explain the architecture of a data mining system with a diagram.
(or)
11(b) Briefly explain the three important functionalities of data mining.

12(a) How are inconsistent data handled in data preprocessing?
(or)
12(b) Write the Apriori algorithm for frequent itemset generation.

13(a) Write short notes on any two of the following:
i. Gain Ratio
ii. Information Gain
iii. Gini index
(or)
13(b) What is prediction? Explain prediction using regression.

14(a) Discuss text data mining.
(or)
14(b) Write an algorithm for Apriori-based frequent substructure mining.

15(a) Give an account of data mining applications in the retail industry.
(or)
15(b) “The Web poses great challenges for knowledge discovery.” Elaborate.

PART-C
Answer any TWO questions (Q.No 16 is compulsory) (2 X 20 = 40)

16. Discuss partitioning methods used in cluster analysis.
17(a) Is correlation analysis necessary after association analysis? Why? Explain.
(b) Explain Naïve Bayesian classification.
18. Data mining is becoming increasingly important in social life. Explain.

**************