Kingdom of Saudi Arabia Ministry of Higher Education Majmaah University Vice rectorate for Academic Affairs Measurement & Assessments Administration The final examination for the second semester 1434 / 1435 H College:…Science in az Zulfi……… Program: CSI Dept. Course Name: Data Mining Course Code: CSI 449-Z Section: 273 Date: 20-7-1435 Duration: two hours Number of pages: 5 The student's name: University ID: Examination Guidelines 1- Type your name and university identification number clearly in the space provided. 2- Use blue or black pen in answer and pencil in drawing. 3- Books or notes, papers and pu`blications are not allowed into the examination room. 4- Students are not allowed to get out from the examination room before passing 30 minutes from the beginning of test starting. Learning Outcomes The Knowledge Skills Interpersonal skills Cognitive skills and taking responsibility a b Communication, information technology and numerical skills Psychomotor skills d e c Grades Faculty member Corrector 1 Dr. Weal Khedr. /……………. Review Committee /……………. Name Signature /……………. /……………. /……………. Final grade...…../…....... Form (2) /……………. Learning outcome Question /……………. ………a……….. 1 /……………. ………a, b…….. 2 /……………. ……b, c……….. 3 /……………. ………c, d…….. 4 /……………. ……………….. 5 Corrector 2 /……………. Kingdom of Saudi Arabia Ministry of Higher Education Majmaah University Vice rectorate for Academic Affairs Measurement & Assessments Administration Question(1): Choose the right answer of the followings? 1) The values of ----------- attribute are just different names and provide only enough information to distinguish one object from another. a. Ratio b. Interval c. Ordinal d. Nominal 2) The values of a/an ------------ attribute provide enough information to order objects. a. Ratio b. Interval c. Ordinal d. Nominal 3) For ------------ attributes, the differences between values are meaningful, i.e., a unit of measurement exists. a. Ratio b. Interval c. Ordinal d. Nominal 4) It is a type of data sets that is based on a sequence or a transactions of data a. Record b. Graph c. Ordered d. Data Matrix 5) Reduce amount of time and memory required by data mining algorithms a. Data Reduction b. Data Mining c. Data aggregation d. Data matrix 6) It is the main technique employed for data selection. a. Noise b. Sampling c. Clustering d. Histogram 7) Combining two or more attributes (or objects) into a single attribute (or object) a. Noise b. Sampling c. Aggregation d. Histogram 8) It can be mapping Data to a New Space( Frequency Domain) . a. Aggregation b. Data Reduction c. Fourier transform d. Sampling 9) It refers to modification of original values. a. Aggregation 10) b. Data selection c. Noise d. Clustering Classify of records can be done by using a collection of -----------based classifier. a. Rules b. Clusters c. Decision tree d. Measure of Impurity Kingdom of Saudi Arabia Ministry of Higher Education Majmaah University Vice rectorate for Academic Affairs Measurement & Assessments Administration Question (2) Complete the followings? (A) Define Data Mining ? (B) What are Data Mining Tasks (C) Define Data Classification ? (D) Complete the following figure of a Classification model? ---------------- ------------------ ------------------- (E) What are Similarity Measures of Clustering ? (F) What are Challenges of Data Mining ----------- Kingdom of Saudi Arabia Ministry of Higher Education Majmaah University Vice rectorate for Academic Affairs Measurement & Assessments Administration Question (3) : 1- Draw the Decision tree to classify records based on class attribute (class)? 2- Find the class of tested set? 3- Calculate the Measure of Impurity by using GINI of Refund node? Answer Tid Refund Marital Status Taxable Income Class 1 Yes Single 125K No 2 No Married 100K No 3 No Single 70K No 4 Yes Married 120K No 5 No Divorced 95K Yes 6 No Married No 7 Yes Divorced 220K 8 No Single 85K Yes 9 No Married 75K No 10 No Single 90K Yes 60K No 10 Tested set: 1 2 3 NO Single Yes Married No Single 90 72 95 ? ? ? Kingdom of Saudi Arabia Ministry of Higher Education Majmaah University Vice rectorate for Academic Affairs Measurement & Assessments Administration Question (4) : A) How to determine/find the Best Split in Tree Induction classification technique ? B) What are Measure of Impurity of split? C) Construct a Rules-based Classifier of Question3?