In the Name of God
Faculty of Computer Engineering
Full name:                Student number:

Multiple Choices

1. Summarization is a simple addition of values along one or more data dimensions.
    a) True
    b) False

2. _____ is a process of taking operational data from one or more sources and mapping it, field by field, onto a new data structure in the data warehouse.
    a) Transformation
    b) Cleansing
    c) Integration
    d) Scrubbing

3. Which of the following fields typically make use of data mining techniques?
    a) Marketing
    b) Government intelligence
    c) Advertising
    d) All of the above

4. What is meant by discrete data?
    a) Data that allows only a finite set of values
    b) Data that allows real numbers only
    c) Both a and b
    d) Data that allows float values only

5. _________ stores data in a summarized version.
    a) Cube
    b) Roll up
    c) Both a and b
    d) A mine

6. _________ is a process of taking operational data from one or more sources and mapping it, field by field, onto a new data structure in the data warehouse.
    a) Transformation
    b) Cleansing
    c) Integration
    d) Scrubbing

7. Which of the following describes the quality of data?
    a) The data should be accurate
    b) The data could be stored according to data type
    c) The data should be timely
    d) Both a and c

8. Which algorithm is used to find correlations among different attributes in a data set?
    a) Associative algorithm
    b) Association algorithm
    c) Time Series algorithm
    d) Series algorithm

9. Which stage of data mining involves preparation and collection of data?
    a) Validation
    b) Exploration
    c) Both a and b
    d) Collection

10. Modeling is used to create a data model that helps further define information requirements.
    a) True
    b) False

11. What is data mining?
    a) Used to find patterns by comparing large amounts of data
    b) Used to find patterns by comparing large amounts of data, mainly for statistically inclined users
    c) Used for extracting and storing data, which allows easier reporting
    d) Both a and b

12. Height and width come under which type of data?
    a) Finite
    b) Discrete
    c) Continuous
    d) None of the above

Short Answer

1. If {1, 2, 3} and {2, 3, 4} are the only large 3-itemsets, state for each of the following sets whether it is a large itemset, is not a large itemset, or whether you cannot be certain either way.
    i. {1}
    ii. {1, 2}
    iii. {1, 4}
    iv. {1, 2, 3, 4}
    v. {1, 3, 4}

2. Discuss the advantages and disadvantages of applying sampling to reduce the number of data objects that need to be displayed.

3. Is simple random sampling (without replacement) a good approach to sampling? Why or why not?
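
Note on Short Answer question 1: the reasoning it asks for rests on the downward-closure (Apriori) property, namely that every subset of a large itemset is large and every superset of a non-large itemset is not large. The following is a minimal sketch of that reasoning, not part of the original exam. The function name classify and its parameters are hypothetical, and it assumes the given list contains all large itemsets of size k.

```python
from itertools import combinations


def classify(candidate, large_k_itemsets, k=3):
    """Classify a candidate itemset given the complete list of large k-itemsets.

    Hypothetical helper: returns "large", "not large", or "cannot be certain".
    """
    cand = frozenset(candidate)
    known = {frozenset(s) for s in large_k_itemsets}

    # Downward closure: any subset of a large itemset is itself large.
    if any(cand <= s for s in known):
        return "large"

    # A candidate with at least k items can only be large if every one of its
    # k-item subsets is large. Since the list of large k-itemsets is assumed
    # complete, a missing k-subset proves the candidate is not large.
    if len(cand) >= k:
        k_subsets = (frozenset(c) for c in combinations(sorted(cand), k))
        if any(s not in known for s in k_subsets):
            return "not large"

    # Otherwise the given information neither proves nor rules it out.
    return "cannot be certain"


if __name__ == "__main__":
    large_3 = [{1, 2, 3}, {2, 3, 4}]
    for itemset in [{1}, {1, 2}, {1, 4}, {1, 2, 3, 4}, {1, 3, 4}]:
        print(sorted(itemset), classify(itemset, large_3))
```

The third outcome is returned whenever neither rule applies, which is the case for a small itemset that is not contained in any of the listed large itemsets.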