CUSTOMER_CODE SMUDE DIVISION_CODE SMUDE EVENT_CODE OCTOBER15 ASSESSMENT_CODE MC0077_OCTOBER15 QUESTION_TYPE DESCRIPTIVE_QUESTION QUESTION_ID 3121 QUESTION_TEXT Explain the process of Text Retrieval using SQL3/TextRetrieval SCHEME OF EVALUATION SQL3 supports storage of multimedia data, such as text documents, in an or-database using the blob/clob data types. Seekers of information from text-based documents, commonly use ‘free text’ queries. (4 marks) Basically, the new-to SQL3-functionality includes: *Indexing routines: for the various types of media data. *Selection operator for the SQL3 WHERE clause for specification of selection criteria for media retrieval. *Text processing subsystems for similarity evaluation and result ranking. (6 marks) QUESTION_TYPE DESCRIPTIVE_QUESTION QUESTION_ID 3123 QUESTION_TEXT Discuss the classification of data in Fuzzy Databases SCHEME OF EVALUATION 1.Crisp: there is no vagueness 2.Fuzzy: there is vagueness in the information and this can be divided into two subparts a.Approximate value: the information data is not totally vague and there is some approximate value, which is known and the data, lies near that value. b.Linguistic variable: it is a variable that apart from representing a fuzzy number also representing a fuzzy number also represents linguistic concepts interpreted in a particular context. A linguistic variable is fully characterized by a quintuple<v, T, X, g, m> (10 marks) QUESTION_T DESCRIPTIVE_QUESTION YPE QUESTION_ID 74081 QUESTION_T Briefly explain the characteristics of text documents and characteristics of images. EXT SCHEME OF EVALUATION Characteristics of Text Documents Text documents can be considered semi-structured in the sense that they are constructed from a defined term vocabulary according to known grammatical rules for forming sentences and paragraphs. As documents, they also share some common elements such as an author, title, date of origin, a (set of) title(s), and perhaps a recipient. A text document can be described from 3 perspectives: 1. Semantic content of the document, i.e. representation of its meaning, 2. Context of the document, e.g. its author, publisher 3. Structure of the document, e.g. its language, style, length, ... Context and structure descriptors can be given as regular attributes of the document and can be modeled using any structural data model. Modeling the semantic content is more difficult. Characteristics of Images Image documents are represented as unstructured data, actually by a bit or pixel (picture element) string. As opposed to text documents, they do not have a standard 'vocabulary' or grammar that can be used for automatic interpretation of the semantic content, or meaning of the image. Instead, most image management systems use manual annotations, such as title/caption, keywords and/or text descriptions, to capture semantic interpretation. Since an image can be viewed as a document, the same types of metadata as used in Dublin Core can capture context metadata, while structural descriptors would include such characteristics as size in height and width, materials used, and implementation format. In addition to text annotations, the semantic content of images can be described by content descriptors, frequently called features. Image features can be classified according to their level of abstraction, or distance from the actual physical content of the image: QUESTION_TYPE DESCRIPTIVE_QUESTION QUESTION_ID 112437 QUESTION_TEXT Explain the different reasons for using replication 1. SCHEME OF EVALUATION 2. 3. Availability Performance Disconnected computing 4. Network local Reduction 5. Mass Deployment QUESTION_TYPE DESCRIPTIVE_QUESTION QUESTION_ID 112440 QUESTION_TEXT Define fuzzy sets. Explain need for fuzzy database and techniques of fuzziness in databases. Fuzzy set (3 marks) Need for fuzzy database (3.5 marks) SCHEME OF EVALUATION Techniques for implementation of fuzziness (3.5 marks) QUESTION_TYPE DESCRIPTIVE_QUESTION QUESTION_ID 112441 QUESTION_TEXT Write a note on Database Tuning Performance is the way a computer system behaves given a particular work load. Performance is measured in terms of system response time, throughput, and availability. Performance is also affected by: The resources available in your system and How well those resources are used and shared. (3 marks) In general, you tune your system to improve its cost-benefit ratio. Specific goals could include: Processing a larger, or more demanding, work load without increasing processing costs. SCHEME OF EVALUATION (2 marks) For example, to increase the work load without buying new hardware or using more processor time: > Obtaining faster system response times, or higher throughput, without increasing processing costs. > Reducing processing costs without degrading service to your users. (2 marks) Translating performance from technical terms to economic terms is difficult. Performance tuning certainly costs money in terms of user time as well as processor time, so before you undertake a tuning project, weighs its costs against its possible benefits. Some of these benefits are tangible: More efficient use of resources.The ability to add more users to the system. (3 marks)