Assignment on Foundations of Business Intelligence: Databases and Information Management Program: MBA Semester: Summer ‘14 Course Title: Computer & Information Management Course No. CIS-501 Submitted To Mr. Raisul Alam Chairman Department of CSE BUBT, Dhaka Submitted By Nadia Azad Section: 2 ID No. 13142201033 MBA (30th Intake) Semester- Summer Shah Md. Ikhtiar Fahim Section: 1 ID No. 12132201007 MBA (27th Intake) Semester- Summer Index 6.1. Database management System & Application 6.2. Characteristics of the Database System 1. 2. 3. 4. Structured and Described Data Data Integrity Transactions Data Persistence 6.3. Benefits of DBMS 1. Easy data control 2. Accuracy 3. Better processing speed 4. Reuse of data code 6.4. Types of Database Management Systems 1. Hierarchical Databases (DBMS) 2. Advantage 3. Disadvantage 4. Network Database 5. Relational Databases 6.5. Hierarchy of Data 1. Bits (characters) 2. Fields 3. Records 4. Files 5. Databases INTRODUCTION Database Management systems encompass a broad and complex topic. The conversation in the press always seems to be about how the datacenter is drowning in the growth of unstructured (file-type) information, but what happened to the challenges of managing the content owned by the enterprise applications and databases that actually run the business? Rather than focus on what type of information and data are most common, ask “Which processes are being used to manage and control all that information and data? ”Control is what is at issue and databases are running away with the game. Databases of one form or another control 50-75% of information and data in the datacenter and are predicted to be the growth path Foundations of Business Intelligence: Databases and Information Management 6.1. Database management System A database management system (DBMS) is a collection of programs that enables you to store, modify, and extract information from a database. There are many different types of DBMSs, ranging from small systems that run on personal computers to huge systems that run on mainframes. The following are examples of database applications: computerized library systems automated teller machines flight reservation systems computerized parts inventory systems From a technical standpoint, DBMSs can differ widely. The terms relational, network, flat, and hierarchical all refer to the way a DBMS organizes information internally. The internal organization can affect how quickly and flexibly you can extract information. Requests for information from a database are made in the form of a query, which is a stylized question. For example, the query SELECT ALL WHERE NAME = "SMITH" AND AGE > 35 Requests all records in which the NAME field is SMITH and the AGE field is greater than 35. The set of rules for constructing queries is known as a query language. Different DBMSs support different query languages, although there is a semi-standardized query language called SQL (structured query language). Sophisticated languages for managing database systems are called fourth-generation languages, or 4GLsfor short. The information from a database can be presented in a variety of formats. Most DBMSs include a report writer program that enables you to output data in the form of a report. Many DBMSs also include a graphics component that enables you to output information in the form of graphs and charts. 6.2. Characteristics of the Database System The presentation of facts, information or concepts that are created in a computer readable form or are translated into such a form is called data. In database management system data is kept in a table. Following are the main characteristics of database approach. Concurrent Use A database system allows several users to access the database concurrently. Answering different questions from different users with the same (base) data is a central aspect of an information system. Such concurrent use of data increases the economy of a system. Data capturing and data storage is not redundant, the system can be operated from a central control and the data can be updated more efficiently. 1. Structured and Described Data A fundamental feature of the database approach is that the database system does not only contain the data but also the complete definition and description of these data. These descriptions are basically details about the extent, the structure, the type and the format of all data and, additionally, the relationship between the data. 2. Data Integrity Data integrity is a byword for the quality and the reliability of the data of a database system. In a broader sense data integrity includes also the protection of the database from unauthorized access (confidentiality) and unauthorized changes. Data reflect facts of the real world. Logically, it is demanded that this reflection is done correctly. A DBMS should support the task to bring only correct and consistent data into the database. 3. Transactions A transaction is a bundle of actions, which are done within a database to bring it from one consistent state to a new consistent state. In between the data are inevitable inconsistencies. 4. Data Persistence Data persistence means that in a DBMS all data is maintained as long as it is not deleted explicitly. The life span of data needs to be determined directly or indirectly be the user and must not be dependent on system features. Additionally data once stored in a database must not be lost. Changes of a database, which are done by a transaction, are persistent. When a transaction is finished even a system crash cannot put the data in danger. 6.3. Benefits of DBMS Database management system simply refers to handling and managing of data. It is commonly used in small and large organizations to allow the collection of data, as well as background integration. Not only do they ensure information is easily accessed by users, but also help to facilitate data duplication. 1. Easy data control Since all the information is centralized, you don’t need to duplicate data. Unlike traditional paper systems, all users can get the data firsthand without making new copies. Likewise, data base management system helps to remove the cost of duplication resulting from single application programs. 2. Accuracy Because data is stored on multiple computers using database management software, handling data becomes easier. In addition, organizations use database management system to maintain accuracy of information. By doing so, they are able to enforce high standards. Of all the benefits, this is one of the most important features of database management system. Data is restricted only to those who are certified to manipulate it, therefore preventing breach of information by unscrupulous individuals. 3. Better processing speed The responsiveness of data management using computer systems is better with a well-versed DBMS, as compared to linear computer applications. For instance, functions such as sum, calculate, and count are automated with ease. In a data management system, the applications to report, store, and update the data are planned in an orderly manner. 4. Reuse of data code The best thing about data management system is that each part can reuse code from other modules or sections. For instance, inventory data is stored in a section that is separate from another set of data. Nevertheless, queries are combined from all the modules without having to change their structure. Since all the data is combined into one single database, power outage or corrupted database may result in loss of information. 6.4. Types of Database Management Systems: There are four structural types of database management systems: Hierarchical databases. Network databases. Relational databases. Object-oriented databases Hierarchical Databases (DBMS) : In the Hierarchical Database Model we have to learn about the databases. It is very fast and simple. In a hierarchical database, records contain information about there groups of parent/child relationships, just like as a tree structure. The structure implies that a record can have also a repeating information. In this structure Data follows a series of records, It is a set of field values attached to it. It collects all records together as a record type. These record types are the equivalent of tables in the relational model, and with the individual records being the equivalent of rows. To create links between these record types, the hierarchical model uses these type Relationships. Advantage: Hierarchical database can be accessed and updated rapidly because in this model structure is like as a tree and the relationships between records are defined in advance. This feature is a two-edged. Disadvantage: This type of database structure is that each child in the tree may have only one parent, and relationships or linkages between children are not permitted, even if they make sense from a logical standpoint. Hierarchical databases are so in their design. it can adding a new field or record requires that the entire database be redefined. Network Database: A network databases are mainly used on a large digital computers. It more connections can be made between different types of data, network databases are considered more efficiency It contains limitations must be considered when we have to use this kind of database. It is Similar to the hierarchical databases, network databases .Network databases are similar to hierarchical databases by also having a hierarchical structure. A network database looks more like a cobweb or interconnected network of records. In network databases, children are called members and parents are called occupier. The difference between each child or member can have more than one parent. The Approval of the network data model similar with the esteem of the hierarchical data model. Some data were more naturally modeled with more than one parent per child. The network model authorized the modeling of many-to-many relationships in data. The network model is very similar to the hierarchical model really. Actually the hierarchical model is a subset of the network model. However, instead of using a single-parent tree hierarchy, the network model uses set theory to provide a tree-like hierarchy with the exception that child tables were allowed to have more than one parent. It supports many-tomany relationships. Relational Databases: In relational databases, the relationship between data files is relational. Hierarchical and network databases require the user to pass a hierarchy in order to access needed data. These databases connect to the data in different files by using common data numbers or a key field. Data in relational databases is stored in different access control tables, each having a key field that mainly identifies each row. In the relational databases are more reliable than either the hierarchical or network database structures. In relational databases, tables or files filled up with data are called relations (tuples) designates a row or record, and columns are referred to as attributes or fields. Relational databases work on each table has a key field that uniquely indicates each row, and that these key fields can be used to connect one table of data to another. The relational database has two major reasons: 1. Relational databases can be used with little or no training. 2. Database entries can be modified without specify the entire body. Properties of Relational Tables: In the relational database we have to follow some properties which are given below. It's Values are Atomic In Each Row is alone. Column Values are of the Same thing. Columns is undistinguished. Sequence of Rows is Insignificant. Each Column has a common Name. Object-Oriented Model: In this Model we have to discuss the functionality of the object oriented Programming .It takes more than storage of programming language objects. Object DBMS's increase the semantics of the C++ and Java .It provides full-featured database programming capability, while containing native language compatibility. It adds the database functionality to object programming languages.This approach is the analogical of the application and database development into a constant data model and language environment. Applications require less code, use more natural data modeling, and code bases are easier to maintain. Object developers can write complete database applications with a decent amount of additional effort. The object-oriented database derivation is the integrity of object-oriented programming language systems and consistent systems. The power of the object-oriented databases comes from the cyclical treatment of both consistent data, as found in databases, and transient data, as found in executing programs. Object-oriented databases use small, recyclable separated of software called objects. The objects themselves are stored in the object-oriented database. Each object contains of two elements: 1. Piece of data (e.g., sound, video, text, or graphics). 2. Instructions, or software programs called methods, for what to do with the data. Disadvantage of Object-oriented databases 1. Object-oriented databases have these disadvantages. 2. Object-oriented database are more expensive to develop. 3. In the Most organizations are unwilling to abandon and convert from those databases. They have already invested money in developing and implementing. The benefits to object-oriented databases are compelling. The ability to mix and match reusable objects provides incredible multimedia capability. 6.5. Hierarchy of Data Data are the principal resources of an organization. Data stored in computer systems form a hierarchy extending from a single bit to a database, the major record-keeping entity of a firm. Each higher rung of this hierarchy is organized from the components below it. Data are logically organized into: 1. Bits (characters) 2. Fields 3. Records 4. Files 5. Databases Bit (Character) - a bit is the smallest unit of data representation (value of a bit may be a 0 or 1). Eight bits make a byte which can represent a character or a special symbol in a character code. Field - a field consists of a grouping of characters. A data field represents an attribute (a characteristic or quality) of some entity (object, person, place, or event). Record - a record represents a collection of attributes that describe a real-world entity. A record consists of fields, with each field describing an attribute of the entity. File - a group of related records. Files are frequently classified by the application for which they are primarily used (employee file). A primary key in a file is the field (or fields) whose value identifies a record among others in a data file. Database - is an integrated collection of logically related records or files. A database consolidates records previously stored in separate files into a common pool of data records that provides data for many applications. The data is managed by systems software called database management systems (DBMS). The data stored in a database is independent of the application programs using it and of the types of secondary storage devices on which it is stored. Summary: A database management system (DBMS) is a computer application program designed for the efficient and effective storage, access and update of large volumes of information. This course will look at such systems from two perspectives: • A user-centered perspective focusing on how a DBMS is used to support a data intensive application. This perspective includes a look at the common data models, query languages and design techniques. • A system implementation perspective focusing on the policies, algorithms and data structures used to design and implement a DBMS.