Management Information Systems for the Information Age Third Edition Haag Cummings McCubbrey Databases and Data Warehouses McGraw-Hill/Irwin CHAPTER 3 © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Knowledge Workers Checklist logical vs. physical organization databases & software to manage them relational model concepts data warehouses & data mining managing the information resource McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Organizations do 3 things with Information process transactions OLTP make decisions OLAP manage the information logical vs. physical McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition A Database is: • a collection of information • organized • accessed by logical structure McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Components of a Database • information itself – in the form of records • data dictionary • primary key for each record • foreign keys that relate files • built-in integrity constraints McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition A DataBase Management System • is software • contains 5 critical components – DBMS engine – data definition subsystem – data manipulation subsystem – application generation subsystem – data administration subsystem McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition DBMS Engine • most important but seldom recognized • separates logical from physical McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Definition • builds data dictionary • builds logical database structure McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Manipulation • views • report generators • QBE tools • SQL McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Application Generation • build forms • build data entry screens • create transaction processing McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Administration Subsystem • • • • • • backup and recovery security management query optimization reorganization facilities concurrency control change management McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Relational Database Model • most widely used • tables store data – intuitive for humans • entity class –people, places or things • instance – one of an entity – i.e. an employee, a city, a baseball game McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition An example: Small-Town Book Store Book Customer cus-num 100 114 115 Purchase cus-num 114 100 114 114 115 100 McGraw-Hill/Irwin name Poulos Simmons Thompson book-code BS7-8 GW4-5 PQ1-3 GW4-5 PQ1-2 BS7-8 book-code GW4-5 BS7-8 PQ1-2 PQ1-3 date 9/16/01 9/17/01 9/17/01 9/17/01 9/17/01 9/18/01 title Blazing Sun Bright Star Cattle Run Cattle Skip entities? instances? primary keys? foreign keys? © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Object-Oriented Model • an object is a software module • run by OODB • combines information and procedures • procedures are reusable McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Warehouses and Data Mining • support OLAP • support decision making McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Warehouse • information gathered from many databases • multidimensional • support OLAP, not OLTP McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Data Mining Tools • query-and-reporting tools • intelligent agents • multidimensional analysis tools • support information discovery McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Managing Information • determine type of models & databases • CIO serves as overseer • who owns the information? • ethical concerns • maintenance (outsource?) McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Haag Cummings McCubbrey Third Edition Summary • differences exist between the physical and logical view of information • databases have a significant role in organizations • data dictionaries hold the logical structure of databases • software to run databases is called a database management system • the relational database model is the most widely used • relations store information about an entity class • databases use primary and foreign keys • data warehouses and data mining tools help discover new information to support decision making • types of databases include: centralized, distributed, external, online, knowledge worker and operational • oversight of an organization’s information is often done by a CIO McGraw-Hill/Irwin © 2002 The McGraw-Hill Companies, Inc. All rights reserved.