DATABASES & DATA WAREHOUSES Access to a Gold Mine of Information “The real value will come from getting the right information to the right people, at the right time, and giving those people the tools to find the meaning in it.” Declaration of Integration Abbie Lundberg CIO 1Dec2002 Databases A.1c Aawwww, Rubbish! Database u “Business intelligence tools are only as good as the data that goes into them; faulty data leads to ill-informed decisions. The ramifications range from ticked-off customers, to misled investors, to testy regulators.” ► “Executives can face jail time under Sarbanes-Oxley if they don’t have financial data in order.” Information Week: Budiness Innovation Powered by Technology u “Our marketing effectiveness leads to our sales effectiveness, which leads to our service effectiveness. Data quality is the key to success. If you don’t have quality data, that whole chain breaks down.” Chuck Scoggins, VP Marketing @ Hilton Hotels A-2a Database ACCESS PROJECTS u “What’s really great about databases and DBMSs is that the concepts are the same whether you’re using a personal DBMSs package or a large mainframe DBMS package.” u “...once you learn database and DBMS concepts while using a personal DBMS package (like MS ACCESS), you can apply your knowledge directly to business environments where other DBMS packages may be used.” Haag.2004.94 u Team Projects use ACCESS to demonstrate fundamental concepts: VData Definition: tables (fields, properties, keys) Project A VApplication Generators: forms (controls) Projects B, C, D VData Manipulation: sorting , filters & queries Projects E, F A-2b Database Use Access as an Alternative…. u “Access gives them a robust database engine that does not lock into a specific technology.” 1.3 u “Your job is to get work done, not build computer programs. Access provides an easy way to get started …with a user interface that will be familiar to anyone who has used Microsoft Office.” 2.3 u Access is inexpensive, and most people catch on quickly….Your company probably owns several copies of Access. 2.4-5 u Access is a workable solution. Stonyfield Farms & CSS 3.3 TechRepublic 5Dec03 A-3 Access Project A cradling tables, properties relevance Access creating Projects E, F analysis filter, sort, query Access capturing reliability Projects B, C, D forms & controls conveying access DBMS processes data (reliability) info (relevance) knowledge (access & analysis) communicating access cradling shared information & decentralized computing A-12 Select Access Projects Choice: order CIO.1 presentations (Rx Tentative dates T TH CIO Team Access Project 800 A: Build) 930 Data Definition (cradling : relevance) B: Application Generators (capturing : reliably) BRING C: Application Generators (capturing : reliably) FLASH D: Application Generators (capturing : reliably) DRIVE E: Data Manipulation (creating : analysis) F: Data Manipulation (creating : analysis) THURSDAY 200 Handout Access Lessons Copy Projects to Travel Drive Review Project Notes A-4a ACCESS Projects: Structure & Process DATA for Project is in Chapter folder Application & Analysis (ALL 3) CONCEPTS & TEMPLATES from PowerPoint & handouts copy to flash-drive from P: after select projects PROCEDURES from MS step-by-step handout Synthesis & Evaluation of DBMS procedures: outline of text integrating concepts & screens Should do“Print Screens” into PPT as do project Hard limit: 20 minutes. # slides depends on content & use. Application & Analysis Presentation focusing on Analysis, Synthesis & Evaluation contributes to Comprehension Text in MS chapters illustrates how to apply, synthesize & evaluate data procedures & concepts CIO case See samples of prior presentations ASSIGNMENT: Work with team on ACCESS project. u Emphasize course concepts to explain why the procedures are important in: H meeting the MIS challenge H performing the 5Cs, H processing data into information then knowledge, H data sharing and decentralized computing that support decision making and innovation H building CIS (TPS), MIS (dashboards), DSS, WSS H creating information partnerships, virtual – learning organizations that are timeless & locationless u This requires reviewing Information Age, IT Systems, IT Strategies PowerPoints and using those concepts, frameworks and ideas to analyze and evaluate the ACCESS procedures. u These exercises are essential to the synthesis of concepts, frameworks and ideas into a new whole that changes the brain real learning (Bloom Hierarchy of Learning Objectives; WSJ 19Jan07 “Thinking Can Change the Brain”) 10 Seinfeld car rental: databases support all 5Cs take reservation hold reservation availability of cars presentation capture forms cradle tables create query communicate & convey report Cluster diagram: access performs 5Cs & supports IT systems A-4b Real World Connections— Resources and Strategies from BusinessWeek Levels of Learning Goal: For students to be • informed, • responsive, and • constructive ongoing learners. Being Informed Demonstrates: • Knowledge (define, identify, label, select, underline, etc.) • Understanding (classify, locate, restate, summarize, etc.) Being Responsive Demonstrates: • Application (compute, investigate, illustrate, translate, apply, etc.) • Analysis (calculate, debate, question, solve, test, relate, etc.) Being Constructive Demonstrates: • Synthesis (collect, compose, design, integrate, propose, perform, etc.) • Evaluation (assess, compare, contrast, judge, rate, value, decide, grade, etc.) 12 ACCESS 2010 Projects Information - Relevance Project A Creating Databases & Tables Data - Reliability Project B Creating Forms Project C Restricting Data Format & Validation Rules Project D Restricting Data to Values in Lists and Other Tables Knowledge – Access & Analysis Project E Sorting & Filtering Data in Table Project E Creating & Using Queries Project D: Data - Reliability *** Uncheck “read only box” in properties (right click on file for properties), then Apply, OK *** Security Warning: Enable content *** Handout 1. Access 2010: Chapter 6 “Maintain Data Integrity” (with notes) 2. GardenCompany06 GardenCompany06 - AfterRestrictingDataUsingValidationRules *** Context : pp. 143 6.5 Restricting Data to Values in Lists pp.167 - 171 6.6 Restricting Data to Values in Other Tables pp.172 - 176 At step 17, refer to procedures at step 4 on page 173. Do these pages Some projects have optional sections: for example Project A section 2.4 is optional Project F: Knowledge – Access & Analysis *** Uncheck “read only box” in properties (right click on file for properties), then Apply, OK *** Security Warning: Enable content *** Handout 1. 2. Access 2010: Chapter 8 “Create Queries” GardenCompany08 database *** Context : p.209 8.2 Creating Queries Manually (Required) pp.216 - 220 Do these pages 8.3 Using Queries to Summarize Data (Optional) pp.221 - 224 8.5 Using Queries to Update Records (Required) pp. 231 – 234 8.6 Using Queries to Delete Records (Required) pp. 235 – 238 Some projects have optional sections Handout Access Lessons Copy Projects to Travel Drive Review Project Notes 1. Right click the file, then click properties 2. Uncheck “Read Only” 4. click “OK” 3. click “Apply” select: enable content ASSIGNMENT: ACCESS PRESENTATION COMPARISONS 1. After all the presentations are completed, use Expert Choice and your notes to assess the relative importance of the criteria and to compare all presentations, other than yours, on each of criteria. Use the graphical mode in Expert Choice to make comparisons. Your Inconsistency Ratio should be .1 If your Inconsistency Ratio .1, don’t record the most recent comparisons. Redo them. 2. PrintScreen (key on keyboard) and paste into PowerPoint the results of the comparisons. There should be 6 slides: 1 comparing the criteria, 4 comparing the presentations on each of 4 criteria, and 1 synthesis graph. For examples see handout used to take your notes. 3. Return a team summary for the entire team on top followed by each team members notes on the Access presentations all their Expert Choice printouts. Analytic: decompose into components Hierarchical Structure Process of pairwise comparisons ASSIGNMENT: ACCESS PRESENTATION COMPARISONS 1. Review the criteria (Bloom’s Hierarchy) for comparing the team presentations: APPLICATION & ANALYSIS SYNTHESIS EVALUATION COMPREHENSION Used relevant concepts in describing objects & events (circumstances) Original thinking integrated the concepts to create a new whole Used concepts as standards for making judgments about what was or should be done Logical flow between slides and presenters helped me understand the concepts 2. During the presentations of other teams, take notes on next page . D.2 DATABASES & DATA WAREHOUSES perform information processing (5Cs) so meet MIS challenge Access to a Gold Mine of Information “The real value will come from getting the right information to the right people, at the right time, and giving those people the tools to find the meaning in it.” Declaration of Integration Abbie Lundberg CIO 1Dec2002 IA-15 Information Information: Key Business Resource u The MIS Challenge is to give decision makers the information needed to provide perfect service at customer’s moment of value. V DATA are observations that reliably describe a particular phenomenon. Online transaction processing (OLTP) captures & cradles data describing events & objects V INFORMATION - is data that has a particular meaning within a specific context -- is relevant to a decision. Online analytical processing (OLAP) creates information by organizing data around decision making and knowledge by supporting access and analysis of information V KNOWLEDGE results from having access to information and the tools to find meaning in it - analysis. ORE ANALOGY DB-2.a D A T A B A S E S & D A T A W A R E H O U S E S Tools to perform data into onformation so meet the 5Cs that process and knowledge MIS challenge capture cradle data journal DBMS (ACCESS) software that allows KW to perform these tasks: 5Cs derived from a data warehouse by KW using queries & reports to convey information & create knowledge with DSS (Excel) DATA MART (Excel) capture & cradle data about basic objects & events with TPS / CIS (journal) using forms & tables create information ledger queries & reports used to create & communicate information from operational database to a special form of database (ledger) that supports decision making with MIS & EIS Victoria Secret WSJ 9Dec02 DB-2.b OUR FOCUS IN THIS DISCUSSION u Knowledge workers need tools for to meet the MIS challenge Lundberg, IA.15 u Databases are tools that support OLTP by capturing & cradling data that describes events / objects V S&P’s Compustat (market data) & SEC’s Edgar (accounting data) u Data Warehouses are a form of database supports OLAP by creating information from different operational databases for decision making V Extractive Petroleum Industry (RRA) u Database management systems (ACCESS) are software used to specify logical structure of a database (data warehouses) and to access and work with it. KPMG.1 What IS Victoria’s Secret? CAPTURE & CRADLE BASIC DATA from store TPS & online CIS DATABASES PROVIDE ACCESS BY COMMUNICATING & CONVEYING INFORMATION TO KNOWLEDGE WORKERS WITH DASHBOARDS (MIS) ENABLE ANALYSIS (FILTERING, SORTING, QUERIES) THAT CREATES KNOWLEDGE FOR DECISION MAKING & INNOVATION What IS Victoria’s Secret? CAPTURE & CRADLE BASIC DATA from store TPS & online CIS DATABASES PROVIDE ACCESS BY COMMUNICATING & CONVEYING INFORMATION TO KNOWLEDGE WORKERS WITH DASHBOARDS (MIS) ENABLE ANALYSIS (FILTERING, SORTING, QUERIES) THAT CREATES KNOWLEDGE FOR DECISION MAKING & INNOVATION What IS Victoria’s Secret? “Databases enable e-tailers “ not only to track individuals’ purchases closely but also to analyze broad buying patterns - - by gender, location and credit-card type, at least, and even internet-service provider, if an e-mail address is given…(to) create powerful profiles of shoppers and their buying habits” WSJ 1.5 DATABASES “This wealth of data can be used to target marketing efforts….WSJ 1.5 Victoria Secret mails several catalogues a year, featuring largely the same merchandise but with different covers and product photos. Then it gauges which generates the most sales.” WSJ 3.1-4 Data Mining: Why Math Will Rock Your World! Why M4th Will Rock Your World! Bus Week 23 Jan06 Business Systems & Analytics “A generation ago, quants turned finance upside down. Now they’re mapping out ad campaigns (Victoria’s Secret) and building new businesses from mountains of personal data.” 54.1 “Math wizzes are the new business elite. They’re turning personal data, trends and online content into math, crunching the numbers and discovering new efficiencies and ways to market. Before long they’ll have a mathematical model of you.” 6.1 Why M4th Will Rock Your World! Bus Week 23 Jan06 “Statisitics and Probability …will become core skills for business people as we grapple with challenges involving large data sets. Winners will know how to use statistics.” Business Systems & Analytics 60.top “Statistics and probability …will come in handy whether you’re building financial models at Goldman Sachs or marketing plans at Ford.” RRA: building financial model to find new efficiencies to markets DB-12.b What are Databases & Data Data Mart RRA convey information & create knowledge finance - accounting example Tactical, aimed at meeting an immediate need Extractive Petroleum BUSINESS Industry ANALYTICS SIC 1311 efficiencies in capital market resource allocation Math Will Rock You create & cradle information NEW EFFICIENCIES TO MARKETS OLAP (queries, sorting, filtering) selectively extracts / organizes relevant data for analysis like ledger, strategic but unfinished concept SEC EDGAR whatis.com Figure Warehouses? 3-16 S&P Compustat market accounting DATABASE DATABASE OLTP: JOURNALIZING BASIC EVENTS BUSINESS SYSTEMS CIS (TPS) capture & cradle data KPMG.1 Conceptual Model--Technology Overview O p e r a ti o n a l D a t a b a s e s Supports Building Data Warehouses Common Data Store DBMS DB-3.a How is a relational database organized? Logical Structure (d e c i s i o n s) Physical Structure IT Columns Characteristics Rows Observations DBMS DBMS (Access) is software bridge between user’s logical requests and physical structure so more easily get to their work performing the 5Cs Access BS&A Table Why Math Will Rock Your World Bus. Week 23Jan06 p.62 DB-3.b Database How is a database organized? u Database is a collection of data that is organized and can be accessed according to its logical structure H.86.1-2, 84.2 whatis.com V Logical structure concerns how the data is viewed by its users -the knowledge workers (decision models) Financial Statement structure (investors concerns FCF) SFAC5.10 V u Physical structure is how the data exists on storage media H.84.2 DBMS engine accepts logical requests from the other DBMS subsystems and converts them to their physical equivalent permitting the user to concentrate on logical information needs, without concern for the physical/technical details H.85.2, 89.6 Access, Bloomberg Figure 3.4 DB-4.a Logical Structure of a Relational Database is composed of files (tables) Part file (table) Facility file (table) Figure 3.4 DB-4.a Logical Structure of Data Dictionary Fields in columns represent characteristics a Relational Database is composed of files Part file (table) defines properties / characteristics in fields of a table, including keys Facility file (table) Figure 3.4 DB-4.a Logical Structure of Data Dictionary Fields in columns represent characteristics a Relational Database is composed of files Part file (table) defines properties / characteristics in fields including keys Records in rows entered by forms Primary keys define records & relationships within a file Facility file (table) Figure 3.4 DB-4.a Logical Structure of Data dictionary Fields in columns represent characteristics a Relational Database is composed of files Part file (table) within tables defines properties / characteristics in columns including keys Records in rows entered by forms Primary keys define records & relationships within a file Facility file (table) DB-4.b Database What is the logical structure in a relational database? u Database consists of a logical collection of files with relations expressed in the rows and columns of tables Compustat, Edgar u File is a logical grouping of records about related objects/events in rows of a table with fields in columns u Record is a logical grouping of fields related to some object/event (firms) u Field is a logical grouping of characters describing some characteristic of an object/event (EPS, share price) u Character is the smallest logical unit of data (symbols/ letters/numbers) Figure 4.4 DB-4.c What defines logical structure in a relational database? Database u Data Dictionary identifies the names and properties of fields in the columns of tables including keys H.87.3-88.1 u Keys are fields in a relational databases identify relationships within and between files / tables V Primary - is a field uniquely identifies/describes each record indicating relationships within a file / table H.88.2 V Foreign - primary key of one file appearing in another file indicating a relationship between files / tables H.89.1 u Integrity Constraints are provided by the the logical structure defined by properties and keys V Properties and keys limit the kind of data that can be entered into a file therefore help ensure data reliability H.89.2-3 ACCESS PROJECTS: RELATIONSHIP TO DBMS SUBSYSTEMS 42 A-2a Database ACCESS PROJECTS u “What’s really great about databases and DBMSs is that the concepts are the same whether you’re using a personal DBMSs package or a large mainframe DBMS package.” u “...once you learn database and DBMS concepts while using a personal DBMS package (like MS ACCESS), you can apply your knowledge directly to business environments where other DBMS packages may be used.” Haag.2004.94 u Team Projects use Microsoft’s DBMS ACCESS to demonstrate: VData Definition: tables: fields, properties, keys Project A VApplication Generators: forms: controls VData Manipulation: sorting , filters & queries Projects B, C, D Projects E, F Access Project A cradling tables, properties relevance Access creating Projects E, F analysis filter, sort, query Access capturing reliability Projects B, C, D forms & controls conveying access DBMS process data (reliability) info (relevance) knowledge (access & analysis) Access Projects demonstrate how DBMS subsystems provide the tools to create and use the logical structure of a relational database communicating access cradling shared information & decentralized computing Figure 3.6 B D, E, F 45 DB-5 DBMS What is a DBMS Engine? u the logical-physical bridge V u accepts logical requests from the various other DBMS subsystems, converts them to their physical equivalent, and actually accesses the database and data dictionary as they exist on a storage device permits the user to concentrate on logical information needs, without concern for the physical/technical details Figures 3.6 & 3.2 DB-6 DBMS What is a Database Management System? u software used to specify the logical structure for a database and to access it. Comparison with word processing & spreadsheet software H.89.5 u contains 5 software components: 1. DBMS engine 2. Data definition subsystem 3. Data manipulation subsystem 4. Application generation subsystem 5. Data administration subsystem Transnational firm DB-7 DBMS What is a Data Definition Subsystem? u creates and maintains the data dictionary that defines the properties and logical structure of data in a file (table) H.89.8-90.1 u V properties include: field name, data type, format, default value, validation rule, keys V contributes to maintaining data integrity Team Project A DB-8 DBMS What is Application Generation Subsystem? u u contains facilities to develop transaction-intensive applications that require a detailed series of tasks to process a transaction. This subsystem includes: V Tools for creating data entry screens and controls in forms V Programming languages specific to the DBMS: SQL (standardized query language) in Access V Interfaces to commonly used programming languages that are independent of any DBMS: C, C++, Cobol Team Projects B, C, D DB-9.a DBMS What is a Data Manipulation Subsystem? u helps you change data in a database and mine it for valuable information. V most often the primary interface between a user and the data in a database V tools in the data manipulation subsystem include: views: filtering & sorting Project E report generators queries Project F DB-9.b DBMS What are the data manipulation tools? u views V view the content of a database file, make changes, perform simple filtering & sorting to find the location of specific information. u report generators V help quickly define the content and format of a report. u Team Project E DB-9.c DBMS What are the data manipulation tools? u Query V is a (graphical) tool that brings together data from multiple tables to answer a question Figure 3.7 V Queries are more powerful than filters extract data from many tables queries can be saved V Typically, start with wizard then modify in design view uTeam Project F DB-10.a DBMS What is Data Administration Subsystem? u helps manage the overall database environment by providing facilities for: V Backup and recovery Know how important if ever experience need V Security management Who gets access what they can do (view/change) V Query optimization restructures user queries so most efficient DB-10.b DBMS What is Data Administration Subsystem? V Reorganization optimizes the physical database structure to improve performance (e.g. defragment) V Concurrency control resolves conflicts among multiple database users (e.g. registration) V Change management assesses impact of proposed changes to a database DB-11 Database What is an Object-Oriented database? u Encapsulates both information about an entity class and the procedures to work with that information in a software module called an object V combining information and procedures more closely models how an organization works V once a set of procedures is defined for a given object, those procedures can be reused by other objects V Microsoft Access is an object-oriented database In-Database Analytics DB-13 Database What benefits / costs are related to databases & data warehouses? u POTENTIAL BENEFITS V improved data integrity (reliability: forms, controls, validation rules) V increased user productivity (input: forms; output: queries/reports) V increased security V reduced data redundancy (capture once, use many) u POTENTIAL COSTS V development V hardware V software training maintenance DB-14 Data Warehouse What Are the Critical Success Factors for Databases & Data Warehouses? Expectations are communicated to the users User involvement is ensured throughout the project Project has a good sponsor, from the business side not IT Project team has the right skill set (CSS : KPMG) A realistic schedule including all necessary tasks Tools have been chosen to match the needs of the users () Users are trained in the way they plan to use the warehouse DataWarehousing.com (7 Feb 2001) DB-12.a Issues addressed within SDLC uWho will be the users? V What data will they need? V What data mining tools will the users need ? V How often should the data be updated? H.104.6-8 H.104.9-105.2 H.105.3-4 uWhat types of databases and networks (insourced, outsourced, combination) are most appropriate? uWho should oversee the information resource? V CIO should manage information which is as important as the financial resources managed by CFO. DB-12.b Issues addressed within SDLC u Who has ownership of (responsibility for) the quality of specific information? u What ethical, security and privacy issues are involved in using information and IT? u How should technology changes be assessed? V Chasing technology is very expensive V The real issues is whether the new technology manages information better and if the benefit exceeds the cost Next topic SDLC Systems Development Life Cycle