databases - kennedyonline.us

advertisement
DATABASES
&
DATA WAREHOUSES
Access to a Gold Mine of Information
“The real value will come from getting the right information to the
right people, at the right time, and giving those people the tools to find
the meaning in it.”
Declaration of Integration
Abbie Lundberg
CIO 1Dec2002
Databases
A.1c
Aawwww, Rubbish!
Database
u “Business intelligence tools are only as good as the data that goes
into them; faulty data leads to ill-informed decisions. The
ramifications range from ticked-off customers, to misled investors, to
testy regulators.”
► “Executives can face jail time under Sarbanes-Oxley if they don’t
have financial data in order.”
Information Week:
Budiness Innovation
Powered by Technology
u “Our marketing effectiveness leads to our sales effectiveness, which
leads to our service effectiveness. Data quality is the key to success.
If you don’t have quality data, that whole chain breaks down.”
Chuck Scoggins,
VP Marketing @ Hilton Hotels
A-2a
Database
ACCESS PROJECTS
u “What’s really great about databases and DBMSs is that the
concepts are the same whether you’re using a personal DBMSs
package or a large mainframe DBMS package.”
u “...once you learn database and DBMS concepts while using a
personal DBMS package (like MS ACCESS), you can apply your
knowledge directly to business environments where other DBMS
packages may be used.”
Haag.2004.94
u Team Projects use ACCESS to demonstrate fundamental concepts:
VData Definition: tables (fields, properties, keys)
Project A
VApplication Generators: forms (controls)
Projects B, C, D
VData Manipulation: sorting , filters & queries
Projects E, F
A-2b
Database
Use Access as an Alternative….
u “Access gives them a robust database engine that does not
lock into a specific technology.”
1.3
u “Your job is to get work done, not build computer
programs. Access provides an easy way to get started
…with a user interface that will be familiar to anyone who
has used Microsoft Office.”
2.3
u Access is inexpensive, and most people catch on
quickly….Your company probably owns several copies of
Access.
2.4-5
u Access is a workable solution.
Stonyfield Farms & CSS
3.3
TechRepublic
5Dec03
A-3
Access
Project A
cradling
tables, properties
relevance
Access
creating
Projects E, F
analysis filter, sort, query
Access
capturing
reliability Projects B, C, D
forms & controls
conveying
access
DBMS processes
data (reliability) info
(relevance) knowledge
(access & analysis)
communicating
access
cradling
shared
information
&
decentralized
computing
A-12
Select Access Projects
Choice: order CIO.1 presentations (Rx
Tentative
dates
T
TH
CIO Team
Access Project
800
A:
Build)
930
Data Definition
(cradling : relevance)
B: Application Generators
(capturing : reliably)
BRING
C: Application Generators
(capturing : reliably)
FLASH
D: Application Generators
(capturing : reliably)
DRIVE
E:
Data Manipulation
(creating : analysis)
F:
Data Manipulation
(creating : analysis)
THURSDAY
200
Handout
Access Lessons
Copy
Projects to Travel Drive
Review
Project Notes
A-4a
ACCESS Projects: Structure & Process
DATA for
Project is in Chapter folder
Application
&
Analysis
(ALL 3)
CONCEPTS &
TEMPLATES from
PowerPoint & handouts
copy to flash-drive
from P:
after select projects
PROCEDURES from
MS step-by-step handout
Synthesis & Evaluation
of DBMS procedures: outline of
text integrating concepts & screens
Should do“Print Screens”
into PPT as do project
Hard limit:
20 minutes.
# slides depends
on content & use.
Application
&
Analysis
Presentation focusing on
Analysis, Synthesis & Evaluation
contributes to Comprehension
Text in MS chapters
illustrates how to
apply, synthesize &
evaluate data
procedures &
concepts
CIO case
See samples
of prior
presentations
ASSIGNMENT: Work with team on ACCESS project.
u
Emphasize course concepts to
explain why the procedures are
important in:
H
meeting the MIS challenge
H
performing the 5Cs,
H
processing data into information
then knowledge,
H
data sharing and decentralized
computing that support decision
making and innovation
H
building CIS (TPS), MIS
(dashboards), DSS, WSS
H
creating information
partnerships, virtual – learning
organizations that are timeless &
locationless
u This requires reviewing
Information Age, IT Systems,
IT Strategies PowerPoints
and using those concepts,
frameworks and ideas to
analyze and evaluate the
ACCESS procedures.
u These exercises are essential
to the synthesis of concepts,
frameworks and ideas into a
new whole that changes the
brain real learning (Bloom
Hierarchy of Learning
Objectives; WSJ 19Jan07
“Thinking Can Change the
Brain”)
10
Seinfeld car rental:
databases support all 5Cs
take reservation
hold reservation
availability of cars
presentation
capture
forms
cradle
tables
create
query
communicate & convey
report
Cluster diagram:
access performs 5Cs &
supports IT systems
A-4b
Real World Connections—
Resources and Strategies from BusinessWeek
Levels of Learning
Goal:
For students to be
• informed,
• responsive, and
• constructive ongoing learners.
Being Informed Demonstrates:
• Knowledge (define, identify, label, select,
underline, etc.)
• Understanding (classify, locate, restate,
summarize, etc.)
Being Responsive Demonstrates:
• Application (compute, investigate, illustrate,
translate, apply, etc.)
• Analysis (calculate, debate, question, solve,
test, relate, etc.)
Being Constructive Demonstrates:
• Synthesis (collect, compose, design, integrate,
propose, perform, etc.)
• Evaluation (assess, compare, contrast, judge,
rate, value, decide, grade, etc.)
12
ACCESS 2010 Projects
Information - Relevance
 Project A Creating Databases & Tables
Data - Reliability
 Project B Creating Forms
 Project C Restricting Data Format & Validation Rules
 Project D Restricting Data to Values in Lists and Other Tables
Knowledge – Access & Analysis
 Project E Sorting & Filtering Data in Table
 Project E Creating & Using Queries
Project D: Data - Reliability
***
Uncheck “read only box” in properties (right click on file for properties), then Apply, OK
***
Security Warning: Enable content
*** Handout
1.
Access 2010: Chapter 6 “Maintain Data Integrity” (with notes)
2.
GardenCompany06 GardenCompany06 - AfterRestrictingDataUsingValidationRules
*** Context : pp. 143
6.5 Restricting Data to Values in Lists
 pp.167 - 171
6.6 Restricting Data to Values in Other Tables
 pp.172 - 176
 At step 17, refer to procedures at step 4 on page 173.
Do
these
pages
Some projects have
optional sections: for
example Project A
section 2.4 is optional
Project F: Knowledge – Access & Analysis
***
Uncheck “read only box” in properties (right click on file for properties), then Apply, OK
***
Security Warning: Enable content
*** Handout
1.
2.
Access 2010: Chapter 8 “Create Queries”
GardenCompany08 database
*** Context : p.209
8.2 Creating Queries Manually (Required)
 pp.216 - 220
Do
these
pages
8.3 Using Queries to Summarize Data (Optional)
 pp.221 - 224
8.5 Using Queries to Update Records (Required)
 pp. 231 – 234
8.6 Using Queries to Delete Records (Required)
 pp. 235 – 238
Some projects
have optional
sections
Handout
Access Lessons
Copy
Projects to Travel Drive
Review
Project Notes
1. Right click the file,
then click
properties
2. Uncheck
“Read Only”
4. click “OK”
3. click “Apply”
select:
enable content
ASSIGNMENT: ACCESS PRESENTATION COMPARISONS
1. After all the presentations are completed, use Expert Choice and
your notes to assess the relative importance of the criteria and to
compare all presentations, other than yours, on each of criteria.
Use the graphical mode in Expert Choice to make comparisons.
Your Inconsistency Ratio should be  .1
If your Inconsistency Ratio  .1, don’t record the most recent
comparisons. Redo them.
2. PrintScreen (key on keyboard) and paste into PowerPoint the
results of the comparisons. There should be 6 slides: 1 comparing the
criteria, 4 comparing the presentations on each of 4 criteria, and 1
synthesis graph. For examples see handout used to take your notes.
3. Return a team summary for the entire team on top
followed by each team members notes on the Access presentations
all their Expert Choice printouts.
Analytic: decompose
into components
Hierarchical Structure
Process of pairwise comparisons
ASSIGNMENT: ACCESS PRESENTATION COMPARISONS
1. Review the criteria (Bloom’s Hierarchy) for comparing the
team presentations:
APPLICATION
&
ANALYSIS
SYNTHESIS
EVALUATION
COMPREHENSION
Used relevant concepts in describing objects & events
(circumstances)
Original thinking integrated the concepts to create a new
whole
Used concepts as standards for making judgments about what
was or should be done
Logical flow between slides and presenters helped me
understand the concepts
2. During the presentations of other teams, take notes on next
page .
D.2
DATABASES
&
DATA WAREHOUSES
perform
information
processing (5Cs)
so meet
MIS challenge
Access to a Gold Mine of Information
“The real value will come from getting the right information to the
right people, at the right time, and giving those people the tools to find
the meaning in it.”
Declaration of Integration
Abbie Lundberg
CIO 1Dec2002
IA-15
Information
Information: Key Business Resource
u
The MIS Challenge is to give decision makers the information needed to
provide perfect service at customer’s moment of value.
V
DATA are observations that reliably describe a particular
phenomenon.
Online transaction processing (OLTP) captures & cradles data
describing events & objects
V
INFORMATION - is data that has a particular meaning within a
specific context -- is relevant to a decision.
Online analytical processing (OLAP) creates information by
organizing data around decision making and knowledge by
supporting access and analysis of information
V
KNOWLEDGE results from having access to information and the
tools to find meaning in it - analysis.
ORE
ANALOGY
DB-2.a
D
A
T
A
B
A
S
E
S
&
D
A
T
A
W
A
R
E
H
O
U
S
E
S
Tools to perform
data into onformation
so meet the
5Cs that process
and knowledge
MIS challenge
capture
cradle
data
journal
DBMS
(ACCESS)
software that allows
KW to perform
these tasks: 5Cs
derived from a data
warehouse by KW using
queries & reports to convey
information & create
knowledge with DSS (Excel)
DATA MART
(Excel)
capture & cradle
data about basic
objects & events with
TPS / CIS (journal)
using forms & tables
create
information
ledger
queries & reports used to create
& communicate information from
operational database to a special
form of database (ledger) that
supports decision making with MIS
& EIS
Victoria Secret
WSJ 9Dec02
DB-2.b
OUR FOCUS IN THIS DISCUSSION
u Knowledge
workers need tools for to meet the MIS challenge
Lundberg, IA.15
u Databases are tools that support OLTP by capturing & cradling data
that describes events / objects
V S&P’s Compustat (market data) & SEC’s Edgar (accounting data)
u Data
Warehouses are a form of database supports OLAP by creating
information from different operational databases for decision
making
V Extractive Petroleum Industry (RRA)
u Database
management systems (ACCESS) are software used to
specify logical structure of a database (data warehouses) and to
access and work with it.
KPMG.1
What IS Victoria’s Secret?
CAPTURE & CRADLE
BASIC DATA
from store TPS & online CIS
DATABASES
PROVIDE ACCESS BY
COMMUNICATING &
CONVEYING
INFORMATION TO
KNOWLEDGE WORKERS
WITH DASHBOARDS (MIS)
ENABLE ANALYSIS
(FILTERING, SORTING, QUERIES)
THAT CREATES KNOWLEDGE
FOR DECISION MAKING &
INNOVATION
What IS Victoria’s Secret?
CAPTURE & CRADLE
BASIC DATA
from store TPS & online CIS
DATABASES
PROVIDE ACCESS BY
COMMUNICATING &
CONVEYING
INFORMATION TO
KNOWLEDGE WORKERS
WITH DASHBOARDS (MIS)
ENABLE ANALYSIS
(FILTERING, SORTING, QUERIES)
THAT CREATES KNOWLEDGE
FOR DECISION MAKING &
INNOVATION
What IS Victoria’s Secret?
“Databases enable e-tailers “ not only to track individuals’ purchases
closely but also to analyze broad buying patterns - - by gender,
location and credit-card type, at least, and even internet-service
provider, if an e-mail address is given…(to) create powerful profiles
of shoppers and their buying habits” WSJ 1.5
DATABASES
“This wealth of data can be used to target marketing efforts….WSJ 1.5
Victoria Secret mails several catalogues a year, featuring largely the
same merchandise but with different covers and product photos.
Then it gauges which generates the most sales.” WSJ 3.1-4
Data Mining:
Why Math Will
Rock Your World!
Why M4th
Will Rock
Your World!
Bus Week
23 Jan06
Business
Systems &
Analytics
“A generation ago, quants turned
finance upside down. Now they’re
mapping out ad campaigns
(Victoria’s Secret) and building new
businesses from mountains of
personal data.” 54.1
“Math wizzes are the new business elite. They’re
turning personal data, trends and online content into
math, crunching the numbers and discovering new
efficiencies and ways to market. Before long they’ll
have a mathematical model of you.”
6.1
Why M4th
Will Rock
Your World!
Bus Week
23 Jan06
“Statisitics and Probability …will
become core skills for business
people as we grapple with challenges
involving large data sets. Winners
will know how to use statistics.”
Business
Systems &
Analytics
60.top
“Statistics and probability …will come in handy
whether you’re building financial models at Goldman
Sachs or marketing plans at Ford.”
RRA: building
financial model to find
new efficiencies to
markets
DB-12.b
What are Databases & Data
Data Mart
RRA

convey
information
&
create
knowledge
finance - accounting example
Tactical, aimed at
meeting an
immediate need Extractive
Petroleum
BUSINESS
Industry
ANALYTICS
SIC 1311
efficiencies
in capital market
resource allocation
Math Will Rock You

create
&
cradle
information
NEW
EFFICIENCIES
TO MARKETS
OLAP (queries, sorting,
filtering) selectively extracts /
organizes relevant data for
analysis like ledger,
strategic but unfinished concept
SEC EDGAR
whatis.com
Figure
Warehouses? 3-16

S&P Compustat
market
accounting
DATABASE
DATABASE
OLTP: JOURNALIZING BASIC EVENTS
BUSINESS SYSTEMS
CIS
(TPS)
capture
&
cradle
data
KPMG.1
Conceptual Model--Technology Overview
O p e r a ti o n a l D a t a b a s e s
Supports Building
Data Warehouses
Common Data Store
DBMS
DB-3.a
How is a relational database organized?
Logical Structure
(d e c i s i o n s)
Physical Structure
IT
Columns
Characteristics
Rows
Observations
DBMS
DBMS (Access) is
software bridge between
user’s logical requests and
physical structure so more
easily get to their work
performing the 5Cs
Access
BS&A
Table
Why Math
Will Rock
Your World
Bus. Week
23Jan06
p.62
DB-3.b
Database
How is a database organized?
u
Database is a collection of data that is organized and can be accessed
according to its logical structure
H.86.1-2, 84.2
whatis.com
V
Logical structure concerns how the data is viewed by its users -the knowledge workers (decision models)
Financial Statement structure (investors concerns FCF)
SFAC5.10
V
u
Physical structure is how the data exists on storage media H.84.2
DBMS engine accepts logical requests from the other DBMS
subsystems and converts them to their physical equivalent permitting
the user to concentrate on logical information needs, without concern
for the physical/technical details
H.85.2, 89.6
Access, Bloomberg
Figure 3.4
DB-4.a
Logical Structure of
a Relational Database
is composed of files (tables)
Part file (table)
Facility file (table)
Figure 3.4
DB-4.a
Logical Structure of
Data
Dictionary
Fields in columns
represent
characteristics
a Relational Database
is composed of files
Part file (table)
defines
properties /
characteristics
in fields of a
table,
including
keys
Facility file (table)
Figure 3.4
DB-4.a
Logical Structure of
Data
Dictionary
Fields in columns
represent
characteristics
a Relational Database
is composed of files
Part file (table)
defines
properties /
characteristics
in fields
including
keys
Records
in rows
entered
by forms
Primary keys
define
records &
relationships
within a file
Facility file (table)
Figure 3.4
DB-4.a
Logical Structure of
Data
dictionary
Fields in columns
represent
characteristics
a Relational Database
is composed of files
Part file (table)
within tables
defines
properties /
characteristics
in columns
including keys
Records
in rows
entered
by forms
Primary keys
define
records &
relationships
within a file
Facility file (table)
DB-4.b
Database
What is the logical structure in a
relational database?
u
Database consists of a logical collection of files with
relations expressed in the rows and columns of tables
Compustat, Edgar
u
File is a logical grouping of records about related
objects/events in rows of a table with fields in columns
u
Record is a logical grouping of fields related to some
object/event (firms)
u
Field is a logical grouping of characters describing some
characteristic of an object/event (EPS, share price)
u
Character is the smallest logical unit of data (symbols/
letters/numbers)
Figure 4.4
DB-4.c
What defines logical structure in a
relational database?
Database
u Data Dictionary identifies the names and properties of fields in
the columns of tables including keys
H.87.3-88.1
u Keys are fields in a relational databases identify relationships within
and between files / tables
V Primary - is a field uniquely identifies/describes each record
indicating relationships within a file / table
H.88.2
V Foreign - primary key of one file appearing in another file
indicating a relationship between files / tables
H.89.1
u Integrity Constraints are provided by the the logical structure
defined by properties and keys
V Properties and keys limit the kind of data that can be entered into a
file therefore help ensure data reliability
H.89.2-3
ACCESS PROJECTS:
RELATIONSHIP TO DBMS
SUBSYSTEMS
42
A-2a
Database
ACCESS PROJECTS
u “What’s really great about databases and DBMSs is that the
concepts are the same whether you’re using a personal DBMSs
package or a large mainframe DBMS package.”
u “...once you learn database and DBMS concepts while using a
personal DBMS package (like MS ACCESS), you can apply your
knowledge directly to business environments where other DBMS
packages may be used.”
Haag.2004.94
u Team Projects use Microsoft’s DBMS ACCESS to demonstrate:
VData Definition: tables: fields, properties, keys
Project A
VApplication Generators: forms: controls
VData Manipulation: sorting , filters & queries
Projects B, C, D
Projects E, F
Access
Project A
cradling
tables, properties
relevance
Access
creating
Projects E, F
analysis filter, sort, query
Access
capturing
reliability Projects B, C, D
forms & controls
conveying
access
DBMS process
data (reliability) info
(relevance) knowledge
(access & analysis)
Access Projects
demonstrate how
DBMS subsystems
provide the tools
to create and use
the logical
structure of a
relational database
communicating
access
cradling
shared
information
&
decentralized
computing
Figure 3.6
B
D, E, F
45
DB-5
DBMS
What is a DBMS Engine?
u
the logical-physical bridge
V
u
accepts logical requests from the various other DBMS
subsystems, converts them to their physical
equivalent, and actually accesses the database and data
dictionary as they exist on a storage device
permits the user to concentrate on logical information
needs, without concern for the physical/technical details
Figures 3.6 & 3.2
DB-6
DBMS
What is a Database Management System?
u
software used to specify the logical structure for a database and
to access it.
Comparison with word processing & spreadsheet software
H.89.5
u
contains 5 software components:
1. DBMS engine
2. Data definition subsystem
3. Data manipulation subsystem
4. Application generation subsystem
5. Data administration subsystem Transnational firm
DB-7
DBMS
What is a Data Definition Subsystem?
u
creates and maintains the data dictionary that defines the
properties and logical structure of data in a file (table)
H.89.8-90.1
u
V
properties include: field name, data type, format,
default value, validation rule, keys
V
contributes to maintaining data integrity
Team Project A
DB-8
DBMS
What is Application Generation Subsystem?
u
u
contains facilities to develop transaction-intensive
applications that require a detailed series of tasks to
process a transaction. This subsystem includes:
V Tools for creating data entry screens and controls in
forms
V Programming languages specific to the DBMS: SQL
(standardized query language) in Access
V Interfaces to commonly used programming languages
that are independent of any DBMS: C, C++, Cobol
Team Projects B, C, D
DB-9.a
DBMS
What is a Data Manipulation Subsystem?
u helps you change data in a database and mine it for
valuable information.
V
most often the primary interface between a user and
the data in a database
V
tools in the data manipulation subsystem include:
views: filtering & sorting
Project E
report generators
queries
Project F
DB-9.b
DBMS
What are the data manipulation tools?
u views
V view the content of a database file, make changes,
perform simple filtering & sorting to find the
location of specific information.
u report generators
V help quickly define the content and format of a
report.
u
Team Project E
DB-9.c
DBMS
What are the data manipulation tools?
u
Query
V is a (graphical) tool that brings together data from
multiple tables to answer a question
Figure 3.7
V Queries are more powerful than filters
extract data from many tables
queries can be saved
V Typically, start with wizard then modify in design view
uTeam
Project F
DB-10.a
DBMS
What is Data Administration Subsystem?
u
helps manage the overall database environment by providing
facilities for:
V
Backup and recovery
Know how important if ever experience need
V
Security management
Who gets access what they can do (view/change)
V
Query optimization
restructures user queries so most efficient
DB-10.b
DBMS
What is Data Administration Subsystem?
V Reorganization
optimizes the physical database structure to improve
performance (e.g. defragment)
V Concurrency
control
resolves conflicts among multiple database users (e.g.
registration)
V Change
management
assesses impact of proposed changes to a database
DB-11
Database
What is an Object-Oriented database?
u
Encapsulates both information about an entity class and the
procedures to work with that information in a software module
called an object
V combining
information and procedures more closely models
how an organization works
V once
a set of procedures is defined for a given object, those
procedures can be reused by other objects
V Microsoft
Access is an object-oriented database
In-Database Analytics
DB-13
Database
What benefits / costs are related to
databases & data warehouses?
u POTENTIAL BENEFITS
V improved data integrity (reliability: forms, controls,
validation rules)
V increased user productivity (input: forms;
output: queries/reports)
V increased security
V reduced data redundancy (capture once, use many)
u POTENTIAL COSTS
V development
V hardware
V software
training
maintenance
DB-14
Data Warehouse
What Are the Critical Success Factors
for Databases & Data Warehouses?
Expectations are communicated to the users
User involvement is ensured throughout the project
Project has a good sponsor, from the business side not IT
Project team has the right skill set (CSS : KPMG)
A realistic schedule including all necessary tasks
Tools have been chosen to match the needs of the users ()
Users are trained in the way they plan to use the warehouse
DataWarehousing.com (7 Feb 2001)
DB-12.a
Issues addressed within SDLC
uWho
will be the users?
V What data will they need?
V What data mining tools will the users need ?
V How often should the data be updated?
H.104.6-8
H.104.9-105.2
H.105.3-4
uWhat
types of databases and networks (insourced, outsourced,
combination) are most appropriate?
uWho
should oversee the information resource?
V CIO should manage information which is as important as the
financial resources managed by CFO.
DB-12.b
Issues addressed within SDLC
u
Who has ownership of (responsibility for) the quality of
specific information?
u
What ethical, security and privacy issues are involved in using
information and IT?
u
How should technology changes be assessed?
V Chasing technology is very expensive
V The real issues is whether the new technology manages
information better and if the benefit exceeds the cost
Next topic
SDLC
Systems
Development
Life
Cycle
Download