Foundations of Business Intelligence:
Databases and Information Management
Terms
Bits (smallest unit of data a computer can handle)
Bytes (8 bits; each represents a single character – letter,
number, or symbol)
Field (group of characters forming a word, a group of
words, or a complete number)
Record (group of related fields)
File (group of records of same type)
Database (group of related files)
Entity (person, place, thing, or event about which we
store and maintain info)
Attribute (characteristic or quality describing an entity)
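The hierarchy above can be sketched in a few lines of Python. This is only an illustration: the student entity, its attributes, and all values are made up, and plain dictionaries and lists stand in for real storage.

```python
# Hypothetical illustration of the data hierarchy; all names and values are invented.
char = "A"
byte = char.encode("ascii")        # one character stored as one byte
bits = format(byte[0], "08b")      # the 8 bits that make up that byte

# Entity: a student. Attributes: student_id, name, course.
# Record = group of related fields describing one entity.
record = {"student_id": 1001, "name": "Ana Ruiz", "course": "MIS 101"}

# File = group of records of the same type.
student_file = [
    record,
    {"student_id": 1002, "name": "Ben Cho", "course": "MIS 101"},
]

# Database = group of related files.
database = {
    "students": student_file,
    "courses": [{"course": "MIS 101", "title": "Intro to MIS"}],
}
```

Here each dictionary key plays the role of a field name, and each dictionary is one record.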
Traditional File Environment
Data redundancy and inconsistency
Program-data dependence
Lack of flexibility
Poor security
Lack of data sharing and availability
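A small sketch of how redundancy leads to inconsistency, assuming two hypothetical departmental files that each keep their own copy of the same customer. The file names, fields, and the find_inconsistencies helper are invented for illustration.

```python
# Two hypothetical departmental files, each maintained by its own program.
sales_file = [{"cust_id": 7, "name": "Lee", "phone": "555-0100"}]
billing_file = [{"cust_id": 7, "name": "Lee", "phone": "555-0199"}]  # updated only here

def find_inconsistencies(file_a, file_b, key, field):
    """Return key values whose `field` differs between the two files."""
    b_index = {row[key]: row for row in file_b}
    return [
        row[key]
        for row in file_a
        if row[key] in b_index and row[field] != b_index[row[key]][field]
    ]

conflicts = find_inconsistencies(sales_file, billing_file, "cust_id", "phone")
```

The same customer is stored twice (redundancy), and because only one copy was updated the two files now disagree (inconsistency).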
Database Management Systems (DBMS)
DBMS – software for creating, storing, organizing, and
accessing data; examples: Microsoft Access, Oracle
Database, IBM DB2
Logical view – data as perceived by end users and data
specialists
Physical view – how data are actually organized and
structured on physical storage media
Relational DBMS – represents data as two-dimensional
tables (called relations)
Tuples (rows in a table)
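Key field (field that uniquely identifies each record)
Primary key (unique identifier for every row in a table)
Foreign key (field in one table that refers to the
primary key of another table)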
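A minimal sketch of these terms using Python's built-in sqlite3 module. The supplier and part tables, their columns, and the sample values are assumptions made up for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE supplier (
    supplier_id INTEGER PRIMARY KEY,   -- primary key: unique for every row
    name        TEXT NOT NULL
);
CREATE TABLE part (
    part_id     INTEGER PRIMARY KEY,   -- primary key of the part table
    part_name   TEXT NOT NULL,
    supplier_id INTEGER REFERENCES supplier(supplier_id)  -- foreign key
);
INSERT INTO supplier VALUES (1, 'Acme Supply');
INSERT INTO part VALUES (137, 'Door latch', 1);
""")

# Each row that comes back is one tuple of the relation.
rows = conn.execute(
    "SELECT part_id, part_name, supplier_id FROM part"
).fetchall()
```

The supplier_id column in part is the foreign key: it holds a value that must match a primary key in supplier.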
Relational DBMS
Select (rows that meet stated criteria), Join (combine
related tables), and Project (subset of columns)
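The three operations can be tried out with Python's sqlite3 module. This is a sketch under assumed data: the supplier and part tables and every value in them are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE supplier (supplier_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE part (part_id INTEGER PRIMARY KEY, part_name TEXT, supplier_id INTEGER);
INSERT INTO supplier VALUES (1, 'Acme Supply'), (2, 'Bolt Works');
INSERT INTO part VALUES (137, 'Door latch', 1), (150, 'Door seal', 2),
                        (152, 'Compressor', 1);
""")

# Select: keep only the rows that meet a condition.
selected = conn.execute(
    "SELECT * FROM part WHERE part_id IN (137, 150) ORDER BY part_id"
).fetchall()

# Join: combine the two tables on their common field.
joined = conn.execute("""
    SELECT p.part_id, p.part_name, s.supplier_id, s.name
    FROM part AS p JOIN supplier AS s ON p.supplier_id = s.supplier_id
    ORDER BY p.part_id
""").fetchall()

# Project: keep only some columns of the joined, selected rows.
projected = conn.execute("""
    SELECT p.part_name, s.name
    FROM part AS p JOIN supplier AS s ON p.supplier_id = s.supplier_id
    WHERE p.part_id IN (137, 150)
    ORDER BY p.part_id
""").fetchall()
```

In SQL the three operations usually appear together in one statement: WHERE does the select, JOIN does the join, and the column list after SELECT does the project.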
DBMS
Object-Oriented DBMS (stores data and the procedures
that act on those data as objects)
Databases in the Cloud
DBMS capabilities
Data definition – specifies the structure of the
database content
Data dictionary – stores definitions of data
Query and reporting tools, including SQL
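A rough sketch of the three capabilities using sqlite3. The employee table and its rows are hypothetical, and SQLite's PRAGMA table_info is used here as a stand-in for a data dictionary; a full DBMS typically offers a richer catalog.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Data definition: specify the structure of a table.
conn.execute(
    "CREATE TABLE employee ("
    "emp_id INTEGER PRIMARY KEY, name TEXT NOT NULL, salary REAL)"
)

# Data dictionary (approximation): ask the DBMS for its stored definitions.
# Each PRAGMA table_info row is (cid, name, type, notnull, dflt_value, pk).
dictionary = [
    (col[1], col[2]) for col in conn.execute("PRAGMA table_info(employee)")
]

# Query with SQL.
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)",
    [(1, "Ana", 52000.0), (2, "Ben", 61000.0)],
)
high_paid = conn.execute(
    "SELECT name FROM employee WHERE salary > 60000"
).fetchall()
```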
Database Design
Normalization (streamlining complex groups of data into
small, stable data structures to minimize redundancy)
Referential integrity (rules that keep relationships
between tables consistent)
Entity Relationship (ER) diagram (shows
relationships between the entities in your database)
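Normalization and referential integrity can be illustrated together with sqlite3. This is a sketch, assuming a made-up flat list of order lines that repeats supplier details. Note that SQLite enforces foreign keys only after PRAGMA foreign_keys = ON; other DBMS products behave differently.

```python
import sqlite3

# Unnormalized input: supplier details repeat on every order line.
flat_orders = [
    (1, "Door latch", "Acme Supply", "555-0100"),
    (2, "Door seal", "Acme Supply", "555-0100"),
]

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite-specific: turn enforcement on
conn.executescript("""
CREATE TABLE supplier (
    supplier_id INTEGER PRIMARY KEY,
    name        TEXT UNIQUE,
    phone       TEXT
);
CREATE TABLE order_line (
    order_id    INTEGER PRIMARY KEY,
    part_name   TEXT,
    supplier_id INTEGER NOT NULL REFERENCES supplier(supplier_id)
);
""")

for order_id, part, sup_name, phone in flat_orders:
    # Normalized form: store each supplier once and reuse its key.
    conn.execute(
        "INSERT OR IGNORE INTO supplier (name, phone) VALUES (?, ?)",
        (sup_name, phone),
    )
    sup_id = conn.execute(
        "SELECT supplier_id FROM supplier WHERE name = ?", (sup_name,)
    ).fetchone()[0]
    conn.execute("INSERT INTO order_line VALUES (?, ?, ?)", (order_id, part, sup_id))

supplier_count = conn.execute("SELECT COUNT(*) FROM supplier").fetchone()[0]

# Referential integrity: an order line may not point at a missing supplier.
try:
    conn.execute("INSERT INTO order_line VALUES (3, 'Compressor', 999)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
```

After the loop the supplier appears once instead of twice, and the final insert is refused because supplier 999 does not exist.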
Data Warehouses
Data Warehouse (stores current and historical data drawn
from multiple sources)
Data Mart (subset of a data warehouse; a separate, focused
database for a specific population of users)
Multidimensional Model
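One way to picture a multidimensional model is as facts indexed by several dimensions that can be summed along any of them. The sketch below uses plain Python; the product, region, and quarter dimensions, the figures, and the roll_up helper are all hypothetical.

```python
from collections import defaultdict

# Hypothetical sales facts: (product, region, quarter, units_sold)
facts = [
    ("Nuts", "East", "Q1", 50), ("Nuts", "West", "Q1", 60),
    ("Bolts", "East", "Q1", 70), ("Bolts", "West", "Q1", 40),
    ("Nuts", "East", "Q2", 55), ("Bolts", "West", "Q2", 45),
]

DIMENSIONS = {"product": 0, "region": 1, "quarter": 2}

def roll_up(rows, *dims):
    """Sum units_sold over every dimension not named in `dims`."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[DIMENSIONS[d]] for d in dims)
        totals[key] += row[3]
    return dict(totals)

by_product_region = roll_up(facts, "product", "region")  # product x region view
by_region = roll_up(facts, "region")                     # region-only view
```

Choosing different dimensions in roll_up corresponds to looking at a different face of the data cube.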
Tools for Business Intelligence
Online Analytical Processing (OLAP) (supports
multidimensional data analysis)
Data Mining (discovery-driven data analysis)
Associations
Sequences
Classification
Clustering
Forecasting
Predictive analytics (uses data mining techniques and
historical data to predict future outcomes)
Web Mining (discovery of patterns from the World Wide
Web) – example: Google Analytics
Text Mining (extracts key elements from large
unstructured data sets)
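As a small illustration of the "associations" idea from data mining, the sketch below counts which items appear together in the same purchase. The baskets are invented, and this is a simplified co-occurrence count, not a full association-rule algorithm.

```python
from collections import Counter
from itertools import combinations

# Hypothetical market baskets.
baskets = [
    {"chips", "cola"},
    {"chips", "cola", "salsa"},
    {"chips", "salsa"},
    {"cola", "bread"},
]

# Count how often each pair of items is bought together.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support: share of all baskets that contain both items of a pair.
support = {pair: n / len(baskets) for pair, n in pair_counts.items()}
```

Pairs with high support are candidates for an association such as "customers who buy chips often buy cola".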
Database Server
Database server (dedicated computer where the database
and DBMS reside)
Other
Information Policy
Data administration
Data governance
Database administrator
Data Quality
Data quality audit
Data cleansing (scrubbing)
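A data quality audit and a cleansing pass might look roughly like this in Python. The customer rows, the cleansing rules, and the cleanse and audit helpers are assumptions chosen for illustration; real tools apply far more rules.

```python
# Hypothetical raw customer data with formatting problems and a duplicate.
raw = [
    {"cust_id": 1, "name": " ana ruiz ", "state": "ny"},
    {"cust_id": 1, "name": "Ana Ruiz", "state": "NY"},  # duplicate of the first row
    {"cust_id": 2, "name": "Ben Cho", "state": ""},     # missing state
]

def cleanse(rows):
    """Normalize formatting and drop duplicate cust_id values (first one wins)."""
    seen, clean = set(), []
    for row in rows:
        fixed = {
            "cust_id": row["cust_id"],
            "name": " ".join(row["name"].split()).title(),
            "state": row["state"].strip().upper() or None,  # empty becomes None
        }
        if fixed["cust_id"] not in seen:
            seen.add(fixed["cust_id"])
            clean.append(fixed)
    return clean

def audit(rows):
    """Count records that have at least one missing (None) field."""
    return sum(1 for row in rows if any(value is None for value in row.values()))

clean = cleanse(raw)
incomplete = audit(clean)
```

Cleansing fixes what can be fixed automatically; the audit then reports what is still incomplete and needs follow-up.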