Foundations of Business Intelligence: Databases and Information Management Terms Bits (smallest unit of data a computer can handle) Bytes (8 bits; each represents a single character – letter, number, or symbol) Field (group of words or complete number) Record (group of related fields) File (group of records of same type) Database (group of related files) Entity (person, place, thing, or event about which we store and maintain info) Attribute (characteristic or quality describing an entity) Traditional File Environment Data redundancy and inconsistency Program-data dependence Lack of flexibility Poor security Lack of data sharing and availability Database Management Systems (DBMS) DBMS – Access, Oracle, DB2 examples/software Logical view – data as perceived by end users and data specialists Physical view – where data stored and structured Relational DBMS – represent data as two dimensional tables (called relations) Tuples (rows in a table) Key field Primary Key Foreign key Relational DBMS Select, Join, and Project DBMS Object-Oriented (stores data and procedures as objects) Databases in the Cloud DBMS capabilities Data definition – specify structure Data dictionary – stores definitions of data Query and reporting tools, including SQL Database Design Normalization (smallest form of data structures) Database Design (cont.) Referential integrity (rules; consistency in relationships between tables) Entity Relationship (ER) diagram (show relationships between the entities in your database) Data Warehouses Data Warehouse (stores current and historical data; from multiple sources) Data Mart (subset; separate database for different population) Multidimensional Model Tools for Business Intelligence Online Analytical Processing (OLAP) (supports multidimensional data analysis) Data Mining (discovery driven data analysis) Associations Sequences Classification Clustering Forecasting Predictive analytics (uses data mining techniques; predict future outcomes) Web Mining (patterns from WWW) – example Google Analytics Text Mining (extract elements from unstructured data sets) Database Server Database server (where database resides) Other Information Policy Data administration Data governance Database administrator Data Quality Data quality audit Data cleansing (scrubbing)