Introductie Data Vault

advertisement
Introduction Data Vault
Historical development Business Intelligence
•
•
•
1950
1960
1970
•
1980
•
•
•
1990
2000
2010
Turing : First computers
Codd : 3NF
Management Information Systems (MIS)
Executive Information Systems (EIS)
Kimball : Dimensional Modeling Kimball
Inmon : Corporate Information Factory
Enterprise Datawarehousing
Linstedt : Datavault
Ronstad : Anchor Modeling
Challenges Classical Datawarehouse
• Time-to-Build
– Complexity, High Failure Rates
• Lack of Agility
– Expensive and Extensive re-engineering required to adapt
• No auditability
– Lack of tracebility, accountability and compliance
• Departmental scope
– No enterprise view, inability to effectively integrate disparate
systems
• Duplicate efforts and Spread Marts
Dan Lindstedt - Founder
Data Vault
•
”The Data Vault is a detail oriented, historical tracking
and uniquely linked set of normalized tables that
support one or more functional areas of business. It is
a hybrid approach encompassing the best of breed
between 3rd normal form (3NF) and star schema. The
design is flexible, scalable, consistent and adaptable
to the needs of the enterprise..”
Dan Linstedt
•
“The Data Vault is a data modelling approach and
methodology that is specifically tuned to optimize your
Enterprise Datawarehouse initiatives.”
Hans Hultgren
The DWH Guru’s opinion
“The Data Vault is the optimal choice for modeling the
Enterprise Datawaerehouse in the DW 2.0 framework.”
Bill Inmon, June 2007
Datavault application in Netherlands
Datavault – 3 Building blocks
• Hub : Identification
– Unique collection of Business Keys
– Business Key: identity of an enterprise entity
•
Link : Relationship
– Unique collection of associations between
two or more Business Keys
– Unit Of Work (grain), transactions and events
• Satellite : Description
– Adds context to a Hub orLink
– Timebound (System Date/Time !)
Sample model
•
Identification
Relation
Description
Datavault modelled Datawarehouse Architecture
Source 1
Star 1
Star 6
Star 2
Star 7
Star 3
Star 8
Source ..
Star 4
Star 9
Source x
Star 5
Star x
Source 2
Data
Vault
EDW
Source 3
Error
Marts
(Operational)
Reports
Business
Data
Vault
Example
Data Vault model
Airline
#IATA Code
Airline
Aircraft
#RegNr
Aircraft Airline
Product
Hierarchy
Product Sale
Aircraft
Sale Flight
Product
Product
#ProductCode
Product
Sale
Sale
#TrxID
Product Shop
Flight
#FlightNr
#ScheduledDate
#ArrDepIndicator
Flight Gate Ramp
Sale
Product Shop
Sale Shop
Shop
#ShopNo
Flight
Aircraft Flight
Sale Shop
Gate / Ramp
#GateRampCode
Lounge
#LoungeCode
Gate / Ramp
Shop Lounge
Shop
Shop Lounge
Lounge
Gate Lounge
Added value of Datavault modelling
• Build Incrementally
– Think Big Start Small
• Scale to Infinity
– Insert only
• Auditability
– Tracebility of the data and it’s history, versioning of data
• Absorb all data all of the time
– Store RAW data, Independent Loads, Lazy Updates, Single Point of
Facts
• Adapts to new sources easily
– No need for re-engineering when new sources need to be integrated
• Need for Operational Business Intelligence
– Real Time Loading (SOA), Real Time Reporting and data maintenance
Build a flexible foundation with us!
DWH / Datavault
implementation
Semantic layer
Implementation
Develop dashboards
& reports
Data Quality
Management
Data Integration
Build an Analytics
capability
Self-service BI
Implementation
Big data POC’s
14
Datavault implementation services & tools
• Datavault Education
– By certified Datavault engineers
• Datavault Implementation consultancy
– Assist you in creating your Datavault model
• Datavault Implementation Tools
– Model driven Datawarehouse ETL code generation
Download