Competitive Platforms: Enterprise Data Model

Comparing Healthcare Data Warehouse Approaches:
A Deep-dive Evaluation of the Three Major
Methodologies
February 2014
Creative Commons Copyright
A Personal Experience with Healthcare
•
•
Dear mother…
A trip to the doctor…
22
© 2013 Health Catalyst | www.healthcatalyst.com
Healthcare Analytics Goal
Why have an EDW?
●
It is a means to a greater end
●
It exists to improve:
1. The effectiveness of care delivery (and safety)
2. The efficiency of care delivery (e.g. workflow)
3. Reduce Mean Time To Improvement (MTTI)
3
Three Systems of Care Delivery
Analytic
System
Deployment
System
Content
System
4
Creative Commons Copyright
Population Health Management
Mean
# of
Cases
1 box = 100
cases in a year
Excellent Outcomes
# of
Cases
Poor Outcomes
Excellent Outcomes
Poor Outcomes
Focus On Inliers (“Tighten the Curve and Shift It to the Left”)
• Strategy. Identify best practices through research and analytics
and develop guidelines and protocols to reduce inlier variation
• Result. Shifting the cases which lie above the mean (47+%)
toward the excellent end of the spectrum produces a much more
significant impact than focusing on the adverse outlier tail (2.5%)
© 2013 Health Catalyst | www.healthcatalyst.com
5
Healthcare Analytics Adoption Model
Level 8
Personalized Medicine
& Prescriptive Analytics
Tailoring patient care based on population outcomes and
genetic data. Fee-for-quality rewards health maintenance.
Level 7
Clinical Risk Intervention
& Predictive Analytics
Organizational processes for intervention are supported
with predictive risk models. Fee-for-quality includes fixed
per capita payment.
Level 6
Population Health Management
& Suggestive Analytics
Tailoring patient care based upon population metrics. Feefor-quality includes bundled per case payment.
Level 5
Waste & Care Variability Reduction
Reducing variability in care processes. Focusing on
internal optimization and waste reduction.
Level 4
Automated External Reporting
Efficient, consistent production of reports & adaptability to
changing requirements.
Level 3
Automated Internal Reporting
Efficient, consistent production of reports & widespread
availability in the organization.
Level 2
Standardized Vocabulary
& Patient Registries
Relating and organizing the core data content.
Level 1
Enterprise Data Warehouse
Collecting and integrating the core data content.
Level 0
Fragmented Point Solutions
Inefficient, inconsistent versions of the truth. Cumbersome
internal and external reporting.
Polling Question
What level would you to the healthcare analytic
solutions with which you are most familiar?
(levels 1 – 8)
An Analyst’s Time
Too much time spent
hunting for and
gathering data rather
than understanding and
interpreting data
Analyst’s or Clinician's Time
Understanding the need
Hunting for the data
Waste
Gathering or compiling
(including waiting for
IT to run report or query)
Value-add
Interpreting data
Distribution of data
8
© 2013 Health Catalyst | www.healthcatalyst.com
HR – Desired State
Authors
Typical
User
Distribution
Drillers
• Authors or knowledge workers are scarce and in high
demand – few users have both clinical knowledge
AND access to tools and data
• Large backlogs of analytic/report requests exist since
underlying systems are too complex for the average
user (users make analytic requests vs. self-service)
Viewers
• Create more knowledge workers by doing the following:
• Expand data access (audit access vs. control access)
• Simplify data structures (relational vs. dimensional)
• Continue use of naming standards (intuitive vs. cryptic)
• Providing better tools (metadata, ad hoc, etc.)
• Promote shift in culture by rewarding process knowledge discovery
rather than punishing outliers
9
Authors or
Knowledge
Workers
Drillers
Ideal User
Distribution for
Continuous
Improvement
Viewers
© 2013 Health Catalyst | www.healthcatalyst.com
Comparison of prevailing
approaches
© 2013 Health Catalyst | www.healthcatalyst.com
Enterprise Data Model
EDW
FINANCIAL SOURCES
(e.g. EPSi, Lawson,
PeopleSoft)
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Patient
Provider
Bad Debt
Provider
Survey
Encounter
Cost
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
Charge
ENTERPRISE
DATA MODEL
Census
Facility
House
Keeping
Diagnosis
Procedure
Employee
Catha Lab
Time
Keeping
EMR SOURCE
(e.g. Cerner)
More Transformation
Less Transformation
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker)
Enforced Referential Integrity© 2013 Health Catalyst | www.healthcatalyst.com
11
Enterprise Data Model – Still need Subject Area Marts
EDW
FINANCIAL SOURCES
(e.g. EPSi, Lawson,
PeopleSoft)
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Patient
Provider
Bad Debt
Provider
Readmissions
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
Survey
Encounter
Cost
Diabetes
Charge
ENTERPRISE
DATA MODEL
Census
Facility
Sepsis
House
Keeping
Diagnosis
Procedure
Employee
Catha Lab
Time
Keeping
EMR SOURCE
(e.g. Cerner)
More Transformation
Less Transformation
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker)
Enforced Referential Integrity© 2013 Health Catalyst | www.healthcatalyst.com
12
Bill of Materials Conceptual Model
Product
Supplier
Order
Customer
Typical Analyses
• Counts
• Simple aggregations
• By various dimensions
© 2013 Health Catalyst | www.healthcatalyst.com
13
Star Schema Conceptual Model
Dimension 4
Dimension 1
(Location)
(Product)
Typical Analyses
• Transaction counts
• Simple aggregations
• By various dimensions
Fact
(Transaction)
Dimension 3
Dimension 2
(Purchaser)
(Date)
© 2013 Health Catalyst | www.healthcatalyst.com
14
Vertical Summary Data Marts
FINANCIAL SOURCES
(e.g. EPSi, Lawson,
PeopleSoft)
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Regulatory
Labor
Productivity
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
Pregnancy
Dimensional
Data
Model
Revenue Cycle
Oncology
Asthma
Redundant
Data
Extracts
Heart
Failure
Diabetes
Census
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker)
EMR SOURCE
(e.g. Cerner)
More Transformation
Less Transformation
© 2013 Health Catalyst | www.healthcatalyst.com
15
Adaptive Data Warehouse
Metadata: EDW Atlas Security and Auditing
Common, Linkable
Vocabulary
FINANCIAL SOURCES
(e.g. EPSi, Peoplesoft,
Lawson)
Financial
Source Marts
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Departmental
Source Marts
Readmissions
Administrative
Source Marts
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
Diabetes
Patient
Source Marts
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker, Press
Ganey)
Sepsis
EMR
Source Marts
HR
Source Mart
EMR SOURCE
(e.g. Cerner)
Human Resources
(e.g. PeopleSoft)
More Transformation
Less Transformation
© 2013 Health Catalyst
| www.healthcatalyst.com
Classic Star Schema Deficiencies
•
Resolution of many many-to-many relationships
•
Not as much about counts of transactions
•
More about:
•
•
•
Events
States of change over time
Related states (e.g. co-morbidities, attribution)
17
© 2013 Health Catalyst | www.healthcatalyst.com
Sample Diabetes Registry Data Model
Procedure
Code
Diagnosis
Code
Diagnosis
History
Procedure
History
Office Visit
Vital Signs
History
Diabetes
Patient
Exam
History
Current Lab
Result
Lab Result
History
Exam Type
Lab Type
Typical Analyses
•
How many diabetes patients do I have?
•
When was there last HA1C, LDL, Foot
Exam, Eye Exam?
•
What was the value for each instance for
the last 2 years?
•
What are all the medications they are on?
•
How long have they been taking each
medication?
•
What was done at each of their visits for
the last 2 years?
•
Which doctors have seen these patients
and why?
•
List of all admissions and reason for
admission?
•
What co-morbid conditions do these
patient have?
•
Which interventions (diet, exercise,
medications) are having the biggest
impact on LDL, HA1C scores?
© 2013 Health Catalyst | www.healthcatalyst.com
18
Measurement System Exercise
Webinar
19
© 2012
2013 Health Catalyst | www.healthcatalyst.com
The Enterprise Shopping Model
Your Shopping List
Enterprise Shopping Model
Produce
__ Apples
__ Pears
__ Tomatoes
__ Carrots
Dairy
__ Celery
__ Banana
__ Melon
__ Grapes
Meat
__ Beef
__ Ham
__ Chicken
__ Pork
__ Milk
__ Eggs
__ Cheese
__ Cream
__ 2% Milk
__ Half & Half
__ Yogurt
__ Margarine
Dry Goods
__ Turkey
__ Sausage
__ Lamb
__ Bacon
__ Pasta
__ Flour
__ Sugar
__ Soup
__ Baking soda
__ Rice
__ Beans
__ B. Sugar
Apples
Sugar
Tomato Soup Beans
Flour
Hot dogs
Milk
Banana
Turkey
Noodles
Lettuce
Yogurt
Additional purchases
Eggs
Flowers
Tires
Dry cleaning
© 2013 Health Catalyst | www.healthcatalyst.com
Enterprise Data Model (Technology Vendors)
EDW
FINANCIAL SOURCES
(e.g. EPSi, Lawson,
PeopleSoft)
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Patient
Provider
Bad Debt
Provider
Survey
Encounter
Cost
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
ENTERPRISE
DATA MODEL
Charge
Census
Facility
House
Keeping
Diagnosis
Procedure
Employee
Catha Lab
Time
Keeping
EMR SOURCE
(e.g. Cerner)
More Transformation
Less Transformation
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker)
Enforced Referential Integrity© 2013 Health Catalyst | www.healthcatalyst.com
21
Using a dimensional model in Healthcare
is kind of like shopping for data like this …
22
© 2013 Health Catalyst | www.healthcatalyst.com
23
© 2013 Health Catalyst | www.healthcatalyst.com
The Dimensional Shopping Model
Trip #1 to the Store
Trip #2 to the Store
Dimensional Shopping Model - Cookies
Dairy
__ 4 eggs
__ 2 c shortening
Dimensional Shopping Model - Cake
Dry Goods
Dairy
Dry Goods
__ ½ cup of butter
__ ½ cup milk
__ 2 eggs
__ 1 c sugar
__ 2 c brown sugar
__ 2 t baking soda
__ 2 t vanilla
__ 1 t salt
__ 4-5 c all-purpose flour
__ 4 cups chocolate chips
__ 1 cup white sugar
__ 1 ½ cups all-purpose flour
__ 2 teaspoons vanilla extract
__ 1 ¾ teaspoon baking powder
How many recipes to do you need to make?
24
© 2013 Health Catalyst | www.healthcatalyst.com
Dimensional Data Model (Healthcare Point Solutions)
FINANCIAL SOURCES
(e.g. EPSi, Lawson,
PeopleSoft)
DEPARTMENTAL
SOURCES
(e.g. Apollo)
Regulatory
Labor
Productivity
ADMINISTRATIVE
SOURCES
(e.g. API Time Tracking)
Pregnancy
Revenue Cycle
Oncology
Heart
Failure
Asthma
Redundant
Data
Extracts
Diabetes
Census
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker)
EMR SOURCE
(e.g. Cerner)
More Transformation
Dimensional
Data
Model
Less Transformation
© 2013 Health Catalyst | www.healthcatalyst.com
25
The Adaptive Shopping Model
Adaptive Shopping Model
Additional
Apples
Tomato Soup
Flour
Milk
Turkey
Lettuce
Sugar
Beans
Hot dogs
Banana
Noodles
Yogurt
Get eggs
Buy flowers
Get tires rotated
Pick up dry cleaning
•
•
Store: _____________________________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
Initial List
•
•
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
__ ______________
•
•
•
•
•
•
•
•
And Even More
Buy a Christmas tree
Baking Powder
Baking Soda
Buy a new couch
Get oil change
Chocolate Chips
Buy paint and painting supplies
Buy yarn and knitting supplies
Vanilla extract
Buy a set of pots and pans
•
•
•
•
•
•
•
•
•
•
26
© 2013 Health Catalyst | www.healthcatalyst.com
Shopping List Revisited
Initial List
Additional
Apples
Tomato Soup
Flour
Milk
Turkey
Lettuce
Sugar
Beans
Hot dogs
Banana
Noodles
Yogurt
Get eggs
Buy flowers
Get tires rotated
Pick up dry cleaning
•
•
•
•
Once you are home can
you make these recipes?
•
•
•
•
•
•
•
•
And Even More
Buy a Christmas tree
Baking Powder
Baking Soda
Buy a new couch
Get oil change
Chocolate Chips
Buy paint and painting supplies
Buy yarn and knitting supplies
Vanilla extract
Buy a set of pots and pans
•
•
•
•
•
•
•
•
•
•
27
Cake:
1 cup white sugar
1 ½ cups all-purpose flour
2 teaspoons vanilla extract
1 ¾ teaspoon baking powder
½ cup of butter
½ cup milk
Cookies:
2 eggs
1 cup (2 sticks) butter, softened
2 large eggs
3/4 cup white sugar
2 1/4 cups all-purpose flour
1 teaspoon vanilla extract
1 teaspoon salt
1 teaspoon baking soda
2 cups chocolate chips
© 2013 Health Catalyst | www.healthcatalyst.com
Adaptive Data Warehouse
Metadata: EDW Atlas Security and Auditing
Common, Linkable
Vocabulary
FINANCIAL SOURCES
(e.g. EPSi, Peoplesoft, Lawson)
Financial
Source Marts
DEPARTMENTAL SOURCES
(e.g. Apollo)
Departmental
Source Marts
Readmissions
Administrative
Source Marts
Diabetes
Patient
Source Marts
ADMINISTRATIVE SOURCES
(e.g. API Time Tracking)
PATIENT SATISFACTION
SOURCES
(e.g. NRC Picker, Press Ganey)
Sepsis
EMR
Source Marts
HR
Source Mart
EMR SOURCE
(e.g. Cerner)
Human Resources
(e.g. PeopleSoft)
More Transformation
Less Transformation
© 2013 Health Catalyst
| www.healthcatalyst.com
Late-binding Deeper Dive
29
© 2012
2013 Health Catalyst | www.healthcatalyst.com
Data Modeling Approaches
Corporate Information Model
Early Binding
Popularized by Bill Inmon and Claudia Imhoff
I2B2
Popularized by Academic Medicine
Star Schema
Popularized by Ralph Kimball
Data Bus
Popularized by Dale Sanders
File Structure Association
Popularized by IBM mainframes in 1960s
Reappearing in Hadoop & NoSQL
Late Binding
© 2013 Health Catalyst | www.healthcatalyst.com
30
Origins of Early vs Late Binding
•
Early days of software engineering
● Tightly coupled code, early binding of software at compile
●
●
●
●
time
Hundreds of thousands of lines of code in one module,
thousands of function points
Single compile, all functions linked at compile time
If one thing breaks, all things break
Little or no flexibility and agility of the software to
accommodate new use cases
© 2013 Health Catalyst | www.healthcatalyst.com
31
Origins of Early vs Late Binding
•
1980s: Object Oriented Programming
● Alan Kay, Universities of Colorado & Utah, Xerox/PARC
● Small objects of code, reflecting the real world
● Compiled individually, linked at runtime, only as needed
● Agility and adaptability to address new use cases
•
Steve Jobs: NeXT Computing
● Commercial, large-scale adoption of Kay’s concepts
● Late binding – or as late as practical – becomes the norm
● Maybe Jobs’ largest contribution to computer science
© 2013 Health Catalyst | www.healthcatalyst.com
32
Data Binding in Analytics
● Atomic data can be “bound” to business rules about that
data and to vocabularies related to that data
● Vocabulary binding in healthcare
–
–
–
–
Unique patient and provider identifiers
Standard facility, department, and revenue center codes
Standard definitions for sex, race, ethnicity
ICD, CPT, SNOMED, LOINC, RxNorm, RADLEX, etc.
● Binding data to business rules
–
–
–
–
–
–
Length of stay
Patient attribution to a provider
Revenue and expense allocation and projections to a department
Data definitions of general disease states and patient registries
Patient exclusion criteria from population management
Patient admission/discharge/transfer rules
© 2013 Health Catalyst | www.healthcatalyst.com
33
Analytic Relations
The key is to relate data, not model data
Core Data Elements
Charge Code
CPT Code
Date & Time
DRG code
Drug code
Employee ID
Employer ID
Encounter ID
Sex
Diagnosis Code
Procedure Code
Department ID
Facility ID
Lab code
Patient type
Patient / member ID
Payer / carrier ID
Postal code
Provider ID
High Value Attributes
About 20 data attributes account for
90% of healthcare analytic use cases
Vocab in
Source
System 1
Vocab in
Source
System 2
Vocab in
Source
System 3
Highest value area
for standardizing
vocabulary
© 2013 Health Catalyst | www.healthcatalyst.com
34
Six Points to Bind Data
Data Analysis
Internal
Source Data
Content
Source System
Analytics
Clinical
Clinical
Financial
Financial
Supplies
Supplies
HR
HR
Others
Others
Customized Data
Marts
Visualization
Disease Registries
QlikView, Tableau
Materials Management
Microsoft Access
Compliance Measures
Web Applications
Clinical Events
Excel
Operational Events
SAS, SPSS
Research Registries
et al.
5
6
External
State
Academic
State
Academic
1
2
3
4
Business Rule and Vocabulary Binding Points
Low volatility = Early binding
High volatility = Late binding
© 2013 Health Catalyst | www.healthcatalyst.com
35
Binding Principles & Strategy
1. Delay Binding as long as possible…until a clear analytic
use case requires it
2. Earlier binding is appropriate for business rules or
vocabularies that change infrequently or that the
organization wants to “lock down” for consistent
analytics
3. Late binding in the visualization layer is appropriate for
“what if” scenario analysis
4. Retain a record of the bindings from the source system
in the data warehouse
5. Retain a record of the changes to vocabulary and rules
bindings in the data models of the data warehouse
© 2013 Health Catalyst | www.healthcatalyst.com
36
Thank you!
37
© 2012
2013 Health Catalyst | www.healthcatalyst.com