Data Integration - Inside Analysis

advertisement
AFPOA Virtual Vendor Day
Topic: Data Integration
Gregory J. Vaughan – Executive Consultant, WW Military and Defense
Lead, Information Agenda Tiger Team
Information Management
© 2011 IBM Corporation
Information Management
There’s no “easy button” for this…
 Data Integration is a complex
problem
 A myopic view of the problem
frustrates the desired end state
 Scoping the problem too narrowly
reduces the likelihood of success
 Focusing later on data integration
requires a revisit of the problem
scope
 Data integration presents the
greatest risk to IT related business
initiatives
 Data Governance is required, but
frequently overlooked
 The complexities of data
integration requires a
comprehensive solution
2
© 2011 IBM Corporation
Information Management
Solution Architecture – General View
 Info. Integration
 Data Quality
 Info. Services
BI (REPORTS,
DASHBOARDS,
QUERY, OLAP)
UNSTRUCTURE
CONTENT
OPERATIONAL
DATA
PREDICTIVE
ANALYTICS
TEXT
ANALYTICS
USERS
APPLICATIONS
DATA MARTS
DATA
WAREHOUSE
INTERNAL/
EXTERNAL
DATBASES
MASTER
DATA
APPLICATIONS
OPTIMIZATION
OLAP
CUBES
INTERNAL
DATABASES
METADATA
EXTERNAL
DATABASES
3
© 2011 IBM Corporation
Information Management
The IBM Solution: IBM Information Server
Delivering information you can trust
IBM Information Server
Unified Deployment
Discover, model, and
govern information
structure and content
Standardize, merge,
and correct information
Combine and
restructure information
for new uses
Synchronize, virtualize
and move information
for in-line delivery
Unified Metadata Management
4
© 2011 IBM Corporation
Information Management
Align business and IT objectives using single platform that creates
trusted information for use in key initiatives
Sources
Business
Initiatives
Executives
legacy
Business
Analysts
Enterprise
Architects
Data
Analysts &
Architects
Subject Matter
Experts
apps
BI
SAP
dbs
warehouse
Xls., xml,
flat
mdm
warehouse
z/OS
custom
Data
Steward
DBA
Developer
5
System
Architect
ERP System
Manager
© 2011 IBM Corporation
Information Management
Align business and IT objectives using single platform that creates
trusted information for use in key initiatives
Sources
Business
Initiatives
Executives
legacy
Business
Analysts
Enterprise
Architects
Data
Analysts &
Architects
Subject Matter
Experts
apps
BI
SAP
dbs
warehouse
Xls., xml,
flat
mdm
warehouse
z/OS
custom
Data
Steward
DBA
Developer
6
System
Architect
ERP System
Manager
© 2011 IBM Corporation
Information Management
InfoSphere Information Analyzer
Requirements
Information Analyzer
Analyze source data quality and
monitor adherence to integration
and quality rules
 Perform data quality
assessment
 Define business rules to
monitor data quality
 Establish stewards for
governance of data
quality
Benefits
 Identify data quality
issues early to reduce
project risks
 Monitor quality metrics
over time for compliance
 Create business
confidence with trusted
information
7
© 2011 IBM Corporation
Information Management
InfoSphere Business Glossary
Requirements
Create and manage business
vocabulary and relationships and
related to physical sources
Business Glossary
 Capture business terms and
classifications
 Link business terms and
classifications to IT assets
 Identify data stewards and
make glossary accessible
Benefits
 Context for information is
available to everyone,
immediately
 IT projects are aligned with
data governance
 Collaboration increases
across business and IT
8
8
© 2011 IBM Corporation
Information Management
Align business and IT objectives using single platform that creates
trusted information for use in key initiatives
Sources
Business
Initiatives
Executives
legacy
Business
Analysts
Enterprise
Architects
Data
Analysts &
Architects
Subject Matter
Experts
apps
BI
SAP
dbs
warehouse
Xls., xml,
flat
mdm
warehouse
z/OS
custom
Data
Steward
DBA
Developer
9
System
Architect
ERP System
Manager
© 2011 IBM Corporation
Information Management
InfoSphere QualityStage
Requirements
QualityStage
Standardize, cleanse and
deduplicate data, ensuring a
complete, accurate view of
information
 Resolution of data
quality issues
 Standardization of data
formats
 Cleanse data
 Manage duplicate data
 Enable ongoing quality
Benefits
 Removes duplicates
 Cross-references
matching records
 Survives a single,
complete record
 Validate and enriches
data
10
© 2011 IBM Corporation
Information Management
Align business and IT objectives using single platform that creates
trusted information for use in key initiatives
Sources
Business
Initiatives
Executives
legacy
Business
Analysts
Enterprise
Architects
Data
Analysts &
Architects
Subject Matter
Experts
apps
BI
SAP
dbs
warehouse
Xls., xml,
flat
mdm
warehouse
z/OS
custom
Data
Steward
DBA
Developer
11
System
Architect
ERP System
Manager
© 2011 IBM Corporation
Information Management
InfoSphere Metadata Workbench
Requirements
Support information governance with
traceability on data movement,
modeling & BI applications
Metadata Workbench
 Handle Change
Management processes with
measured impact.
 Visualize and trace
information flows across
enterprise landscape
 Access and report on
operational and design
metadata
Benefits
 Deliver enterprise audit
control information.
 Mediate system
disruptions.
 Govern enterprise assets
over time.
12
 Ensure effective
collaboration with line of
business stakeholders.
© 2011 IBM Corporation
Information Management
Align business and IT objectives using single platform that creates
trusted information for use in key initiatives
Sources
Business
Initiatives
Executives
legacy
Business
Analysts
Enterprise
Architects
Data
Analysts &
Architects
Subject Matter
Experts
apps
BI
SAP
dbs
warehouse
Xls., xml,
flat
mdm
warehouse
z/OS
custom
Data
Steward
DBA
Developer
13
System
Architect
ERP System
Manager
© 2011 IBM Corporation
Information Management
InfoSphere Data Architect
Requirements
Model, visualize, and relate diverse
and distributed data assets
Data Architect
 Design and manage
enterprise models
 Enforce model conformance
to enterprise standards
 Leverage industry data
models for best practices
Benefits
 Speed design activities
 Populate Business
Glossary from model terms
 Validate models for
enterprise conformance
14
14
© 2011 IBM Corporation
Information Management
InfoSphere FastTrack
Requirements
Capture Design Specifications and
accelerate translation into data
integration projects
FastTrack
 Capture business
requirements for source to
target mappings
 Leverage source analysis
and business vocabulary
 Generate candidate ETL
jobs
Benefits
 Accelerate development of
integration processes
 Centralized management of
specifications
 Audit design decisions over
time
15
15
© 2011 IBM Corporation
Information Management
IBM InfoSphere Optim Data Masking Solution
Information Governance Core Disciplines
Security and Privacy
Understand &
Define
De-identify sensitive information
with realistic but fictional data for
testing & development purposes
Secure &
Protect
Monitor
& Audit
Requirements
 Protect confidential data
used in test, training &
development systems
 Implement proven data
masking techniques
 Support compliance with
privacy regulations
 Solution supports custom
& packaged ERP
applications
Benefits
JASON MICHAELS
ROBERT SMITH
Personal identifiable information
is masked with realistic but
fictional data for testing &
development purposes.
16
 Protect sensitive
information from misuse
and fraud
 Prevent data breaches and
associated fines
 Achieve better data
governance
© 2011 IBM Corporation
Information Management
IBM InfoSphere Optim Test Data
Management Solution
Information Governance Core Disciplines
Security and Privacy
Understand &
Define
Create “right-size”
production-like environments
for application testing
 Create referentially intact,
“right-sized” test
databases
 Automate test result
comparisons to identify
hidden errors
 Shorten iterative testing
cycles and accelerate time
to market
Subset & Mask
2TB
25 GB
25 GB
Benefits
Development
Unit Test
100 GB
Integration Test
Monitor
& Audit
Requirements
Test Data
Management
Production or
Production Clone
Secure &
Protect
50 GB
Training
InfoSphere Optim TDM supports data on distributed platforms (LUW) and z/OS.
Out-of-the-box subset support for packaged applications ERP/CRM solutions as well as :
 Deploy new functionality
more quickly and with
improved quality
 Easily refresh & maintain
test environments
 Reduce storage and
operational costs
Other
17
© 2011 IBM Corporation
Information Management
Guardium: Full Lifecycle of Database Security & Compliance
18
© 2011 IBM Corporation
Information Management
Best Practices Capabilities & Differentiators
 Single data integration platform with multiple components
 Consistent and repeatable methodology for mitigating risks
 Industry leading Probabilistic Matching Engine for data
standardization jobs
 Native Parallel Processing Engine for scalability
 Shared GUI Interface between major components of the platform
 Centralized repository of critical metadata shared across the
platform
 Data integration enablement in an SOA environment
19
© 2011 IBM Corporation
Information Management
IBM Information Server Federal Customers
•
•
•
•
20
Agency data migrations
Authoritative source
Personnel record consolidation
System synchronization
•
•
•
•
Personnel and recruiting analysis
Procurement system consolidation
Real-time data management
Inventory parts analysis
© 2011 IBM Corporation
Information Management
Questions?
21
© 2011 IBM Corporation
Download