AFPOA Virtual Vendor Day Topic: Data Integration Gregory J. Vaughan – Executive Consultant, WW Military and Defense Lead, Information Agenda Tiger Team Information Management © 2011 IBM Corporation Information Management There’s no “easy button” for this… Data Integration is a complex problem A myopic view of the problem frustrates the desired end state Scoping the problem too narrowly reduces the likelihood of success Focusing later on data integration requires a revisit of the problem scope Data integration presents the greatest risk to IT related business initiatives Data Governance is required, but frequently overlooked The complexities of data integration requires a comprehensive solution 2 © 2011 IBM Corporation Information Management Solution Architecture – General View Info. Integration Data Quality Info. Services BI (REPORTS, DASHBOARDS, QUERY, OLAP) UNSTRUCTURE CONTENT OPERATIONAL DATA PREDICTIVE ANALYTICS TEXT ANALYTICS USERS APPLICATIONS DATA MARTS DATA WAREHOUSE INTERNAL/ EXTERNAL DATBASES MASTER DATA APPLICATIONS OPTIMIZATION OLAP CUBES INTERNAL DATABASES METADATA EXTERNAL DATABASES 3 © 2011 IBM Corporation Information Management The IBM Solution: IBM Information Server Delivering information you can trust IBM Information Server Unified Deployment Discover, model, and govern information structure and content Standardize, merge, and correct information Combine and restructure information for new uses Synchronize, virtualize and move information for in-line delivery Unified Metadata Management 4 © 2011 IBM Corporation Information Management Align business and IT objectives using single platform that creates trusted information for use in key initiatives Sources Business Initiatives Executives legacy Business Analysts Enterprise Architects Data Analysts & Architects Subject Matter Experts apps BI SAP dbs warehouse Xls., xml, flat mdm warehouse z/OS custom Data Steward DBA Developer 5 System Architect ERP System Manager © 2011 IBM Corporation Information Management Align business and IT objectives using single platform that creates trusted information for use in key initiatives Sources Business Initiatives Executives legacy Business Analysts Enterprise Architects Data Analysts & Architects Subject Matter Experts apps BI SAP dbs warehouse Xls., xml, flat mdm warehouse z/OS custom Data Steward DBA Developer 6 System Architect ERP System Manager © 2011 IBM Corporation Information Management InfoSphere Information Analyzer Requirements Information Analyzer Analyze source data quality and monitor adherence to integration and quality rules Perform data quality assessment Define business rules to monitor data quality Establish stewards for governance of data quality Benefits Identify data quality issues early to reduce project risks Monitor quality metrics over time for compliance Create business confidence with trusted information 7 © 2011 IBM Corporation Information Management InfoSphere Business Glossary Requirements Create and manage business vocabulary and relationships and related to physical sources Business Glossary Capture business terms and classifications Link business terms and classifications to IT assets Identify data stewards and make glossary accessible Benefits Context for information is available to everyone, immediately IT projects are aligned with data governance Collaboration increases across business and IT 8 8 © 2011 IBM Corporation Information Management Align business and IT objectives using single platform that creates trusted information for use in key initiatives Sources Business Initiatives Executives legacy Business Analysts Enterprise Architects Data Analysts & Architects Subject Matter Experts apps BI SAP dbs warehouse Xls., xml, flat mdm warehouse z/OS custom Data Steward DBA Developer 9 System Architect ERP System Manager © 2011 IBM Corporation Information Management InfoSphere QualityStage Requirements QualityStage Standardize, cleanse and deduplicate data, ensuring a complete, accurate view of information Resolution of data quality issues Standardization of data formats Cleanse data Manage duplicate data Enable ongoing quality Benefits Removes duplicates Cross-references matching records Survives a single, complete record Validate and enriches data 10 © 2011 IBM Corporation Information Management Align business and IT objectives using single platform that creates trusted information for use in key initiatives Sources Business Initiatives Executives legacy Business Analysts Enterprise Architects Data Analysts & Architects Subject Matter Experts apps BI SAP dbs warehouse Xls., xml, flat mdm warehouse z/OS custom Data Steward DBA Developer 11 System Architect ERP System Manager © 2011 IBM Corporation Information Management InfoSphere Metadata Workbench Requirements Support information governance with traceability on data movement, modeling & BI applications Metadata Workbench Handle Change Management processes with measured impact. Visualize and trace information flows across enterprise landscape Access and report on operational and design metadata Benefits Deliver enterprise audit control information. Mediate system disruptions. Govern enterprise assets over time. 12 Ensure effective collaboration with line of business stakeholders. © 2011 IBM Corporation Information Management Align business and IT objectives using single platform that creates trusted information for use in key initiatives Sources Business Initiatives Executives legacy Business Analysts Enterprise Architects Data Analysts & Architects Subject Matter Experts apps BI SAP dbs warehouse Xls., xml, flat mdm warehouse z/OS custom Data Steward DBA Developer 13 System Architect ERP System Manager © 2011 IBM Corporation Information Management InfoSphere Data Architect Requirements Model, visualize, and relate diverse and distributed data assets Data Architect Design and manage enterprise models Enforce model conformance to enterprise standards Leverage industry data models for best practices Benefits Speed design activities Populate Business Glossary from model terms Validate models for enterprise conformance 14 14 © 2011 IBM Corporation Information Management InfoSphere FastTrack Requirements Capture Design Specifications and accelerate translation into data integration projects FastTrack Capture business requirements for source to target mappings Leverage source analysis and business vocabulary Generate candidate ETL jobs Benefits Accelerate development of integration processes Centralized management of specifications Audit design decisions over time 15 15 © 2011 IBM Corporation Information Management IBM InfoSphere Optim Data Masking Solution Information Governance Core Disciplines Security and Privacy Understand & Define De-identify sensitive information with realistic but fictional data for testing & development purposes Secure & Protect Monitor & Audit Requirements Protect confidential data used in test, training & development systems Implement proven data masking techniques Support compliance with privacy regulations Solution supports custom & packaged ERP applications Benefits JASON MICHAELS ROBERT SMITH Personal identifiable information is masked with realistic but fictional data for testing & development purposes. 16 Protect sensitive information from misuse and fraud Prevent data breaches and associated fines Achieve better data governance © 2011 IBM Corporation Information Management IBM InfoSphere Optim Test Data Management Solution Information Governance Core Disciplines Security and Privacy Understand & Define Create “right-size” production-like environments for application testing Create referentially intact, “right-sized” test databases Automate test result comparisons to identify hidden errors Shorten iterative testing cycles and accelerate time to market Subset & Mask 2TB 25 GB 25 GB Benefits Development Unit Test 100 GB Integration Test Monitor & Audit Requirements Test Data Management Production or Production Clone Secure & Protect 50 GB Training InfoSphere Optim TDM supports data on distributed platforms (LUW) and z/OS. Out-of-the-box subset support for packaged applications ERP/CRM solutions as well as : Deploy new functionality more quickly and with improved quality Easily refresh & maintain test environments Reduce storage and operational costs Other 17 © 2011 IBM Corporation Information Management Guardium: Full Lifecycle of Database Security & Compliance 18 © 2011 IBM Corporation Information Management Best Practices Capabilities & Differentiators Single data integration platform with multiple components Consistent and repeatable methodology for mitigating risks Industry leading Probabilistic Matching Engine for data standardization jobs Native Parallel Processing Engine for scalability Shared GUI Interface between major components of the platform Centralized repository of critical metadata shared across the platform Data integration enablement in an SOA environment 19 © 2011 IBM Corporation Information Management IBM Information Server Federal Customers • • • • 20 Agency data migrations Authoritative source Personnel record consolidation System synchronization • • • • Personnel and recruiting analysis Procurement system consolidation Real-time data management Inventory parts analysis © 2011 IBM Corporation Information Management Questions? 21 © 2011 IBM Corporation