Information Server: Upgrade Workshop Maj 2010 1 © 2010 IBM Corporation Upgrade Workshop What Is New In InfoSphere DataStage Developer Usability Functional Stages Administrator Features Operations & Runtime ● ● ● ● Three releases of DataStage since the 7.5.x series Each release has introduced new features targeted specifically for the data integration user Enhancements extend across a variety of product areas (as shown to the left) The following slides will: introduce each category list some specific features in each set dive more deeply on a few Connectivity 2 2 © 2010 IBM Corporation Upgrade Workshop Developer Usability Developer Usability Functional Stages Administrator Features Operations & Runtime Connectivity 3 3 Features that simplify tasks within the user interface tasks for both basic and advanced use cases ● Designer Performance ● Function Expansion ● Designer Graphical & Function Upgrades ● Multiple User Environment – Locking/Read Only ● Find/Search for Objects in the Repository ● Graphical Impact Analysis ● Job, Table or Routine Difference ● Job Deployment with Information Server Manager ● Balanced Optimization – leverage the power of the DBMS ! ● DSX Export Improvements ● Globalization – UI and messages translated to 9 different languages © 2010 IBM Corporation Upgrade Workshop Designer – New Repository Tree New tree view for display of Repository contents – New folder model – In-place “Find” – Expandable view New folder model – Replaces previous Category model – No restrictions on where objects live in the folder structure • • 4 Jobs can live in the same folder as Table Definitions, Routines, Transforms etc. Allows user to configure Repository content in the way that suits their application e.g. task based vs class based structure © 2010 IBM Corporation Upgrade Workshop Quick Find - Basic ● Find item in Repository tree – In-place find – Find by Name (Full or Partial) – Wild card support – Find next… – Filter on type 5 © 2010 IBM Corporation Upgrade Workshop Impact Analysis – Graphical View Impact Analysis: - Find dependencies …What does this item depend on? - Find where used …Where is this item used? Results shown using the Advanced Find window 6 © 2010 IBM Corporation Upgrade Workshop Job Difference – Integrated report Difference report displayed in Designer - jobs opened automatically from report hot links Options available to: - Print report - Simple “Find” in report - Launch external diff tool for more indepth diff of textual properties, e.g. Routine source 7 © 2010 IBM Corporation Upgrade Workshop General Enhancements Design Time Performance – Significant Performance improvement in Job Open, Save, Compile etc. Function Expansion – New Utility, String and Date/Time functions eg: IsValidTime, NthWeekdayFromDate, DecimalToTimestamp 8 Parallel Only © 2010 IBM Corporation 8 Upgrade Workshop Functional Stages Developer Usability Functional Stages Administrator Features Operations & Runtime New stage types that introduce new off the shelf data integration functions or expand existing ones ● New XML Stage ● Vertical Pivot ● Transformer Enhancements ● Slowly Changing Dimension ● Enhanced Surrogate Key Stage ● Multi Format Flat File Support ● Range Lookup ● Horizontal Parallel Pivot Stage ● Checksum Stage Connectivity 9 9 © 2010 IBM Corporation Upgrade Workshop XML Pack Improved environment for mapping complex XML documents from one schema to another in single or multiple steps Support for Schemas (XSD XML Schema 1.0, WSDL 1.1 ) Deliver Support for complex XML Transformations with need for shredding the XML – Hierarchical Join, Relational join, Filter, Switch, Sort, Union, Regroup, RowToColumns, ColumnsToRows, Aggregate, Distinct Support for multiple input and output links, including reference, reject Shall support partitioning, multi threaded and stream processing of large XML documents Performance and volume improvements – reduced memory requirements, increased throughput – Remove restrictions on document size 10 10 © 2010 IBM Corporation Upgrade Workshop Customer XML Job – pre 8.5 vs 8.5 Ran in 11% of the time without introducing any of the new parallelization features 11 11 © 2010 IBM Corporation Upgrade Workshop Parallel Only Vertical Pivot • Enhanced Pivot stage to support - Vertical Pivoting • mapping multiple input rows with a common key, to a single output row containing multiple columns • Coves three basic requirements: key based groups, columnar pivot and aggregate functions 12 12 © 2010 IBM Corporation Upgrade Workshop Parallel Only Transformer Enhancements • Deliver Looping in the Transformer • Allow multiple output rows to be produced from a single input row • Support for End of Data Flag to support Key Break Logic • New Input Cache • • SaveInputRecord() • GetSavedInputRecord() New System variables, function • @ITERATION, Loop Count • LastRow(), End of data flag for last row • LastRowInGroup(InputColumn), automate change detection • Stage, Loop Variables support for Nullability • More options for Null Handling 13 13 © 2010 IBM Corporation Upgrade Workshop Parallel Only SCD Stage Functionality 14 A single stage that encapsulates all of the functionality required to target a star schema model “Wizard”-style features that enable the user to enter only the minimum amount of information needed Support for columns of SCD Types 1 and 2 in the same dimension table Initial and incremental population of fact and dimension tables Support for surrogate key management across job runs © 2010 IBM Corporation Upgrade Workshop Functional Stages Developer Usability Functional Stages Administrator Features Operations & Runtime Simplify and expand the functionality available for administrators of the tool ● New Suite Installer ● Increased Availability ● Information Server Manager ● Source Code Control System Integration ● Audit Logging ● Web Logging Console ● New Administration and Super Operator Roles ● Support Assistant Connectivity 15 15 © 2010 IBM Corporation Upgrade Workshop New Suite Installer Web enabled (no XWindows configuation requirement) Prerequisite checker confirms system meets basic requirements as well as selected installation options Installer supports resuming an installation that failed for unexpected reasons Streamlined feature selection including adding additional Information Server products and additional tiers to the system Trust based licensing is now default Install utility also handles patches/updates including unique Patch Merging Installation. 16 © 2010 IBM Corporation Upgrade Workshop Increased Availability 17 Delivering Horizontal and Vertical scaling / load balancing of the Domain and DB Tiers Delivering higher levels of Availability for Production and Development environments Cluster support for Application Server Tier – WAS 6.1 and 7.0 ND Cluster support for Repository Tier – DB2 HADR / Cluster, Oracle RAC Improved Failover support for Engine Tier © 2010 IBM Corporation Upgrade Workshop What is the Information Server Manager? New Information Server application Manages DataStage & QualityStage objects’ lifecycles Move objects from development environments to test and production environments Handle updates to previously deployed objects New import/export capabilities GUI available on windows platforms (client or server) Command line interface available on all client and server platforms 18 Is a replacement for, NOT a reimplementation of, the pre-8.0 DataStage Version Control application © 2010 IBM Corporation Upgrade Workshop Source Code Control System Integration Leveraging the Eclipse Platform for Team Development Integration with codemanagement (SCCS) providers supported through Eclipse Plugins Support for ClearCase, CVS – Other providers configurable via Eclipse Team Plugins Functions to interact with the SCCS invoked from the Information Server Manager 19 19 © 2010 IBM Corporation Upgrade Workshop ISA Lite - Support Assistant 1. System Requirements Installation prerequisites checker (general IS and SAP) 2. Data Collection General IS data files (passive) and specific component collections incl SAP. Option to collect IS repository records. 3. Diagnostics (Health) General IS checks as well as component specific checks. 4. Utilities DS project cleanup tool 20 © 2010 IBM Corporation Upgrade Workshop Audit Logging Deliver an audit log of security-related events. Delivers SOX and Security Compliance The following groups of audit events are logged – User and group management creation and removal of users and groups, user group membership changes, and user credential changes – User, group, and project security role assignments creation or deletion of a security role, assignment and removal of security roles to users or groups, and assignment or removal of users or groups and roles to a project. – Engine credential mapping assignment and removal of credentials to IBM InfoSphere DataStage® suite users and assignment of default credentials for an IBM InfoSphere Information Server engine when mapping credentials using the Engine Credentials panel of the IBM InfoSphere Information Server Web console. – User session management user login and logout, direct session termination, and session expiration – Audit configuration auditing properties file location, audit file configuration settings, and audit event settings. 21 21 © 2010 IBM Corporation Upgrade Workshop Operations & Run Time Developer Usability Functional Stages Administrator Features Operations & Runtime Increased support for the management and performance of the run time environment ● New Platform Support ● Run-time Performance Improvements ● Job Parameter Sets ● Runtime Optimizations ● Machine Resource Estimation ● Job Performance Data Analysis ● Serviceability Tools ● Documentation ● Server-side import/export (via new istool command line utility) Connectivity 22 22 © 2010 IBM Corporation Upgrade Workshop Platform Support Red Hat Enterprise Linux 5,6 (64 bit) SUSE Linux Enterprise Server 9,10 (64 bit) Windows Server 2008 64bit (32-bit app) AIX 5.3, 6.1 (64 bit) Solaris 9,10 (64 bit) HP-UX Itanium (64 bit) Red Hat Enterprise Linux for System Z (64 bit) SUSE Linux Enterprise Server for System Z (64 bit) Windows Server 2003 (32-bit) Red Hat Enterprise Linux 5, 6 (as 32-bit app) SUSE Linux Enterprise Server 9, 10 (as 32-bit app) Clients – Windows XP, Vista and 7 (32 & 64 bit) Repository – DB2 9/5, 9.7, Oracle 10g, 11g, SQL Server 2005, 2008 23 23 © 2010 IBM Corporation Upgrade Workshop Parameter Sets • Job Parameter Sets • New object in repository that contains the names and values of job parameters. • A Job Parameter set can be referenced by one or more jobs enabling easier deployment of jobs across machines and also enabling easy propagation of a changed job parameter value 24 © 2010 IBM Corporation Upgrade Workshop Parallel Only Run Time Optimizations ● Buffer Optimization – Improved buffer placement algorithm – ● Improved Job Startup Time – ● 25 E.g., Removed unnecessary buffer before parallel sort in some instances Startup time improvements allow efficient use of EE against smaller data sets Adaptive Job Monitoring – The primary function of the Adaptive Job Monitoring feature is to detect when CPU utilization by the conductor reaches 80% and throttle the volume of jobmon data by sending control messages to the players to reduce the output rate – When 80% CPU utilization by the conductor is reached, a warning message will be issued to the user – Note: only monitor messages will be throttled, metadata and summary messages are not affected © 2010 IBM Corporation Parallel Only Upgrade Workshop Machine Resource Estimation ● Three modes: – Estimate – Re-Estimate ● – Run Estimate Mode – For a given job, provides estimates for the disk space required and CPU utilization. – Two models: • • 26 Static – provides disk space estimates based on schema and job design Dynamic – provides calculated estimates by node based on a run of the job against a sample of the data © 2010 IBM Corporation Upgrade Workshop Parallel Only Job Performance Data Analysis • Provides a graphical display of job performance and utilization based on a job run • The type of data includes CPU time, system time, elapsed time, memory (heap) size, processed number of records • The data is presented as sub-phases and phases for each of the stage/operator execution in the job • Can view all nodes in the job or specific nodes in the job 27 © 2010 IBM Corporation Upgrade Workshop Connectivity Developer Usability Functional Stages Administrator Features Operations & Runtime Connectivity 28 28 Maximize the reach across the organization to easily access various types of data ● New Connectors (available on Server canvas in 8.5) ● Local transaction support ● z/OS File Stage ● Distributed Transaction Stage ● CDC integration through DTS ● Connection Objects - Meta Data Reuse ● SQL Builder Enhancements ● Netezza Enterprise Stage ● iWay Enterprise Stage ● WebSphere II Federation ● Stored Procedure Plug-in Support (SQL Server & Teradata added) © 2010 IBM Corporation Upgrade Workshop Common Connectors ● One component to access data from same source/target – ● ● Combines & extends features of existing connectivity stages Ease of Use - improved and consistent interfaces ● Extended Functionality ● Better Performance ● DBMS Version Management ● 29 Supports DataStage, QualityStage, & Information Analyzer Common Connectors can co-exist. Supersede DS SE plug-ins and DS EE Operators, yet, they can co-exist. Where IBM is investing and adding new capabilities. Connectors DB2 ODBC WebSphere MQ Oracle Teradata DTS – Distributed Transaction Stage (XA) Migration Tool to convert existing jobs to use new Connectors © 2010 IBM Corporation Upgrade Workshop Connection Migration Tool 30 30 Modifies jobs that use legacy plugin/operator stages to use newer Connectors Migrates all compatible stages GUI and command line (batch) modes Server and Parallel jobs Backup, clone or replace jobs Jobs are annotated with information about the migration © 2010 IBM Corporation Parallel Only Upgrade Workshop Distributed Transaction Stage The Distributed Transaction Stage (DTS) utilizes the WebSphere MQ Transaction Manager to enable distributed two-phase transactions across multiple resources. It works in collaboration with the WebSphere MQ Connector to move source messages to database targets. Currently supports MQ (Source), DB2. Adding support for MQ (Target), Teradata, Oracle and ODBC Source Queue Target Database Source data arrives on MQ queue from some external application 31 31 Business logic transforms the data to construct target actions DTS updates target and deletes message from MQ source queue. © 2010 IBM Corporation Upgrade Workshop Multiple Input Link Support for Connectors ● Support Local Transaction Grouping – Insert/Update/Delete – Commit all rows, on all links, or fail / roll back 32 ● Both batch and real-time ● Support for SQL Error code and reject links by link © 2010 IBM Corporation Upgrade Workshop Parallel Only z/OS File stage (already enabled for V8.1) • • • 33 33 New Native Support for Accessing Mainframe files from Distributed platforms and Linux for System Z – Through a new stage called z/OS File stage – VSAM files - KSDS, ESDS, RRDS – Sequential files - QSAM, Sequential read of BDAM/BSAM, PDS members, GDG files. Initial release – Read/Write for Sequential files and read only for VSAM. – Fixed and variable-length records – single or multi record type format files will be supported Leveraging InfoSphere Classic Federation © 2010 IBM Corporation Upgrade Workshop Tack för uppmärksamheten Peter Bjelvert IBM Software Group InfoSphere Services peter.bjelvert@se.ibm.com www.ibm.com/software/data/infosphere 34 © 2010 IBM Corporation