sap-data-services-the-future-of-enterprise-etl

advertisement
SAP Data Services
The Future Of Enterprise ETL ?
Housekeeping
• Venue
• Tea / Coffee
• Comfort Break (maybe)
• Presentations
• Duration
• Questions welcome
Speaker Bio
•
•
•
•
•
•
•
Richard Munn
DBA @ Nationwide Building Society
Experience: 15yrs IT, 11yrs SQLServer
Blog: http://sqlmunkee.blogspot.com
Twitter: @sqlmunkee
Mail: SQLMunkee@gmail.com
Social status: Northener
Presentations
• 1 - Quick comparisons between SSIS and
SDS
• 2 - Overview / Explanation
• 3 – Demonstration (I hope)
• Knowledge of SSIS / ETL terminology is assumed
• Disclaimer
<presentation1>
SSIS vs. SDS
Paradigm Shift ?
The Future Is Yellow
• SAP Data Services (SDS) is an Enterprise
Solution for ETL.
• Platform agnostic
• Repository based ETL development.
• Any object can be shared (I think)
• Supports CheckOut /CheckIn usage
Comparisons
Feature
SSIS
SDS
ETL Operations
Yes
Yes
Sharing
No
Yes
Wide range of standard objects
Yes
No
Custom objects
Yes
Yes
Standalone development
Yes
Yes (but…)
Custom
Built-in
No
Yes
“Push down”
WTF ?
Yes
Script support
Yes
Yes
VB.net / C# (2008)
Python
Try…Catch
Metadata and Lineage
Script Language
Job Execution
• SSIS just requires a command line
(DTExec)
• SDS (even in development) needs:
– A job to contain Workflow / Dataflow
– A job server to execute the job
• More infrastructure is required to provide
service
Restartable Jobs
• Must be hand-crafted in SDS
• Complex logic required with multiple
Workflows, Try…Catch, Parameters,
Variables, etc…
• Built-in with SSIS
Challenges
• Requires dedicated, experienced
developers and support staff (IMHO)
• Does the job in a controlled and ordered
manner.
• Not as “user-friendly” as SSIS.
</presentation1>
<presentation2>
All aboard the big yellow fun bus !
Key Elements
• Repository / Repositories
• Data Services Designer
• Job Server / Server Group
• Management Console
Repositories (1)
• Somewhere to store your objects:
– Projects
– Jobs
– DataSources
– DataFlows
– WorkFlows
– Transforms
More on these later…
Repositories (2)
• 4 types:
– Local - Held on a local DBMS
– ‘Personal’ * - Work in progress
– Central - Items to be used / shared
– Secure - Only Admin access (this can be a problem)
* - not an SAP option, more ‘best practice’
Data Services Designer
• Local installation of Development Environment
for creating the objects to store in the Repository
• …but can be delivered over Citrix or
Softgrid/AppV
• Fairly simple in look and feel, but actually quite
complex
• Would anyone like to take a look ?
Starting The Designer
Jobs (1)
• Anything you want to do is treated as a ‘job’ in
SDS
• Jobs usually contain WorkFlows and DataFlows
(but you can have one without the other, unlike
SSIS)
• 2 types:
– Batch Job
– Real-Time Job
Jobs (2)
• Batch Job
– Commonly single unit-of-work
– Overnight / Off-peak ETL processing
• Real-Time Job
– Creates a Web service
– Typically 3Flow/2Transform data movement
– Simple operations ONLY please
Job Servers
• Actually runs your job (even if you’re
moving data about locally)
• Multiple Job Servers can function as a
“Server Group”
• Server Groups share the submitted jobs
between them, but once a job starts it
can’t move servers (ask me about this)
Management Console
• Single point of administration for…
– Users
– Job Servers (reporting)
– Repositories
• Browser-based, so requires Apache Tomcat
(later versions support other web servers like
IIS)
• Typically you’ll have one DSMC per landscape
unit
I hope there’s a WhiteBoard…
• Shall we draw some pictures of what it
might look like ?
</presentation2>
Comfort Break
If we haven’t already had one..
<presentation3>
Let’s go look see…
</presentation3>
Questions ?
GOTO: Pub()
Download