Session_2_Implementing_HANA - Lenoir

advertisement
Session2: Implementing SAP HANA
Dr. Bjarne Berg
Associate professor Computer Science
Lenoir-Rhyne University
Introduction
This second session is more technical as we explore
what HANA is and how it works for real-time data
warehousing and fast access to big data.
We will look inside HANA, examine its capabilities
and interfaces, and see how to size, plan, and install
HANA based on examples from several real
companies.
You will also see several HANA demos on how to
create tables, how to quickly load data, and how to
execute hyper-fast reports using BusinessObjects
Analysis, WebIntelligence, dashboards, Crystal
Report, BI Self-Service through modules, and BI
Workspaces on HANA.
2
EDW Design Vs. Evolution
An organization has two fundamental choices:
1.
Build a new well architected EDW
2.
Evolve the old EDW or reporting system
Both solutions are feasible, but organizations that
selects an evolutionary approach should be selfaware and monitor undesirable add-ons and
‘workarounds”.
Failure to break with the past can be detrimental to
an EDW’s long-term success…
What is Analytics?
•
Discovery and communication of meaningful patterns in data.
 Applied to business data, to describe, predict, and improve business
performance.
•
Analytic applications often favor data visualization to communicate insight.
•
Intelligent data
 Behavioral data, sensor data, transactional data, market research, …
•
Questions (e.g., marketing):
 Who are my core customers?
 What drives their behavior?
 Can I predict change?
 Can I segment?
 What has led up to this point?
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
5
Looking Inside SAP HANA — In-Memory Computing Engine
Metadata Manager
Authorization
Transaction
Manager
Manager
SQL Script
SQL Parser
Disk Storage
Data Volumes
Log
Relational Engine
-Row Store
-Column Store
Calculation
Engine
Session
Manager
MDX
Volumes
Load Controller
Replication Server
Inside the Computing Engine of SAP HANA, we have many different components that
manage the access and storage of the data. This includes MDX and SQL access, as
well as Load Controller (LC) and the Replication Server.
BusinessObjects Data Services
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
7
A T-Shirt Model for Sizing HANA on BW
A T-shirt model is a quick
way to get some basic ideas
on what a system may look
like.
While very inaccurate for
sizing, it provides basic
information for those just
staring considering SAP
HANA
Data Compress
ion (from)
Working
Memory
Processors
SAS/SSD
(for data)
Replication
Speed (per
hour)
Extra
Large
(XXL)
7000–100,000
GB
3072 GB
12+ Intel
E7 2.4 Ghz
10+ TB
20+ GB
Very
Large
(XL)
3500–7000GB
2048 GB
8+ Intel E7
2.4 Ghz
5 – 10 TB
20+ GB
Large (L)
2000–3500GB
1024GB
4 x Intel
E7 2.4 Ghz
4 - 5 TB
5–20GB
Medium
(M)
1250–
2000GB
512GB
4 x Intel
E7 2.0+
Ghz
2048GB
5–20 GB
Small (S)
500–
1250GB
256GB
2 x Intel
E7 2.0+
Ghz
1024GB
5GB
Extra
Small
(XS)
256–
500GB
128GB
2 x Intel
E7 2.0+
Ghz
1024GB
5GB
to 20+ TB
(multi node)
(multi node)
The number of processors are largely driven by the number of users and usage
patterns. Serious consideration should be made before buying hardware.
A Rule-Of-Thumb Approach to Sizing HANA - Memory
Memory can be estimated by taking the current system size, and running the
programs in ”get_size.zip” in SAP Note 1637145 to get row and column store
sizes for your system.
Memory = 50 GB +
[ (rowstore tables footprint / 1.5) +
(colstore tables footprint * 2 / 4) ] * Existing DB Compression
The 50 GB is for HANA services and caches. The 1.5 is the compression
expected for rowstore tables and the 4 is the compression expected for column
store tables. The 2-factor refers to the space needed for run-time objects and
temporary result sets in HANA. Finally the term ‘existing DB compression’ is to
account for any compression already done in your system (if any).
Remember, these are quick rules-of-thumb, so don’t rely
on it for finalized budgeting and hardware purchases.
A Rule-Of-Thumb Approach to Sizing HANA - Disk
The next item you need is disk space, which can be estimated by the following
Disk for persistence layer = 4 Memory
Disk for the log = 1 Memory
In this example, you need 4 x 710 GB disk for the persistence layer and about
710 GB for the logs. This equals around 3.5TB (don’t worry, disk space of this
size is now almost “cheap”).
The persistence layer is the disk that keeps the system secure and provides for
redundancy if there are any memory failures, so it’s important not to
underestimate this.
Remember, these are quick rules-of-thumb, so don’t rely
on it for finalized budgeting and hardware purchases.
A Rule-Of-Thumb Approach to Sizing HANA - CPU
The CPUs are based on the number of cores that you include. For
example 10 core CPUs now exist (depending on when you bought your
system).
CPU = 0.2 CPU cores per active user
If you have a single node with 4 x 10 cores, you will have 40 cores and
can handle 200 active users on that hardware node, and quite a larger
number of named users.
Remember, these are quick rules-of-thumb, so don’t rely
on it for finalized budgeting and hardware purchases.
SAP QuickSizer tool for HANA
There are three versions of the tool for each versions of SAP HANA.
The Quick Sizer for the
Rapid Deployment Solutions
(RDS) allows you to size for
specific tools.
The second Quick
Sizer version is for
SAP HANA on SAP
NetWeaver BW
The last is for those who want
to use SAP HANA as a
standalone platform for inmemory data (i.e., using SAP
Data Services to load data to).
SAP’s Quick Sizer for SAP HANA, is available at
http://service.sap.com/quicksizer.
An Alternative SAP BW on HANA Sizing Tool
SAP has released a new ABAP based tool that generates a report significantly
better sizing fro SAP BW than using just the QuickSizer above. This program
takes into consideration existing database compression, different table types
and also include the effects of non-active data on the HANA system.
The higher precision
you run the estimate at
the longer the program
is going to run.
With 14 parallel
processors and 8Tb
data warehouse, it is
not unusual to see 4575 minutes run time.
To increase speed,
you can also
suppress analysis
tables with less
than 1 MB size.
SAP BW on HANA Sizing Tool
Since timeouts are common when
running the sizing program, you
can temporarily change the
parameter in rdisp/max_wprun_time
to 0 in BW transaction RZ11.
Finally, you estimate the growth for
the system as a percentage, or as
absolute growth.
After you have downloaded and installed the program, and-selected the parameters
above, you can go to SE38 and run SDF/HANA_BW_SIZING as a background job.
The output is stored in the file you specified and the file can now be email emailed
to hardware vendors for sizing input and hardware selection.
This program is attached to SAP Note: 1736976 on SAP Marketplace
Summary of HANA Sizing Approaches
Approach
Quality of Estimate
T-Shirt Sizing
Rule-of-Thumb
SAP QuickSizer
Sizing for BW program
•
 Sort of ‘OK”
 Better accuracy
 Much better
 Excellent
Effort Required
Very Low
Low
High
Moderate
Work with your preferred vendor before ordering your hardware or
finalizing your budgets.
Doing a correct HANA sizing effort is essential for not
buying a too small, or too big HANA system.
Staffing a HANA Migration Project – Small Team
System Profile
Raw data size:
Complexity:
DataStores:
InfoCubes:
Queries:
2.7 TB
Medium
87
63
409
Duration:
14 weeks
Environments: 4+1
Risk aversion: Medium
Other usage:
IP

This assumes that the test team is dedicated for 3 weeks
during the migration of QA and Prod environments

The test team from the business is already experienced
users of the BW system and need minimal training

HANA Optimization of InfoCubes and DSOs are currently
for SD only for this organization.
This organization is using BWA and will be
retiring it as part of the HANA migration
Staffing a HANA Migration Project – Medium Team
System Profile
Raw data size: 5.6 TB
Complexity:
Medium
DataStores:
439
InfoCubes:
603
Queries:
1,300+
(incl. BOBJ)
Duration:
18 weeks
Environments: 4
Risk aversion: HIGH
Other usage:
None

This assumes testing of core queries in BEx and
WebIntelligence is done by the business

The data reconciliation and process chain testing is
done by dedicated resources in each team.

This team must be staffed with experienced resources.
HANA training for team members and hardware installs
must be in place prior to project start.
Staffing a HANA Migration Project – Very Large Team
System Profile
Raw data size: 38TB
Complexity:
High
DataStores:
1,300+
InfoCubes:
1,720+
Queries:
2,200+
Duration:
5 mos
Environments: 4
Risk aversion: HIGH
Other usage:
APO, IP,
BPC
This assumes minimal
additional functional
optimization
Budgeting a HANA Migration Project - Systems
There are a set of items you need to budget for. From a system
perspective you will need to consider:
•
Hardware quotes
Give at least two vendors your sizing estimate and ask for quotes.
•
Vendor Support
Make sure your hardware vendor include 3-years support in your purchase
•
Upgrades
Plan and budget for any DW upgrades required before going to HANA
Do DW data cleanup, archiving and move non-needed data to Near-Line Storgae,
and then conduct a sizing effort, before requesting a hardware quote.
Hardware Options for HANA
There are currently 7 different
certified HANA hardware vendors
with 13 different products.
Memory
Hardware
128GB
Cisco C260
X
Cisco C460
256GB
X
X
X
X+
X
Dell R910
X
X
X
Hitachi CB 2000
X
X
X
X
X
X
X
NEC Express 5800
Fujitsu RX 600 S5
X
Fujitsu RX 900 S2
HP DL 580 G7
X
X
X
HP DL 980 G7
X+
X+
X
X
X
X+
HP BL 680
X
X
X
IBM x3690 X5
X
X
X
X
X
IBM x3950 X5
1024GB
X
Cisco B440
Some boxes can be used as single nodes
with others are intended for scale-out
solutions for large multi-node systems
512GB
X+
Small Example HW Quotes - Dell
This is example is a quote for a smaller
128 GB Memory Box with 2 x 10 cores is
based on Dell’s R910 platform for a HANA
sidecar usage for less then $40,000
(including tax!)
Most of the smaller HANA systems from
the other vendors are similarly prices and
depends on the number of boxes you
buy, existing discount agreements and
the size of the deals you are requesting.
Expect competitive bids for larger systems and
similar vendor pricing for similar capabilities
Mid-Size Budgeting Example HW Quotes - HP
This example quote is for a
mid-sized 512 GB memory
box with 4 x 10 cores CPUs
and 7 TB disks based on
Hewlett-Packard's high-end
DL-980 Box.
Including all services and
support agreements, this
quote is only $150,000
Certified HANA vendors such as HP, IBM, Dell, Cisco, NEC, Hitachi and
Fujitsu has dedicated staff to help you get a detailed quote in matter of days.
Large Example HW Quotes - Fujitsu
This is example is
a quote for a Large
1 TB Memory Box
for only $105,000
Budgeting a HANA Migration Project - People
Remember to budget for
HANA training for your
employees before the
project starts
Class schedules are
found at: training.sap.com
On average plan for
$3,000 to $6,000 to
train each team
member on average
plus travelling costs.
Budgeting a HANA Migration Project - People
•
Experienced HANA consultants are in very high demand, so
budget $1,600 to $2,300 per day for these resources (US)
•
Testers with BW experience and some HANA training can be
found for more normal consulting rates.
•
Solid hands-on migration experience with the latest
versions is key a HANA migrations project.
When staffing your HANA project, don’t schedule
the start date before you get your staff. You want
the best resources, not whoever is available.
On-Going Support Tasks and Staff Required
Major on-going support tasks consists of:
 User and role maintenance
 Security maintenance
 Backup and disaster recovery
 Load balancing, monitoring and hardware maintenance
 Software patches and notes for HANA, BW and Components
 Cleanup, NLS, Archiving, log deletions
 Transports, table copies, system copies and data copies
 Periodic system upgrades
While most tasks are similar to the old relational database systems, the way
we do this is quite different. Make sure your HANA support staff is onboarded
early and trained before cut-over to production of your migration project.
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
27
The Layered Scalable Architecture (LSA)
The LSA consists logically of:






Acquisition layer
Harmonization/quality layer
Propagation layer
Business transformation layer
Reporting layer
Virtualization layer
Data Design - Real Example of LSA Implementation
This company
implemented a full LSA
Architecture and also
partitioned the
Infoproviders for faster
data loads and faster
query performance.
While this architecture has
benefits, there are
significant issues around
data volumes and the Total
cost of Ownership when
changes are made to the
data model(s)
Data Design Example of LSA Simplification In HANA
Since many of the
benefits sought by the
LSA architecture are
inherent in HANA,
significant simplifications
can be made to the data
design and data flows
This design has a dramatically
smaller cost of ownership (fewer
objects to maintain, design and
change) than the traditional LSAs
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
31
HANA Performance Demo
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
33
HANA Design, Build and Deploy Demo
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
35
Demo: Deploying Analyics with HANA and BOBJ
Session Agenda
•
Introduction
•
The Components of SAP HANA
•
Sizing, planning and installing HANA
•
New EDW Data Design In HANA
•
Demo: Performance dashboard from from real implementation
•
Demo: HANA building tables, data loads & deploying
BusinessObjects
•
Demo: Deploying Analyics with HANA and BOBJ
•
Questions and Answers
37
Questions and Answers
Dr. Berg
Bergb@lr.edu
Moving EDW to SAP HANA – Automated tool 1
SAP has a checklist tool for
SAP NetWeaver BW powered by
HANA (thanks Marc Bernard).
In this tool, SAP provided
automatic check programs for
both the 3.5 version and the 7.x
version of BW. These are found
in SAP Note: 1729988.
In version 2.x of this tool,
hundreds of checks are done
automatically in the BW system.
This includes platform checks
on database and application
There are even basis checks for support packs, ABAP/JAVA
and system information.
stacks, Unicode, BW releases, and add-ons to your system.
39
Moving EDW to SAP HANA – Automated tool 1
The idea of the checklist tool
from SAP is that you run it
several times throughout the
project.
Once before you start, then
periodically as you resolve
issues and upgrade
requirements, and then finally
when the system has been
migrated to HANA.
The checklist tool also has specific checks for the HANA system that can help
you identify any issues before turning over the system to end users..
40
Moving EDW to SAP HANA – Automated tool 2
SAP has released a new ABAP based tool that generates a report significantly
better sizing fro SAP BW than using just the QuickSizer above. This program
takes into consideration existing database compression, different table types
and also include the effects of non-active data on the HANA system.
The higher precision
you run the estimate at
the longer the program
is going to run.
With 14 parallel
processors and 8Tb
data warehouse, it is
not unusual to see 4575 minutes run time.
To increase speed,
you can also
suppress analysis
tables with less
than 1 MB size.
41
Moving EDW to SAP HANA – Automated tool 2
Since timeouts are common when
running the sizing program, you
can temporarily change the
parameter in rdisp/max_wprun_time
to 0 in BW transaction RZ11.
Finally, you estimate the growth for
the system as a percentage, or as
absolute growth.
After you have downloaded and installed the program, and-selected the parameters
above, you can go to SE38 and run SDF/HANA_BW_SIZING as a background job.
The output is stored in the file you specified and the file can now be email emailed
to hardware vendors for sizing input and hardware selection.
This program is attached to SAP Note: 1736976 on SAP Marketplace
42
Moving EDW to SAP HANA – Automated tool 3
Many experienced developers are
not aware that moving BW to
HANA can in some cases result in
slower transformations during
data loads.
a. Select for all entries (SFAE)
statements without HANA DB hints
--> add hints
b. Select * --> specify fields to select
c. Database access in the field routines
--> move to start routine
d. Loops which do not assign field
symbols --> use field symbols
e. Selects from master data tables -->
Use the read master data rule
f. Selects from DSOs --> Use the read
DSO rule
g. Direct updates to BW object tables -> Do not update tables directly
43
Download