CA2

advertisement

SG5022 – IT Service Management – CA 2

Information Technology

Service Management

CA – 2

Team 7

1.

2.

3.

4.

5.

A0079953 : LEE ZHONG DE ROLLEI

A0079949 : JIANG HAO MIN

A0080037 : ZHOU ZIFENG

A0092677 : JOHN ROBERT BORCHORST

A0092649 : VISWALINGAM ARIVAZHAGAN

11

SG5022 – IT Service Management – CA 2 a) Calculate the overall availabiligy of this e-service- Assuming that the agreed service time of this e-service is 24x7, what is the amount of unscheduled downtime (ie. DT) in a year‘?

 Overall availability = 0.995 * 0.98 * 0.993 * 0.995 = 0.9634329285

Unscheduled downtime = (365 * 24) * (1-0.9634329285) = 320.32 (hours) b) XYZ Company has been receiving many complaints from customers that the e-service is often down when they need to use it. XYZ Company has decided that it should raise the availablily of this e-service to 97%. What is the unscheduled downtime (ie. DT) or disruption in a year with this new availability?

Unscheduled downtime = (365 * 24) * (1-0.97) = 262.8 (Hours) c) What is the difference in DT between b) and a) above? What Cost of Unavaiiability would

XYZ Company avoid with the increase in availability’?

Total lost in cost incurred for each hour of downtime = 300 + 100 + 50 = 450.

Cost Incurred for existing 96.3% = 320.32 * 450 = 144,144.00

Cost Incurred for targeted 97% = 262.8 * 450 = 118,260.0

Total savings = 144,144.00 - 118,260.0 = 25,884.00

11

SG5022 – IT Service Management – CA 2 d) With these basic options and the configuration in a) above as a base, draw two configuration alternatives for the e-service infrastructure (ie. draw Option A an Option B)?

Option A - $30,000

A B C D

99.5% 99.5% 99.3% 99.5%

Availability = 0.995 * 0.99.5 * 0.993 * 0.995 = 0.978179350875 = 97.8%

Unscheduled downtime = (365 * 24) * (1-0.9781) = 191.844 (hours)

Total lost in cost incurred due to downtime = 191.844 * 450 = $86,329.800

Option B - $15,000

B

A

98%

C D

99.5% B 99.3% 99.5%

98%

Availability =0.995 * (1 – ( (1-0.98) * (1-0.98) ) )* 0.993 * 0.995 = 0.982701587 = 98.2%

Unscheduled downtime = (365 * 24) * (1-0.9827) = 151.548 (hours)

Total lost in cost incurred due to downtime = 151.548 * 450 = $68,196.600

Will each of the alternatives be able to meet the new target of 97% availability? What is the actual availability of each alternative?

Yes. Both options will be able to meet the 97% availability. Option A has a availability of

97.8% and Option B has 98.2%

11

SG5022 – IT Service Management – CA 2

Be cost-justifiable based on the benefit from c) above and the need to the cost within one year‘? (Clue: What is the actual new DT based on availability of the alternative‘? How much has the actual Cost of Unavailability been reduced?)

Previously the unavailability cost was: 144,144.00

Actual Cost of Availability of Option A = 86,329.800, hence factoring in the upgrade cost of

30,000 the actual unavailability cost been reduced = 144,144.00 – (86,329.800 - 30,000) =

27,814.200

Actual Cost of Availability of Option B = 68,196.600, hence factoring in the upgrade cost of

15,000 the actual unavailability cost been reduced = 144,144.00 – (68,196.600

- 15,000) =

60,947.400

Yes. Both options have a breakeven point within a year. However, option B has a higher cost reduction.

Which option will you recommend to XYZ. Company?

Option B has a lower implementation cost and higher availability. Hence option B is a better option

11

SG5022 – IT Service Management – CA 2 e) Research the web for 3 tools or software which can be used to measure service a and/or component availability. Give the 3 web references (ie. URL) for these tools sofiware. Give a brief stmlmary explanation of what these tools or software do (instating which type of availability do they measure - service or component?)

Tool 1: CA Spectrum

Type of measurement

Service & Component

URL http://www.dachsug.ch/wiki/index.php/Report_Manager_%28SRM%29

Description

CA Spectrum is a product suite part of CA Service Assurance / Infrastructure Management solutions.

Spectrum is a expert system using a semantic network with frames as knowledge base as a sort of virtual model of the real infrastructure (CA uses the term Virtual Network Machine). Inference handles are implemented to update the VNM.

The core module will collect availability data from components and store up/down type of event in a database.

Spectrum uses the VNM to determine cause and impact.

Component availability is calculated based on the the time gap between up, down events and whether the component was the cause or just impacted.

Services are measured based on a kind of Component Failure Impact Analysis (CFIA) model. Where component events will impact the service depending on CFIA model for each.

Tool 2: CA eHealth

Type of measurement

Service & Component

URL http://www.ca.com/us/products/detail/CA-eHealth-Performance-Manager.aspx

Description

CA eHealth is a product suite part of CA Service Assurance / Infrastructure Management solutions.

The core module will collect availability and performance data from components and store the raw data in a database.

It will create baselines for trending and forecasting. Service profiles can be created to set thresholds for acceptable performance and availability. Projections is created estimating the time in days before a component is likely to reach saturation point.

Service exception point will be assigned based on the service profiles and a service report can be

11

SG5022 – IT Service Management – CA 2 created highlighting current capacity and future problems. Including over/underutilized and unbalanced components .

Its includes a basic what-if modeling tool than can be use show what and percentage increase/decrease in demand or capacity would impact the trend.

Ad-on modules can used to monitor the performance and availability of business-critical applications, transactions, and services as well as the end-user experience. Using passive and robotic transactions monitoring. eHealth uses 2 terms for availability management, reachability and availability on component level.

Where reachability is the measurement for whether the component is contactable from a defined point in the network. And availability is calculated based attributes like system uptime and is not impacted on whether any upstream device prevented the component being contactable.

Availability for services would be a combination of components groups and transaction timeouts.

Tool 3: BMC Service Impact Manager

Type of measurement

Service & Component

URL http://www.columnit.com/images/stories/PDFfiles/bmc-service-impact-manager.pdf

Description

BMC Service Impact Manager is part of BMC ProactiveNet Performance Management Suite. Its a pure event manage, relying on other tools to send events to it. Availability is calculated using an event based ProLog rule engine.

Service Availability is calculated based on a kind of Component Failure Impact Analysis (CFIA) model. And/Or the inclusions of events from a transaction monitoring tool.

11

SG5022 – IT Service Management – CA 2

2. The e-service application of XYZ. Company from question 1 consists of two different types of e-service — one for customers‘ purchase of XYT products and another for supplier's use in processing of invoices for supplies to XYZ Company. a) With regard to Business Capacity. Currently, XYZ. Has 12,000 e-purchases a year and has to process 2,400 supply invoices annually. For next year, the Sales Dept of XYZ company has projected a 10% increase in e-purchases while the Manufacturing Dept has projected a 3% increase in e-invoices. What is the projected Business Capacity requirements for next year in terms of number of e-purchases and number of e-invoices?

Ans: Annually e-purchases will increase to 12000 * (1+10%) = 13200

Ans: Annually e-invoices will increase to 2400 * (1+3%) = 2472 b) With regard to Service Capacity, one e-purchase will result in an average of 5 transactions in the

XYZ. e-service application, while one e-invoice will result in an average of 3 transactions in the

XYZ e-service application.

What is the current Service Capacity requirement in terms of total number of transactions a year in the e-service application?

Ans : 5 * 12000 + 3 *2400 = 67200

What is the projected Service Capacity requirements for next year in terms of the overall total number of transactions in c-service application?

Ans : 5 * 12000 * (1 + 10%) + 3 * 2400 * (1+ 3%) = 73416

What is the % increase in total number transactions?

Ans : (73416 – 67200) / 67200 = 9.25% c) With regard to Component Capacity, the simulation model of CPU usage to transaction rate for the XYZ Company e-service is shown in the chart below. The current peak transaction rate is 30 transactions per minute. Assume that for next year, the increase in peak transaction rate will be the same % as the increase in total number of transactions (see b) above).

What is the projected peak transaction rate for next year?

Ans : 30 * (1 + 9.25%) = 32.775

 What is the projected CPU utilisation percentage next year?

Ans : 20%

11

SG5022 – IT Service Management – CA 2

If XYZ Company has the policy to upgrade capacity before CPU utilization percentage reaches 70%, what plan should the company make for this server in the Capacity

Plan?

Scope

The scope only covers the server CPU utilization for the server used in e-service

Duration of reporting

Since no duration or frequency is indicated for the service capacity plan, the resource capacity report will only cover 12 months.

Frequency

The frequency of the resource capacity report is one month. In order to be able to detect deviation from the plan baseline.

Assumptions and data which the forecast is based

We do not have any other data than CPU utilization and is therefore unable to determine any potential bottlenecks course but other components performance indicators.

We assume that the relationship between transactions and CPUs utilization is correct. And that the data in which the CPU utilization modeling is performed is sufficiently granular and has excluded potential

CPU load course by other processes then the processes needed for the service application.

Trends in resource utilization

We have no CPU utilization data to perform any trending.

Capacity and performance threshold about to be reached

We do not have any current or open diction for any trade shows to be breached.

Exceptional deviations from the baseline

We do not have any prior baseline in which to determine any deviations.

Current and recent resource usage

We do not have any current or recent resource usage, only the current requirements in terms of transactions related into CPU utilization. And based on this estimate our current peak utilize station is around 20%.

11

SG5022 – IT Service Management – CA 2

Forecast

Short –term: We do not have a short-term for cost d ue to th e lack of data

Long-term : Our best 12 months for cost would estimate a utilization of around 20%.

Measures to cope with extra demand

No measures exist to cope with any extra demand.

Risks

That's a high risk that the underlying data that this capacity plan is performed on this flawed.

We do not have any response time, transaction time for the e-service which resources we are creating this plan for. And therefore have no way to assess the relationship between acceptable response time and resource consumption. Furthermore we are only assessing one parameter (CPU Utilization) whereas we should be interested in identifying all the potential bottlenecks and delivery stream of service. Additionally we cannot have any actual performance data in which we can go in and make a real assumption about the actual acquirement of resources needed to support the service.

Recommendations

Our best forecast for the server CPU does not warrant any upgrade based on our policy of upgrading before we reach 70%

But based on the huge amount of risk that we have taken in creating this capacity plans. We are leaving ourselves extremely vulnerable to getting our plan wrong.

Furthermore the only policy we have regarding to capacity planning is one that states that we should operate once at peak utilize station gets to 70%. This policy is potentially a problem in our goal to optimize resources and giving value to the business by reducing our cost. We should instead assess our services in terms off the user experience (their utility) and plans and resources and money should be targeted towards ensuring that we can maintain the user experience. If a modeling shows that the user experience with the CPU/station that doesn't have a prolonged peak of 100% would be sufficient

11

SG5022 – IT Service Management – CA 2

Research the web for 3 tools or software which can be used to measure CPU utilization or any other IT service capacity. Give the 3 web references (ie. URL) for these tools or software. Give a brief summary explanation of what these tools or software do and how they might be used for capacity planning or management.

Tool 1: Blue Coat PacketShaper

Type of measurement

Service

URL http://www.bluecoat.com/products/packetshaper

Description

PacketShaper from blue coat is a appliance that monitors network traffic. It has visibility on network layer 7, this enable it to be able to identify the different applications that are being routed through this network point and monitors the transaction time between the end user and the application giving it a full picture of the time for single transaction. Thereby being able to identify potential slowdowns delays within client network and application. It also happened capability to prioritize traffic so that business-critical application traffic gets priority and give by insured throughput and other types of traffic that on non-are critical will be given a new priority and only give through once sufficient bandwidth is available. packetShaper can be used in capacity management in 2 ways 1st it can be used to monitor the transaction time between end users and applications/services. In the 2nd way is that it can be used as an activity among management tool by prioritizing application traffic.

Tool 2: CA eHealth

Type of measurement

Service & Component

URL http://www.ca.com/us/products/detail/CA-eHealth-Performance-Manager.aspx

Description

CA eHealth is a product suite part of CA Service Assurance / Infrastructure Management solutions.

The core module will collect performance data from components and store the raw data in a database.

It will create baselines for trending and forecasting.

Service profiles can be created to set thresholds for acceptable performance and availability.

Projections are created estimating the time in days before a component is likely to reach saturation point.

11

SG5022 – IT Service Management – CA 2

Service exception point will be assigned based on the service profiles and a service report can be created highlighting current capacity and future problems. Including over/underutilized and unbalanced components .

It includes a basic what-if modeling tool than can be use show what and percentage increase/decrease in demand or capacity would impact the trend.

Ad-on modules can used to monitor the performance and availability of business-critical applications, transactions, and services as well as the end-user experience. Using passive and robotic transactions monitoring.

Tool 3: CACTI

Type of measurement

Component

URL http://www.cacti.net/index.php

Description

Cacti is an open-source, web-based network monitoring and graphing tool build as a front-end to

RRDtool (Round Robin database tool). it collects component data such as CPU, disk space, network traffic. Stores the collected data in a database and provides a graphing of historical data functionality.

The graphing tool also enables you to show the current trending as an additional chart. Catching can be used in capacity management to monitor that the expected performance of the components are working optimal and can be used as a baseline for forecasting.

11

Download