Ryan-Shuttleworth

advertisement
Big data accessible to all
Ryan Shuttleworth, AWS Evangelist
Big Data
Compute
Lorem ipsum dolorStorage
sit
met,
consectetur
Bring compute capacity to the data
dipiscing elit. Etiam
uis ligula neque, eget
enenatis
sem. Personal
Suspendisse non eros
ulla, at placerat nibh.
Cras id lectus mattis est
llamcorper
blandit.
Proin ut nisi vitae enim
ulputate
tempor.
Phasellus id commodo
ros.
Mauris
nec
ignissim turpis. Nunc
Very large dataset
Cras id lectus mattis
est
ullamcorper
seeks
strong
&
consistent compute for
short term relationship,
possibly longer. GSOH a
plus
aws.amazon.com
Lorem ipsum dolor
amet,
consecte
adipiscing elit. Etia
quis ligula neque, eg
venenatis
se
Suspendisse non er
nulla, at placerat nibh
Cras id lectus mattis
est
ullamcorper
blandit. Proin ut nisi
vitae enim vulputate
tempor. Phasellus id
commodo
eros.
Mauris nec dignissim
turpis. Nunc
Storage
Big Data
Compute
Data has gravity
App
Data
App
http://blog.mccrory.me/2010/12/07/data-gravity-in-the-clouds/
Storage
Big Data
Compute
…and inertia at volume…
Data
http://blog.mccrory.me/2010/12/07/data-gravity-in-the-clouds/
Storage
Big Data
Compute
…easier to move applications to the data
Data
http://blog.mccrory.me/2010/12/07/data-gravity-in-the-clouds/
Storage
Big Data
From one instance…
Compute
Storage
Big Data
…to thousands
Compute
Storage
Big Data
and back again…
Compute
have data
have data
can store
have data
can store
can analyse
economically
fast
Who is your customer really?
What do people really like?
What is happening socially with
your products?
How do people really use your
products?
15
Lesson 1: don’t leave your Amazon
account logged in at home
Lesson 1: don’t leave your Amazon
account logged in at home
Lesson 2: Drive proactive business
processes based upon data you have
1 instance for 100 hours
=
100 instances for 1 hour
Small instance = $8
1 instance for 1,000 hours
=
1,000 instances for 1 hour
Small instance = $80
Achieving economies of scale
100%
Time
Achieving economies of scale
100%
Reserved capacity
Time
Achieving economies of scale
100%
On
On-demand
Reserved capacity
Time
Achieving economies of scale
100%
Spot
On
On-demand
Reserved capacity
Time
EMR with spot instances
Scenario #1
Job Flow
Duration:
14 Hours
#1: Cost without Spot
4 instances *14 hrs * $0.50 = $28
EMR with spot instances
Scenario #1
Scenario #2
Job Flow
Job Flow
Duration:
14 Hours
#1: Cost without Spot
4 instances *14 hrs * $0.50 = $28
Duration:
7 Hours
EMR with spot instances
Scenario #1
Scenario #2
Job Flow
Job Flow
Duration:
14 Hours
#1: Cost without Spot
4 instances *14 hrs * $0.50 = $28
Duration:
7 Hours
#2: Cost with Spot
4 instances *7 hrs * $0.50 = $14 +
5 instances * 7 hrs * $0.25 = $8.75
Total = $22.75
EMR with spot instances
Scenario #1
Scenario #2
Job Flow
Job Flow
Duration:
Time Savings: 50%
Cost Savings: ~22%
14 Hours
#1: Cost without Spot
4 instances *14 hrs * $0.50 = $28
Duration:
7 Hours
#2: Cost with Spot
4 instances *7 hrs * $0.50 = $14 +
5 instances * 7 hrs * $0.25 = $8.75
Total = $22.75
Easily and rapidly analyze
petabytes of data
Introducing Amazon
Redshift
Data Warehousing the AWS Way
1/10 the cost of traditional data
warehouses
Automated deployment &
administration
Compatible with popular BI tools
Internal Testing:
At Least 10X Faster for a Fraction of the Cost
Our Test
On-premises retail
data warehouse
32 nodes, 4.2 TB of
RAM, 1.6 PB of disk
Several million dollars
2 billion row data set
& 6 most complex
queries
Amazon
Redshift
Two 16 TB /
128 GB RAM nodes
$3.65 / hour
Get insight faster for less
Utility
Accessible
Pay-as-you-go
Big Data Cloud Economics
Unlimited
Low price
Performant
Download