Title Slide

advertisement
DICloud
Spiral 2 Year-end Project Review
Data-Intensive Cloud Control for GENI
University of Massachusetts Amherst
PI: Michael Zink, Prashant Shenoy and Jim Kurose
Staff: David Irwin and Emmanuel Cecchet
August 6, 2010
Sponsored by the National Science Foundation
Project Summary
• Conduct data-intensive experiments in GENI from
start (data collection point) to finish (processing
and archiving)
• Augment Orca control framework to
– Obtain data-centric slices that span GENI/VISE
(sensornet) resources and cloud resources (servers
and storage)
– Provide access to Amazon Web Service resources
– Execute experiment workflows to explicitly control
experiment data flow and resource allocation across a
network of components/aggregates.
Sponsored by the National Science Foundation
August 6, 2010
2
Milestone & QSR Status
ID
Milestone
Status
On
Time?
On
Wiki?
GPO
signoff?
S2.a
Cluster plan for VLANs
between testbeds
Static VLAN connection from UMass-Amherst to
BBN in Cambridge. NLR access through Boston
NOX or BBN.
On time
Yes
Yes
S2.b
Plan to connect to cloud
No layer 2 VPN. Amazon VPC still in beta.
Recommend usage of OpenVPN.
On time
Yes
Yes
S2.c
Handlers to allocate cloud
resources
Developed Orca handlers to allocate resources
from Amazon’s Elastic Compute Cloud (EC2),
Simple Storage Service (S3), and Elastic Block
Store (EBS) cloud services.
On time
Yes
Yes
S2.d
Policy to track usage
Based on existing Orca brokers accounting node
resources until handlers support AWS resource
tracking.
On time
Yes
Yes
S2.e
Demo archiving sensor data
Demo at GEC7 where RENCI dynamically stood
up a web server after stitching together the larger
network that queried archived radar data from a
web server running on an EC2 node on Amazon,
which ViSE bridged onto the VLAN using
OpenVPN. Radar data was displayed as a
Google Maps animation after Orca finished
stitching the links from BEN, NLR, Starlight, to
UMass.
On time
Yes
Yes
Sponsored by the National Science Foundation
August 6, 2010
3
Milestone & QSR Status
ID
Milestone
Status
On
Time?
On
Wiki?
GPO
signoff?
S2.f
Use CloudWatch
to monitor usage
Use Amazon CloudWatch for real timemonitoring of resource
usage and adjust cost at the end of a lease by querying
Amazon billing service.
On time
Yes
Yes
S2.g
Demo initial proxy
aggregate
manager
Demo at GEC8 storing radar data directly into Amazon S3,
retrieved the data and processed it on an EC2 server to
generate NowCast images stored back in S3. Visualized
images from S3 with real time cost usage. Revoked
automatically resources when allocated budget had been
spent.
On time
Yes
Yes
S2.m
Contribution to
GENI outreach
Two days workshop at the University of Puerto Rico, Mayaguez
in January 2010. Teach UPRM students about emerging
technologies in virtualization, cloud computing, wireless
communication, networking, and sensing that make it possible
to multiplex experimental testbeds, such as those being
incorporated into the GENI prototype. Lectures and tutorials on
virtualization, the GENI project, wireless communication,
research efforts at UMass-Amherst and cloud computing.
Early
Yes
Yes
QSR: 4Q2009
Complete
Yes
Yes
Yes
QSR: 1Q2010
Complete
Yes
Yes
Yes
QSR: 2Q2010
Complete
Yes
Yes
Yes
Sponsored by the National Science Foundation
August 6, 2010
4
Accomplishments 1:
Advancing GENI Spiral 2 Goals
• Integration of Amazon Web Service (AWS) resources in the Orca
control framework
– Elastic Compute Cloud (EC2) instances
– Simple Storage Service (S3)
– Elastic Block Storage (EBS)
• Framework independent Instrumentation & Measurement of cloud
resource usage
– Re-usable AWS accounting library for any framework
– Monitors server usage, network activity and disk usage (both storage
space and IOs)
• Deep programmability
– Root SSH access on compute servers
– Complete access to EBS volumes
– Transparent proxying of S3 queries
Sponsored by the National Science Foundation
August 6, 2010
5
Accomplishments 2:
Other Project Accomplishments
• Outreach activities
•
– Workshop at UPRM
– Tutorials at INRIA
External Publications
– Resource Management in Data-Intensive Clouds: Opportunities and Challenge David Irwin, Prashant Shenoy, Emmanuel Cecchet, and Michael Zink - Proceedings
of the 17th IEEE Workshop on Local and Metropolitan Area Networks (LANMAN
2010), May 5-7, 2010, Long Branch, New Jersey, USA.
– Automated Negotiation with Decommitment for Dynamic Resource Allocation in
Cloud Computing - Bo An, Victor Lesser, David Irwin, and Michael Zink Proceedings of the Ninth International Conference on Autonomous Agents and
Multiagent Systems (AAMAS), Toronto, Canada, May 2010.
•
GEC Demos
– Integrated plenary demonstration at GEC7 using OpenVPN on Amazon servers
– GEC8 weather processing demo using storage and compute servers on Amazon.
Sponsored by the National Science Foundation
August 6, 2010
6
Issues
• Currently no way to reserve/attach to dedicated circuits
with Amazon
– No isolation from the public Internet; can't link nodes directly to
NLR
– OpenVPN viable solution (GEC7 demo)
– Amazon Virtual Private Cloud service is beta and no budget for it
• How to allocate the AWS budget?
– 5000$ for the year (about 5 months for 5 servers (8hr/day), 200GB
network traffic and 5TB of storage)
– 1 experiment could use it all in few days
– Long term storage?
– Too little for multiple experiments?
Sponsored by the National Science Foundation
August 6, 2010
7
Plans
• Remainder of Spiral 2
– DICLOUD: S2.h Release initial proxy aggregate manager (Due 08/13/10)
already demo-ed at GEC8
– DICLOUD: S2.i Extend ViSE web portal to include cloud (Due 09/15/10)
– DICLOUD: S2.j Make available initial set of resources (Due 09/30/10)
– DICLOUD: S2.k POC to GENI response team (Due 09/30/10)
– DICLOUD: S2.l POC to GENI security team (Due 09/30/10)
• Spiral 3
– Focus on users
• Availability to users by extending Vise’s portal
• Internal testing by lab students
• Support for HPC applications and EC2 HPC instances
– Add a 5 node Eucalyptus cluster to the Cluster D foundation of generalpurpose resources
– Adapting to new AWS offering (VPC, S3 ACLs, …)
– Preparation for Orca/Gush integration
Sponsored by the National Science Foundation
August 6, 2010
8
Download