2. OptIPuter FY2004 Program Plan

advertisement
OptIPuter FY2004 Revised Program Plan
NSF ITR Cooperative Agreement ANI-0225642
Larry Smarr, Principal Investigator
Cal-(IT)2, University of California, San Diego
smarr@cs.ucsd.edu
For the period October 1, 2003 – September 30, 2004
DRAFT COMPLETED January 30, 2004
SUBMITTED Monday, May 17, 2004
1.
Preface
On December 22, 2003, we submitted our Response to the OptIPuter Year 1 Project Review to NSF. Reviewers
noted (item 1b) that: By reducing scope and reallocating resources, or minimally making sure that a few key
research problems are addressed, the project will greatly enhance its opportunity for success and impact.
In our response, we stated the following:
We agree with the reviewers that defining 3-4 research topics each year, in order to provide quantitative metrics for
evaluating the success and impact of the project, is a good approach. We are now working with the team members
to define these research focus areas for year 2, proposed below:




How to we control lambdas and how do protocols influence their utility?
How is a LambdaGrid different from a Grid in terms of middleware?
How can lambdas enhance collaboration?
How are applications quantitatively helped by LambdaGrids?
These four areas represent goals to help focus OptIPuter team research efforts in year 2 (and beyond). Hence, our
teams must review their deliverables in the Cooperative Agreement for year 2 and in the Annual Progress Report
and Program Plan (PPP) (pp. 33-38) and then hone their activities to make sure these topics are addressed. We do
not advocate changing the deliverables, but tailoring them; i.e., making them more specific…This will happen by the
OptIPuter All Hands Meeting (AHM) in January 2004…and submitted shortly thereafter.
We held our AHM January 14-16, at which time all funded partners had a chance to work with OptIPuter Team
Leaders to address their Year 2 deliverables. Attached is an updated 2004 Program Plan, in which the deliverables
have been restated so that they emphasize the research topics defined above.
2.
OptIPuter FY2004 Program Plan
2.A. Year 2 Milestones
Our Year 2 Milestone, as described in the OptIPuter Cooperative Agreement:
All Southern California OptIPuter partners (UCSD, UCI, USC and SDSU) will have at least one fully
functioning OptIPuter node(s) connected to California’s CENIC experimental network. Chicago partners (UIC
and NU) will have OptIPuter nodes connected to I-WIRE. The nodes will all be based on UIC/NU’s “Mod 1”
work in Year 1.
2.B. Testbed Infrastructure (Networking, Clusters, Data Storage, Signaling Protocols) – Tom
DeFanti and Phil Papadopoulos, Team Leaders
Year 2 tasks: UCSD had hoped to install a DWDM core system on campus and interconnecting the Southern
California OptIPuter sites; however, this activity may overlap with year 3, given the limited capital money in the
budget. USC, UCI and SDSU, in addition to UCSD, will install OptIPuter nodes and link them to the CENIC
dark fiber experimental network. UIC, however, will deploy OptIPuter nodes connected via I-WIRE DWDM and
other dedicated lambdas; the nodes will all be based on UIC’s “Mod 1” work in Year 1.
UIC, with input from the application teams, will begin building the OptIPuter “Mod 2” using 64-bit processors
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
1
and upgraded storage, visualization and optical networking components. UIC will work with NU to test and
document the metro-area performance of clusterPC-to-clusterPC using up to 8 GE parallel connections through
the Glimmerglass and Calient O-O-O switches. Signaling protocols for switching lambdas will be designed,
tested, and refined. Similar tests will be done between UIC and UvA over the transatlantic links using VLANs and
groomed GigE circuits, also passed through the O-O-O switches. The APIs will be documented in papers and
manuals.
UIC will work with NU to develop optical network control technologies, incorporating network performance and
protocol parameters into new signaling protocols. The first-generation LambdaRAM will be designed and
implemented.
Southern California Testbed Infrastructure (Networking and Computing)
In Year 2, UCSD (Papadopoulos, Hidley) will:






Complete deployment of a campus-wide distributed testbed consisting of compute, storage, visualization,
and instrument endpoints. Management policies will be put in place so that experiments can schedule
physical hardware and use specialized (experiment-specific) software loaded on assigned nodes.
Provide account creation, cataloguing of resources, MRTG-based network monitoring and ganglia resource
monitoring on a web site; a preliminary version exists on the OptIPuter Intranet <http://web.optiputer.net>
The UCSD campus testbed will have a best-case bisection bandwidth of 5:1 and a worst-case of 8:1.
Monitoring will be put in place to measure when these links are saturated.
A donated IBM 48-node, 21 TB storage cluster will be deployed
Three 24-node dual Opteron clusters will be operational at UCSD campus nodes located at JSOE
(Engineering), SIO, and the School of Medicine. One of these clusters will be storage oriented
IPv6 capability will be made a standard part of the base OptIPuter software stack by the end of Year 2
Storage and compute cluster donations to UCSD, combined with a modest monetary investment from project funds,
have enabled ~150 OptIPuter nodes on campus. The OptIPuter project does not have the financial flexibility to
match the network to this large number of endpoints. To this end, UCSD will judiciously expand the network in the
most useful places


The storage endpoint will be connected to the Chiaro router over a 10 GigE link connected to a single 48port GigE aggregration switch, to be purchased by the project. Bisection will be 5:1.
The Chiaro router will have a 1GigE connection to the UCSD border router. A production virtual router
will be instantiated on the Chiaro to protect the campus network.
Effort will be made to create a Southern California OptIPuter testbed (UCSD, SDSU, UCI, and USC/ISI), connected
via CENIC CalREN HPR 1Gbps circuits. At UCSD, connection from the campus border router will be made to the
existing Chiaro OptIPuter router. Completion of this deliverable is dependent on CENIC’s internal deliverables.
Chicagoland Testbed Infrastructure (Networking and Computing)
UIC (DeFanti) will install a Force10 electronic switch and a 128x128 optical switch at the StarLight facility in
Chicago to connect 1GigE and 10GigE metro, regional, national and international research networks and lambdas.
UIC will also install a 64x64 optical switch at EVL which connects to StarLight over 8 GE circuits.
UIC (DeFanti, Leigh, Renambot) will evaluate AMD 64-bit clusters. Built and installed will be:





Mod 1 (32-bit) and Mod2 (64-bit) OptIPuter clusters as application production nodes
A production Terabyte raid at StarLight for Scripps caching,
“Reflector” PCs for achieving facile testbed subsets
A cluster for LambdaRAM
An Itanium PC with 10GigE NIC (provided by UvA)
In addition, UIC will monitor and benchmark various SOTANs (State-Of-The-Art Nodes) for proof of concept,
notably Itanium, Opteron and IBM G5. SOTANs are, by definition, the newest available technology – from
networking , architecture, and middleware points of view – but may not be usable by applications for several years.
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
2
Prototype/Evaluate High-Speed NIC Hardware and Driver/Software Stack
UCSD (Papadopoulos) will conduct an Infiniband test, connecting 4 interface cards (at 10Gbps each) to the IBM
Storage cluster and a Topspin switch. The Topspin switch will have 4 channel-bonded GigE interfaces for direct
Infiniband-to-GigE connections into the UCSD campus’ OptIPuter fabric. We will conduct experiments to measure
how protocol offloading of IP over Infiniband can improve performance. It will also be used to explore how highperformance cluster-area connections can be exploited to provide storage services.
Optical Signaling Protocols
UIC (Yu), NU (Mambretti), UCSD (Chien) and UvA (de Laat) will refine their optical backplane architecture. They
will evaluate techniques for optical control and switching. AAA will be developed by UvA. Signaling protocols are
being developed by UIC and NU, working with UvA and others.
Photonic Multicasting
UIC (Leigh, Renambot) will explore the extension of photonic multicast capabilities across the wide area.
LambdaRAM
LambdaRAM takes advantage of multiple-gigabit networks to pre-fetch information before an application is likely
to need it (similar to how RAM caches work in computers today). This technique collects memory in a compute
cluster and then allocates it as a cache to minimize the effects of latency over long-distance, high-speed networks.
LambdaRAM (UDP blasting) has been tested locally. In Year 2, UIC will design a second LambdaRAM prototype.
2.C. System Software and Middleware – Andrew Chien, Team Leader
Year 2 tasks: USC will begin designing system software and middleware between the cluster communication
fabric and the wide-area optical network. Design of a transparent (MAN/WAN) lambda  OS-bypass (local
cluster communication) messaging transport framework will be undertaken. UCSD will collaborate with UCI to
begin exploring security models using packet-level redundancy and multi-pathing. Elements of prototype
OptIPuter system software will begin to be released to partner sites for application development and evaluation.
UCI will begin design of the Time-Triggered Message-Triggered Object (TMO) programming framework for the
OptIPuter. TAMU will include network protocols and middleware in its performance analysis work.
System Software Architecture
UCSD (Chien) will prototype Distributed Virtual Computers (DVCs) for the OptIPuter, including the development
of DVC Interface/Services. These service interfaces constitute the primary presentation of OptIPuter capabilities to
applications. DVCs also require resource descriptions, which are used to allocate resources using Globus resource
management mechanisms as well as novel new tools that allocate light paths and directly control optical networks.
As stated in Section 2.B (Optical Signaling Protocols), UIC, NU, UCSD and UvA are working together to define
and implement a shared API to allow applications to access these capabilities, and to perform the necessary network
configurations to make these control capabilities useful. This capability, while far from a complete solution, will
enable basic experimentation with on-demand optical network configuration.
Real-Time Operating System
UCI (Kim) will work with UCSD to integrate the TMO middleware subsystem into OptIPuter middleware (DVC),
enabling demonstrations of current LAN capabilities on optical network WANs. A TMO-based application could
then run across distributed OptIPuter resources, with DVC support for allocation, setup, etc. Based on
experimentation in Year 3, we will design a real-time programming framework for the OptIPuter.
Data Storage
UCSD (Chien) will complete OptIPuter storage benchmarks that characterize the requirements for BIRN and
EarthScope applications; specifically, he will document baseline configurations (what they do in the demonstrations
today) and scaled, distributed versions (what we anticipate they will do when the distributed optical network
infrastructures and the applications are mature). Chien will develop analytical and simulation models for novel
approaches to managing distribution and replication of data using erasure codes, and use these models to explore the
design space. These explorations will help characterize and quantify the benefits that can be achieved, as in
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
3
performance, robustness, scalability, and will provide approaches for how to achieve them.
Security
UCI (Goodrich) will work with UCSD (Chien, Karin) to define distributed virtual computer security models that
simplify application access and cooperation across distributed sites. These models, in contrast to traditional Internet
and client-server security protocols (e.g., SSL), frame a new set of security problems. We anticipate framing and
initial characterization of these problems in this year, with deeper protocol work in subsequent years.
Application Performance Modeling
TAMU (Taylor) is working with the Data, Visualization and Collaboration Team (Leigh) and the Applications Team
in order to characterize their computing, communications, and storage access behaviors. We will pursue a particular
focus on how these applications are expected to behave in optical network environments. The TAMU researchers
will also help diagnose current performance bottlenecks and problems in the OptIPuter’s visualization software.
High-Performance Transport Protocols
UCSD (Chien) will simulate a Group Transport Protocol (GTP) that efficiently manages receiver contention
(efficient bandwidth handoff) likely to arise in optical switched networks. He will also pursue evaluation via
simulation, emulation, and ultimately implementation during Year 2. The GTP implementation will be integrated
with the concept of DVC for presentation to applications.
UIC (Leigh) will extend his work on RBUDP to Streaming RBUDP, which provides improved jitter control for
gigabit streaming applications (e.g, graphics and remote visualization) over dedicated light paths. Streaming
RBUDP employs flow-rate estimation and early loss recovery to reduce jitter in high-bandwidth environments. UIC
(Leigh) and UCSD (Chien) will work towards integrating LambdaRAM and photonic multicasting in DVCs. UIC
(Grossman) will continue to improve SABUL/UDT, moving to 10GigE interfaces and networks; this effort is
expected to produce an IETF draft on transport protocols as part of the OptIPuter’s technology transfer.
USC (Bannister) will document XCP’s design, develop a protocol specification, and publish XCP header format and
other protocol parameters as Internet Drafts. XCP performance will be benchmarked using BSD end-systems and
PC routers. XCP will be compared to TCP to determine how fast XCP will go on a PC router. The effects of XCP on
TCP and other background traffic will also be considered. USC will use a prototype implementation to explore
research issues, such as: How does XCP behave when there are L2 queues? How does XCP behave over variable
bandwidth links? USC will continue exploration of router implementation constraints in protocol design; e.g.,
header format and number representation.
The XCP PC-router design will be made available to OptIPuter collaborators. USC will help OptIPuter applications
migrate to XCP by creating instructions on how to modify code and (possibly) writing wrapper/shim software to
adapt TCP binaries to use XCP. USC will collaborate with router vendors on XCP router implementations.
2.D. Data, Visualization and Collaboration – Jason Leigh, Team Leader
Year 2 tasks: UIC, UCSD and SDSU will deploy data management, visualization and collaboration software
systems developed in Year 1 to OptIPuter application teams, both for testing and for feedback. UIC will continue
to enhance computing and visualization cluster hardware/software to achieve very-large-field real-time volume
rendering. UIC, UCSD and UCI will integrate the data middleware and high-level data services into an
OptIPuter application and begin using the data middleware to develop data-mining algorithms for clustering data
streaming over the OptIPuter. In addition, high-level software abstractions for remote data retrieval and for
accessing bundled computing resources as high-level computational units or peripherals will be studied.
Data
UCSD will deploy a large storage-intensive cluster funded by an IBM SURS grant to enable new OptIPuter
Storewidth research. See Section 2.B (Southern California Testbed Infrastructure).
UIC (Grossman) will achieve data transport rates of 5Gbps, improve the I/O interfaces of his DSTP servers, and
begin development of wide-area storage and database services over lambdas using SIO and/or NCMIR datasets.
UCI (Smyth) and SDSU (Frost) will continue to develop new statistical modeling and clustering algorithms capable
of handling high-bandwidth, spatio-temporal scientific data streams, which allow interactive visualization and
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
4
detection of clusters, outliers, and other structural properties of streaming datasets. UCI/SDSU will begin to examine
relevant geoscience data – notably MODIS satellite data – as a potential data source for this data mining research.
Photonic Multicasting
UIC/EVL will test Glimmerglass Networks’ photonic multicast option as a means of supporting collaboration over
lambdas.
Visualization/Collaboration Tools (General)
Further evaluation by UIC/EVL and UCSD/SIO of the effectiveness of seismoglyphs is needed.
UIC (Leigh) and USC (Thiébaux) will continue development of visualization techniques for data streaming, from
servers to tiled displays driven by an OptIPuter node. UCSD/SIO (Nayak) will tailor SIO’s visualization cluster
(purchased late in Year 1 for SIO) to enable Fledermaus to run over the cluster using Chromium.
UCSD/CRCA (Brown) is upgrading his multi-user, interactive graphic environment authoring toolkit (SceneEditor)
with OptIPuter-based applications in mind. Enhancements will include flexible scalability to heterogeneous display
configurations, as well as balancing storage, computer and display for very large, interactive datasets.
SDSU will help university and corporate partners connect visualization centers over high-speed networks and will
help data fusion and applications, using both wireless and optical communication, especially with IPv6 components.
Volume Visualization Tools
UIC and USC/ISI will integrate performance monitoring capability into their existing visualization tools to begin to
understand where current bottlenecks lie. This will inform the design of a new adaptive visualization architecture to
support both high-resolution digital montages and volume visualization.
NCSA (Cox), a new member of the OptIPuter team, will start working with SIO oceanographers to understand their
visualization requirements, with the goal of determining how to translate visual idioms from atmospheric science
and similar domains into oceanographic/earth science visualizations.
2.E. Applications and Education Activities – Mark Ellisman, Team Leader
Year 2 tasks: UCSD and SDSU will conduct full-scale visualization and collaboration trials using sample
geoscience and bioscience datasets over updated OptIPuter testbeds in order to identify key system bottlenecks
and usability problems with the software developed in Year 1. Education efforts will continue with the Preuss
School and the Lincoln Elementary School in Chicago, testing first-draft modules in the experimental section of
its core curriculum, involving grade-school children in observational science activities. The Sixth College will
select a group of students, from its distinctive Honors Students, to join OptIPuter application teams and work on
the OptIPuter node located within the College. In addition, an annual OptIPuter meeting/workshop will be held,
with some days dedicated to OptIPuter partners and some days open to the wider e-Science and highperformance computing and communications communities.
Application Codes Running on OptIPuter Clusters
SDSU will continue to work with UCSD/SDSC (Mike Bailey) and others at UCSD/SIO to develop content and
applications across the optical connectivity between the two sites as well as further develop connections to other
universities both in US and internationally. This will involve large datasets from both satellite imagery and seismic
cubes. Seismic data cubes, such as the Beta field in the offshore Long Beach area, which SDSU provided to Mike
Bailey, will be acquired where possible and satellite imagery over the top of the fields will be incorporated into the
visualizations. SDSU will also work to develop a high-resolution image of the San Diego county region for use by
first responders during crisis management situations.
All UCSD application teams will continue to provide sample datasets to technologists and will conduct a series of
collaboration trials with OpIPuter infrastructure and visualization groups to further elicit user requirements and test
emerging technologies in practicum.
UCSD/NCMIR will conduct a series of targeted collaboration trials with:


UCI to explore interactive volume rendering and annotation system
Osaka University to explore HDTV over IPv6 featuring future decoder/compression technologies
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
5



NetherLight and UK eScience in Edinburgh to explore optical projection tomography and the mouse
embryo database as integrated large data sources
EVL on newly constructed NCMIR Geowall, and
ISI to explore distributed rendering application.
UCSD/Preuss School
UCSD/SDSC (Steckler) will harden the visualization facility; i.e., it will go from a pilot facility to one easily used
by all teachers in multiple classrooms. Emphasis will be on using the OptIPuter network to bring data to Preuss for
analysis and visualization by the classes. Steckler will start exploring the use of the OptIPuter resources for other
areas outside of earth science.
UCSD/SIO (Kilb) will continue to provide Preuss School with educational material.
UCSD/Sixth College
The Sixth College will select a group of students, from its distinctive Honors Students, to join OptIPuter application
teams and work on the OptIPuter node located within the College.
UCSD/SIO (Kilb) will work with UCSD (Steckler, Wienhausen) to bring the education programs similar to what
was done in year 1 at the Preuss School to the UCSD Sixth College.
Lincoln Elementary School in Oak Park, IL
During Year 2, UIC (Moher) will (a) finalize, deploy and assess the effectiveness of the RoomQuake system that
was developed during Year 1 in collaboration with UCSD/SIO (Kilb), and (b) develop of a new application,
RoomBrain. These applications are part of a series of RoomWare systems designed to map scientific phenomena
into/onto the physical space of the classroom. Moher’s working hypothesis is that the classroom holds a special
place in the minds of students and teachers – apart from their bedroom, it is likely the place that a child spends most
of his or her time. By using the physical space of the classroom as a “stage” populated with props and a “thin layer
of technology”, he hope to provide settings in which elementary students can “play-act” their roles as scientists in
“participatory simulations.” In these settings, thin clients (e.g., PDAs, tablet displays) and large shared displays
serve as “activity points” for spontaneously aggregated groups of students.
UCSD/SIO Outreach Activities
SIO hopes to hold another teacher workshop at the SIO Visualization Center. SIO will work in tandem with the
Regional Workbench Consortium (RWBC), as needed. SIO will continue development of 3D visualizations for use
by members of the GeoWall Consortium in educational venues.
Annual OptIPuter meeting
Our second year annual OptIPuter meeting will take place January 14-16. All partners and interested parties will be
invited.
The OptIPuter FY2003 Annual Progress Report and FY2004 Program Plan – UPDATED January 30, 2004
6
Download