Grids, The TeraGrid and Beyond

advertisement
Paper Critique by SAMPATH AKKINENI
GRIDS, THE TERAGRID, AND BEYOND
By Daniel A. Reed
( IEEE Computer Society, Volume 36, Issue 1, January 2003 )
Due: 19th November 2003
Summary
TeraGrid or Extensible Terascale Facility (ETF) is a massive research computing project funded by
National Science Foundation. TeraGrid project includes combining five large computing and data
management facilities to create a distributed information technology infrastructure to support
collaborative computational science. Worldwide scientific infrastructure can be created by
combining numerous resources all over the world. TeraGrid is a virtual supercomputer built from
individual clusters linked by very high speed cross-country networks.
The following partners are involved in the TeraGrid project
 National Center for Supercomputing Applications at University of Illinois
 San Diego Supercomputer Center at University of California
 Argonne National Laboratory, Mathematics and Computer Science Division
 California Institute of Technology, Center for Advanced Computing Research
 Pittsburgh Supercomputing Center
The TeraGrid is being constructed as a persistent infrastructure within which resources are
individually managed at individual sites, but they provide some agreed upon grid services. TeraGrid
architecture contains Linux based HPC clusters powered by Intel’s 64-bit Itanium2 processors and
other open source software. TeraGrid components will be integrated using Linux and grid
middleware Globus toolkit which contains software for intracluster interactions. Globus toolkit will
also help with tasks like discovering what resources are available on the grid and deciding what jobs
have permissions to use them.
After completion, this TeraGrid project will include more then 16 Tflops of Linux cluster computing
distributed across the facilities that can manage and store 800 Tbytes of data. As part of this
TeraGrid project, researchers are working on projects like gravitational wave detection, earthquake
simulation, digital sky surveys and system biology etc. Other research proposals are being accepted
from researches and solutions for those problems will be found as part of this TeraGrid project.
Strengths





TeraGrid will help researchers solve complex problems that are beyond the capabilities of
conventional supercomputers.
Reduce or eliminate the barriers of time and space that make it difficult for distributed
research groups to collaborate
Complex real world problems can be solved with the availability of high speed
computational power and ability to analyze huge amounts of data
Allows people to tap into computing power, data resources and collaboration tools based on
their needs no matter where those tools or resources are located
Enables data and resource sharing among collaborating groups
Weaknesses


Privacy and security are major concerns as there is huge amount of data and resource
sharing among the participants
TeraGrid architecture introduces multiple points of failure as any computing facility can fail
any time
Interesting Points


The scale of TeraGrid is unprecedented in the history of computing. Success of TeraGrid
will be measured by how well it functions, how faster it is in comparison with the existing
traditional computing environments
Scalability and extensibility are critical factors for the success of TeraGrid project.
Information exchange protocols and data formats need to be defined and implemented at
individual facilities
Critical Questions





Is the TeraGrid interoperable? Is it possible to join the systems running on different
operating systems / languages to form a distributed computing environment?
How simple/complex is it to add other computer facilities and make the TeraGrid scalable?
How safe / vulnerable is the TeraGrid from internet attacks?
Are there any benchmarks against which the grid’s performance will be measured?
How can data integrity be maintained when huge amounts of data is shared and exchanged
between the facilities?
Download