
Data Grid: GRASP
Mike Smorul
<toaster@umiacs.umd.edu>
Grid Retrieval and Search Platform
Based on concepts from the Earth Science Data Interface (ESDI), developed at the UMIACS GLCF.
Provides a graphical interface into data grid holdings.
Access to entire GLCF holdings through the Storage Resource Broker (SRB).

Earth Science Data Interface
ESDI Overview
Designed to allow for intuitive browsing and searching of large geospatial data sets.
Tightly integrated set of web, ftp, and file servers (customized to GLCF).
Distributes over 7 TB of data per month.
Over 27,000 Landsat scenes and 13 TB of data available for download.

SRB Grid Testbed (original)
Modified the SRB to hold spatial data.
Contributed Informix port of the SRB (v3.2).
Linked three ESIP sites.
Tested replication between GMU and UMD.
Remote registration at UNH.
[Testbed diagram: UMD MCAT-enabled srbmaster (Informix); GMU srbmaster; UMD srbmaster with dfs access; UNH srbmaster. Data set labels: MSS(1), MSS(2), TM, MODIS.]

Lessons Learned
The SRB can easily handle textual metadata.
Spatial data could be stored in extended Informix attributes, but querying was available only through the DAI.
  Limited the SRB MCAT to Informix-based systems.
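
The distinction between textual and spatial metadata can be illustrated with a small sketch. This is not SRB, MCAT, or Informix code; `BoundingBox`, `Scene`, `find_scenes`, and the sample scenes are hypothetical names invented for illustration. It assumes footprints are simple latitude/longitude boxes that do not cross the antimeridian.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Axis-aligned box in degrees (hypothetical model, not an SRB type)."""
    west: float
    south: float
    east: float
    north: float

    def intersects(self, other: "BoundingBox") -> bool:
        # Two boxes overlap unless one lies entirely to one side of the other.
        return not (
            self.east < other.west
            or other.east < self.west
            or self.north < other.south
            or other.north < self.south
        )


@dataclass(frozen=True)
class Scene:
    name: str
    sensor: str             # textual metadata: plain equality match
    footprint: BoundingBox  # spatial metadata: needs a geometric test


def find_scenes(scenes, sensor, region):
    """Filter on a textual attribute, then on spatial overlap."""
    return [
        s.name
        for s in scenes
        if s.sensor == sensor and s.footprint.intersects(region)
    ]


# Made-up sample data for illustration only.
scenes = [
    Scene("scene-a", "TM", BoundingBox(-78.0, 38.0, -76.0, 40.0)),
    Scene("scene-b", "TM", BoundingBox(10.0, 45.0, 12.0, 47.0)),
    Scene("scene-c", "MODIS", BoundingBox(-80.0, 35.0, -70.0, 45.0)),
]
region = BoundingBox(-77.5, 38.5, -76.5, 39.5)
print(find_scenes(scenes, "TM", region))  # ['scene-a']
```

The textual filter is a string comparison that any catalog can express; the spatial filter needs an overlap test, which is the part that required database-specific support in the testbed.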

GRASP Architecture
[Architecture diagram labels: Spatial Information; Data Grid; Textual Information; Query Abstraction; I/O Abstraction Layer; Browse / Display; Data Discovery; Data download; Clients]

GRASP Architecture
GRASP uses a data grid as an abstract storage repository.
Metadata in the grid is mined from the grid itself or from external sources and published into a browsable form.
  Data grids may allow for platform-independent metadata, but may not be optimal for access.
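
The mine-and-publish idea above might be sketched as follows. This is a minimal illustration, not GRASP's implementation: `mine_metadata`, `publish_index`, the paths, and the rule that grid values override external values are all assumptions made for the example.

```python
from collections import defaultdict


def mine_metadata(grid_records, external_records):
    """Merge per-file metadata from two sources.

    Assumption for this sketch: external values fill gaps and
    values recorded in the grid take precedence on conflict.
    """
    merged = {}
    for path, attrs in external_records.items():
        merged[path] = dict(attrs)
    for path, attrs in grid_records.items():
        merged.setdefault(path, {}).update(attrs)
    return merged


def publish_index(merged, attribute):
    """Build a browsable index: attribute value -> sorted list of paths."""
    index = defaultdict(list)
    for path, attrs in merged.items():
        if attribute in attrs:
            index[attrs[attribute]].append(path)
    return {value: sorted(paths) for value, paths in index.items()}


# Hypothetical sample records.
grid = {
    "/glcf/landsat/scene1.tif": {"sensor": "TM"},
    "/glcf/modis/tile1.hdf": {"sensor": "MODIS"},
}
external = {
    "/glcf/landsat/scene1.tif": {"sensor": "unknown", "year": 2001},
}

merged = mine_metadata(grid, external)
by_sensor = publish_index(merged, "sensor")
print(by_sensor)
```

Publishing into a separate index is one way to keep the grid's metadata platform independent while still giving clients a structure that is fast to browse.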

GRASP Screenshot

Grid Holdings

Registered GLCF holdings
  Over 338,000 registered files
  4.4 TB total size
  Granular permissions on registered holdings
No need to move all data into the grid; pre-existing holdings were registered in place.
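
Registration in place can be pictured as recording where files already live instead of copying them. The sketch below uses only the Python standard library; `register_in_place`, the `/glcf` collection name, and the catalog layout are hypothetical and do not reflect the SRB's actual registration commands or MCAT schema.

```python
import os
import tempfile


def register_in_place(root, catalog, collection):
    """Record existing files in a catalog without moving or copying them."""
    for dirpath, _dirs, files in os.walk(root):
        for name in sorted(files):
            physical = os.path.join(dirpath, name)
            relative = os.path.relpath(physical, root).replace(os.sep, "/")
            logical = f"{collection}/{relative}"
            # Only the location and size are stored; the data stays put.
            catalog[logical] = {
                "physical_path": physical,
                "size": os.path.getsize(physical),
            }
    return catalog


# Build a throwaway directory tree to stand in for pre-existing holdings.
with tempfile.TemporaryDirectory() as root:
    os.makedirs(os.path.join(root, "landsat"))
    with open(os.path.join(root, "landsat", "scene1.tif"), "wb") as fh:
        fh.write(b"\x00" * 1024)
    catalog = register_in_place(root, {}, "/glcf")
    total = sum(entry["size"] for entry in catalog.values())
    print(len(catalog), "files,", total, "bytes registered")
```

Because the catalog stores only a logical name, a physical location, and a size, registering a large archive costs a directory walk and not a data transfer.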

Current data grid
Designed using newer SRB software that allows for a federated grid model.
Created using a standard SRB software configuration (PostgreSQL).
Large data sites can maintain independent MCATs
  Administratively independent
  Ability to customize data grid to site security/data requirements while maintaining compatibility across federation.
Smaller peers can register as clients of UMD.
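
A toy model of the federated layout, assuming each zone keeps its own catalog and consults federated peers only when a path is not found locally. The `Zone` class and its methods are hypothetical; real SRB zone resolution is more involved than this sketch. The zone names are taken from the overview diagram, while the paths and resource names are invented.

```python
class Zone:
    """One administratively independent zone with its own catalog."""

    def __init__(self, name):
        self.name = name
        self.catalog = {}  # logical path -> resource name
        self.peers = {}    # zone name -> Zone

    def register(self, path, resource):
        self.catalog[path] = resource

    def federate(self, other):
        # Federation is mutual here; each zone still owns its catalog.
        self.peers[other.name] = other
        other.peers[self.name] = self

    def locate(self, path):
        """Return (zone name, resource), checking the local catalog first."""
        if path in self.catalog:
            return self.name, self.catalog[path]
        for peer in self.peers.values():
            if path in peer.catalog:
                return peer.name, peer.catalog[path]
        raise KeyError(path)


umd = Zone("esip-umiacs")
remote = Zone("esip-remote")
umd.register("/landsat/scene1.tif", "umd-disk")
remote.register("/modis/tile1.hdf", "remote-disk")
umd.federate(remote)

print(umd.locate("/modis/tile1.hdf"))  # ('esip-remote', 'remote-disk')
```

Neither zone writes into the other's catalog, which mirrors the point that large sites stay administratively independent while remaining reachable across the federation.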

Data grid overview
[Overview diagram labels: GRASP Interface; Zone: esip-remote (Remote MCAT, three SRB Masters); Zone: esip-umiacs (UMD MCAT, two SRB Masters, DFS Gateway, Local Peer)]

Growing data grid
Additional MCATs can be federated.
Additional SRB-aware clients can be added.
Remote data-supplying sites can contribute data sets from their resources.
Sites needing quick local access to data can replicate to a local SRB setup.
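
The last point, replicating for quick local access, could be modeled like this. It is a sketch under stated assumptions: `replicate` and `choose_replica` are hypothetical helpers, the replica table is a plain dictionary, and the fallback rule (first remote site in alphabetical order) is arbitrary.

```python
def replicate(replicas, path, source_site, target_site):
    """Record a copy of `path` at target_site if source_site holds it."""
    sites = replicas.get(path, set())
    if source_site not in sites:
        raise ValueError(f"{source_site} holds no copy of {path}")
    sites.add(target_site)


def choose_replica(replicas, path, local_site):
    """Prefer the local copy; otherwise fall back to a remote site."""
    sites = replicas.get(path, set())
    if not sites:
        raise KeyError(path)
    if local_site in sites:
        return local_site
    return sorted(sites)[0]  # arbitrary but deterministic fallback


# Hypothetical replica table: logical path -> set of sites holding a copy.
path = "/glcf/landsat/scene1.tif"
replicas = {path: {"UMD"}}

before = choose_replica(replicas, path, "GMU")  # no local copy yet
replicate(replicas, path, "UMD", "GMU")
after = choose_replica(replicas, path, "GMU")   # local copy now preferred
print(before, after)  # UMD GMU
```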

Questions?