Evolution status and plans for NSIDC DAAC EOS Core System (ECS)
Vince Troisi
PoDAG 25: Data Systems

Evolution status and plans for ECS: EOSDIS Evolution 2015 Vision Tenets

Vision Tenet / Vision 2015 Goals
• Archive Management
  - NASA will ensure safe stewardship of the data through its lifetime.
  - The EOS archive holdings are regularly peer reviewed for scientific merit.
• EOS Data Interoperability
  - Multiple data and metadata streams can be seamlessly combined.
  - Research and value added communities use EOS data interoperably with other relevant data and systems.
  - Processing and data are mobile.
• Future Data Access and Processing
  - Data access latency is no longer an impediment.
  - Physical location of data storage is irrelevant.
  - Finding data is based on common search engines.
  - Services invoked by machine-machine interfaces.
  - Custom processing provides only the data needed, the way needed.
  - Open interfaces and best practice standard protocols universally employed.
• Data Pedigree
  - Mechanisms to collect and preserve the pedigree of derived data products are readily available.
• Cost Control
  - Data systems evolve into components that allow a fine-grained control over cost drivers.
• User Community Support
  - Expert knowledge is readily accessible to enable researchers to understand and use the data.
  - Community feedback directly to those responsible for a given system element.
• IT Currency
  - Access to all EOS data through services at least as rich as any contemporary science information system.
Feb 3, 2005

Evolution status and plans for ECS: Technology Infrastructure
• Commodity Servers
  - IBM Blade Servers
    o Red Hat Linux OS
  - SGI Altix 350
    o SUSE Linux OS, Sybase/SQS, FC/SCSI Disk Array
• Enhanced storage system to support Data Pool ingest, archive disk cache, and Data Pool storage
  - Fibre Channel SAN (SATA)
    o EMC CX700 disk array (74 TB SATA)
  - ADIC/Quantum StorNext File System
• To the extent hardware allows, all data on rotating disk
  - ADIC/Quantum StorNext Storage Manager replaces AMASS for providing the file system for data stored in the tape silo

Evolution status and plans for ECS: Technology Infrastructure (HW Architecture Transition)
TODAY:
• Server platforms with local RAID
• Solaris or IRIX OS
• Switched GigE network
• Some platforms connected to SAN
FUTURE:
• Rack-mounted blades
• Linux OS
• Rack-mounted SAN
• Optical fiber network infrastructure
• All platforms connected to SAN
[Diagram labels: Enterprise Servers with Local RAID, Local Monitor, Processing Blades, SAN, Data Pool Storage, GigE Switch]

Evolution status and plans for ECS: Technology Infrastructure (Target HW Architecture)
[Diagram labels: RAID/SAN, Fibre Channel Switch, Commodity Blade Servers, Silos/Drives, Commodity-Based Shared Memory DB Servers, Shared SAN with Shared Filesystem]

Evolution status and plans for ECS: Technology Infrastructure (Software Transition)
• ECHO to provide user search, browse, and order submission functions to the evolved ECS
  - Warehouse Inventory Search Tool (WIST) and other ECHO clients will provide these services
  - The V0Gateway will be removed
  - ECS orders will be implemented by ECHO clients using SOAP messages (via HTTP? JMS?)
    to establish the ECHO WSDL Ordering Component (EWOC)
• Data Pool drill down search interface would be retained
  - Java application operating in the Tomcat Java Application Server
• Other services retained include the HDF-EOS to GeoTIFF format conversion utility (HEG) and the HEW Subsetter Appliance
• Machine to Machine Gateway (MTMGW) will be removed
  - Still assessing the impact if this capability is removed from ECS
  - Primary use of this service is by NSIDC staff
  - May collaborate with LP DAAC (EDC) by providing requirements for a MTMGW service via the ECHO client

Evolution status and plans for ECS: Technology Infrastructure (Current SW Architecture)
[Diagram components: Users, Data Providers, Search & Order Gateways, Ingest Services, Data Services, Processing Services, Subscription Services, Distribution Services, Data Pool Services, Storage Management Services (FSMS), SAN Disk Caches & Nearline Tape Storage]
[Diagram flow labels: ingest requests, searches & orders, metadata inserts, searches, results, metadata events, insert requests, distribution requests, email notifications, data insert requests, recall requests, browse, downloads, FSMS metadata]
[Diagram legend: Request Flow, Data Flow]

Evolution status and plans for ECS: Technology Infrastructure (Target SW Architecture)
[Diagram components: Users, ECHO, Data Providers, DAAC On-Demand Processing, Order Management Service, Data Pool Service, HEG Service (subsetting, reformatting), Data Pool SAN (StorNext), Inventory DB, Anonymous FTP Service, Tape Archive (StorNext), Data Pool Web Application]
[Diagram flow labels: DN and granules (ftp, scp, e-mail); orders (tcp/ip); processing requests; PDR and granules (ftp, scp); PDRD, PAN (ftp, scp, e-mail); QA updates (e-mail); metadata (ftp, http); request staging; direct downloads]
[Diagram legend: Bulk data flows; Control flows, messages]
User functions of the Data Pool Web Application:
• Discovery (Data Pool drill down)
• Data orders
• On-demand processing
• Order status inquiries

Status and Plans for NSIDC DAAC
non-ECS Systems
Vince Troisi

Status and Plans for non-ECS Systems: Technology Infrastructure (Hardware Architecture)
• Commodity Servers
  - Intel and AMD Servers with Linux OS
    o Phase two of transition in progress includes transition of data production software to AMD servers
  - Intel Servers with Windows OS
• Enhanced Storage System to support NSIDC Storage Infrastructure
  - Fibre Channel SAN (SATA and SCSI)
    o ~14 TB including 3 TB to enable GLAS subsetting
  - ADIC/Quantum StorNext File System (in progress)
    o Evaluating ADIC/Quantum StorNext Storage Manager (SNSM) as a replacement for AMASS
    o Working an issue related to certification of SNSM for the Copan MAID VTL
    o May need to re-evaluate the Copan MAID VTL as an archive solution
    o Considering disk archive solutions
• Network upgrade for servers to Gigabit Ethernet
  - Switchover is scheduled for October

Status and Plans for non-ECS Systems: Technology Infrastructure (Current Capabilities)
• Metadata Database
  - Writer’s interface and Publisher via Cata-matic
    o DIFs to GCMD and to NSIDC dynamic Catalog pages
    o Transition of CIMS (V0 Database Server) planned
  - TRAK Data Operations and Preservation Metadata application
• Production Software
  - Grid-o-matic
  - Tools
    o Passive Microwave Swath Data Tools
    o AMSR Swath to Grid Tool
    o MODIS Swath to Grid Tool
    o EASE-Grid Geolocation Toolkit
    o NSIDC GLAS Altimetry Extraction Tool
• Search and Order Interface
  - GISMO (Geographic Interface for Subsetting, Mapping, and Ordering)
  - SNOWI (Search and Order Web Interface)
  - EOS Data Gateway (EDG, developed externally)

Status and Plans for non-ECS Systems: Technology Infrastructure (Current Capabilities)
• Development Tools
  - Mapx
    o A library for mapping and gridding transformation
  - JAZ
    o A Java geographic search interface for selecting an area of interest on a map
    o Basic geographic query function in GISMO and PSQ
    o Infused into the spatial search functionality of the EDG
  - Backtrack
    o The Backtrack Orbit
      Search Algorithm is a unique spatial search method for orbital data. Backtrack has a degree of accuracy that rivals predict methods while being faster, less costly to implement, and less costly to maintain than any other method
    o In process of generating an RFC to establish Backtrack as a standard orbital search algorithm within the Process and Standards subgroup of the NASA Earth Science Data Systems Working Group
    o Has been integrated into ECHO for searching orbital data collections

Status and Plans for non-ECS Systems: Recent technology infrastructure evolution activities
• Deployment of LDAP
• Installation of StorNext File System for establishing a FC SAN
  - Completion of installation scheduled for October
• Established a SOAP Server environment
  - Used to provide a database service for administering checksums of files stored in the archive
• Deployment of MapServer
  - MODIS Mosaic of Antarctica and RAMP
  - Atlas of the Cryosphere (under development)
• Additional Web services
  - Export MDDB Catalog information to Google Base
  - Export of location information from MDDB Catalog into KML format to Google Earth
  - Sea Ice Time Series in KML (Google Earth)

Status and Plans for non-ECS Systems: Recent technology infrastructure evolution activities
• Added PostgreSQL and PostGIS to development environment
  - Evaluating this technology for use as a geospatial database in lieu of Sybase
  - Installed Tomcat to evaluate as a Java Application Server solution
  - Installed GeoServer in development environment for evaluation
  - In process of re-engineering GISMO
  - Adding option for Browse of subsetted data via Google Earth
  - Expose backend processes as public web services
• Developed utility to convert binary to GeoTIFF for our Polar Stereographic and EASE gridded data sets

BACKUP SLIDES

Capabilities Envisioned for Future Science Information Systems
The Technology Infusion Working Group, one of four working
groups established by the NASA-sponsored Earth Science Data Systems Working Group (aka SEEDS), identified 10 capabilities in its vision of what information systems must provide to support scientific research in the future. This activity was conducted in parallel with the formulation of the NASA EOSDIS Evolution Vision for 2015. The group is in the process of assessing the correlation between its top ten capabilities and the goals identified in the NASA EOSDIS Evolution Vision for 2015.

Future Science Information Systems: Top Ten Capabilities
• Scalable Analysis Portals
• Community Modeling Frameworks
• Interactive Data Analysis
• Seamless Data Access
• Assisted Data & Service Discovery
• Interoperable Information Services
• Responsive Information Delivery
• Evolvable Technical Infrastructure
• Assisted Knowledge Building
• Verifiable Information Quality

Capabilities Vision: Enabling Technologies
• Scalable Analysis Portals
  - Assisted data & service discovery
  - Interactive data analysis
  - Seamless data access
  - Interoperable information services
  - Responsive information delivery
  - Verifiable information quality
• Assisted Data and Service Discovery
  - Data and service description standards (XML, WSDL, RDF, OWL, DAML), web service directories (UDDI), syndication services (RSS), topic maps
  - Rule-based logic systems
  - Established directory services (GCMD, ECHO, THREDDS)
• Interactive Data Analysis
  - Visual grammars
  - Visual programming environments (Cantata, Triana, Viper)
  - High-level analysis tools (IDL, MATLAB, Mathematica)
• Seamless Data Access
  - Network data access protocols (OPeNDAP, WMS/WCS, WebDAV, GridFTP)
  - Established data server tools (MapServer, DODS/LAS, ArcWeb)
• Interoperable Information Services
  - Network service protocols (SOAP, Java Remote Method Invocation, OPeNDAP, WS-*)
  - Grid toolkits (Globus)

Capabilities Vision: Enabling Technologies (Continued)
• Assisted Knowledge Building
  - Data mining algorithms (support vector machines, independent component analysis, rule induction)
  - Data mining toolkits (Adam, D2K, Darwin)
  - Data mining plug-ins (IMAGINE, ENVI, ArcGIS)
• Community Modeling Frameworks
  - Multi-model frameworks (ESMF, Model Coupling Toolkit, COCOLIB)
  - Model data exchange standards (BUFR, GRIB)
• Verifiable Information Quality
  - Data pedigree algorithms (NEXRAD Data Quality Control Algorithms)
  - Machine-readable formats (XML)
• Responsive Information Delivery
  - Optical networks (National LambdaRail)
  - Peer-to-peer networks with swarming (Modster)
  - Direct downlink of satellite observations
• Evolvable Technical Infrastructure
  - Processor & storage virtualization software (VMware, volume manager)
  - Scalable architectures (Beowulf, Grid)
  - Bandwidth-on-demand

Correlation Matrix: Capabilities Vision and Vision 2015 Goals
[Matrix columns (capabilities): Scalable Analysis Portals, Community Modeling Framework, Assisted data & service discovery, Assisted Knowledge Building, Interactive data analysis, Seamless data access, Interoperable information services, Responsive information delivery, Evolvable Technical Infrastructure, Verifiable information quality]
[Matrix rows (Vision 2015 goals): Safe Stewardship, Peer Review, Combining data streams seamlessly, EOS interoperability with other, Mobile processing and data, Data access latency, Physical location irrelevancy, Common search engines, Open interfaces and standard protocols, Machine-to-machine service interfaces, Custom processing, Data pedigree, Fine-grained cost control, Accessible expert knowledge, Community feedback, Rich access to EOS data]
[Matrix legend: 3 = High Correlation, 2 = Moderate Correlation, 1 = Some Correlation, Blank = No Correlation Identified. The position of individual cell values did not survive in the slide text.]

Evolution status and plans for ECS: Timeline for ECHO-WIST Transition
• LP DAAC DAR tool is in OPS and decouples MSS user profiles from its DAR tool logins
• Testing starts for End-to-End All-Guest access to ODL Orders and Subsetting using WIST + V0 Adaptors
• ECHO ready for testing SOAP Orders, Subsetting and ASTER OD, but needs UAH, ECS & LP DAAC components
• End-to-End ODL All-Guest access from WIST ECHO 9 to Legacy SDPS is in OPS
• ECHO continues to interoperate with ECS through the V0 Adaptors
ECS REL 7.20 EXTEST 05/31/07
• Retire JDT & DAR components
ECHO 9.0 EXTEST 01/2007
• End-to-end testing starts for SOAP/ECHO 9 All-Guest access to Subsetting and Ordering at NSIDC
• LP DAAC (TBD) testing starts for SOAP All-Guest access to ASTER OD and Orders
ECS OPS (TBD)
• End-to-end testing starts for LP DAAC MTMGW to ECHO
• LP DAAC interfaces directly to DORRAN for billing and accounting but needs EWOC
• OPS continues ODL support until ECHO User Profiles
• End-to-End SOAP All-Guest access from WIST to “Final Release” is in OPS
• ECHO ready for testing End-to-end thread for Registered User access to all threads but needs ECS Patch
• End-to-end testing starts for Registered User access from WIST - ECHO 10 to SDPS
• REL 7.20 LP DAAC OPS 12/31/06
• JDT and DAR Support
• User Profile Mgt for DAR User Logins to LP-JDT
• UAH ready for testing SOAP Subsetting but needs ECS EWOC
• SOAP I/F to Ordering and Subsetting
ECHO 9.0 OPS 04/01/07
JAN2007 APR2007
UAH EXTEST 05/31/07
• HEW Subsetter: SOAP I/Fs
MAY2007
LP DAAC EXTEST 07/2007
• Traffic Cop I/F in SOAP
• B&A I/F with DORRAN
ECS 7.20+ EXTEST 07/01/07
ECS OPS
• EWOC
LP DAAC OPS 08/2007
• Traffic Cop
• B&A I/F with DORRAN
LP DAAC EXTEST 08/2007
• EWOC: ECHO-WSDL Ordering Component
• MTMGW to ECHO
JUL2007 AUG2007
ECHO 10.0 EXTEST 09/2007
• User Profile Management Tools & Services
SEP2007
• Final ECS Evolution Release
• MTMGW to ECHO
UAH OPS
• SOAP subsetter
• Target WIST-ECHO SW architecture
• ECS-V0 adaptors can be disabled
• EDG can be retired
• All searches and browse serviced by ECHO
• All registered user access managed by ECHO
• Obsolete ECS components: V0GWY, MTMGW, MSS user profiles
LP DAAC OPS
ECS 7.20+ EXTEST (TBD)
• Remove User Profile accesses from ECS Custom Code
• Incorporate
  final user profile design into OMS contact info
OCT2007
ECS 7.20+ OPS
• All legacy profile accesses removed
ECHO 10.0 OPS
• User Profile Management transitions to ECHO in OPS
DEC2007
ECS REL 7.21 EXTEST 03/21/08
• Retire SDSRV, V0GW, MTMGW, MSS User Profiles
• Retire Sybase Replication Tables and COTS
MAR2008

Evolution status and plans for ECS: Operations Concept for EWOC (Tentative)
[Diagram components: User, WIST, ECHO, EWOC, OMS, External Subsetter, EMD, BMGT, MSS]
Numbered steps shown in the diagram:
  1a. Export metadata
  1b. Export browse
  2. Search
  3. Browse
  4. Order
  4a. Order
  4b. Register order and request
  4c. Place order
  4d. Order
  4e. Order
  4f. Update Order Status
  4g. Order Status Update
  4h. Order Status Update
  4i. Send email to user

Conceptual Layered Architecture: One Vision for an Evolved NSIDC Science Information System
[Diagram labels: User Communities; External Systems; Data Provider; Message Router/Gateway; Interface Services; ftp; http; GISMO/PSQ/WSRD (Java); WSDL/SOAP Server adapters; RSS Feed; OWS; WMS; WCS; WFS; Search and Retrieve Service; Custom Visualization; Custom Search; Reformat Service; Catalog Service; Inventory (granule/file) Service; Data Operations Metadata Service; Preservation Metadata Service; Subset | Stitching Services; Grid | Regrid Service; Project | Reproject Service; gzip, tar; NISE/NRTSI Processing; AVHRR Processing; EASE-Grid Processing; SSMI Sea Ice Processing; SI IndexGenerator; PerlDesk; Metrics Reporting System; TRAK; Archive Storage; Metadata Database; Working Storage; Replicate; Recover; DVD/CD Production]
Ingest Services
• Data Transfer
• Validation
• Checksum
• Extract/Generate Metadata
• Insert to Data Store
• Trigger Subscriptions (Data, Metadata, Browse)
Core Infrastructure Services
• Problem Tracking System
• SNIPS
• AntiVirus System
• Intrusion Detection System
• Sybase RDBMS
• Storage Management System (StorNext/AMASS)
• PostgreSQL/PostGIS
• MySQL
• LDAP
• SNMP
• Legato Backup System
• Apache
• Tomcat
• Copan MAID VTL (backup/archive)
• Offsite backup
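
The Ingest Services steps named in the layered architecture (validation, checksum, extract/generate metadata, insert to data store, trigger subscriptions) could be sketched roughly as follows. This is only an illustration under stated assumptions, not NSIDC's implementation: the names `DataStore`, `GranuleRecord`, and `ingest` are hypothetical, the data transfer step is assumed to have already delivered the bytes, SHA-256 stands in for whatever checksum the archive actually uses, and an in-memory dictionary stands in for archive storage and the metadata database.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass(frozen=True)
class GranuleRecord:
    """Minimal metadata generated at ingest (hypothetical fields)."""
    name: str
    size_bytes: int
    checksum: str


@dataclass
class DataStore:
    """Hypothetical in-memory stand-in for archive storage and the metadata database."""
    granules: Dict[str, bytes] = field(default_factory=dict)
    metadata: Dict[str, GranuleRecord] = field(default_factory=dict)


def ingest(
    name: str,
    payload: bytes,
    expected_checksum: str,
    store: DataStore,
    subscribers: List[Callable[[GranuleRecord], None]],
) -> GranuleRecord:
    """Run one already-transferred file through the ingest steps."""
    # Validation: reject empty deliveries before doing any other work.
    if not payload:
        raise ValueError(f"{name}: empty payload")

    # Checksum: compare against the value supplied by the data provider.
    # SHA-256 is an assumption; the slides do not name an algorithm.
    checksum = hashlib.sha256(payload).hexdigest()
    if checksum != expected_checksum:
        raise ValueError(f"{name}: checksum mismatch")

    # Extract/generate metadata.
    record = GranuleRecord(name=name, size_bytes=len(payload), checksum=checksum)

    # Insert to data store (data and metadata together).
    store.granules[name] = payload
    store.metadata[name] = record

    # Trigger subscriptions: notify each registered callback.
    for notify in subscribers:
        notify(record)
    return record


if __name__ == "__main__":
    store = DataStore()
    data = b"example granule contents"
    digest = hashlib.sha256(data).hexdigest()
    ingest("example.dat", data, digest, store, [print])
```

Running the checksum before the insert means a corrupted delivery never reaches the store, and the stored checksum can later be rechecked against the file, which is the kind of bookkeeping the checksum database service described earlier would support.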