Geospatial Information, Fundamental Grid Challenges, and the Role of Standards Organizations

advertisement
Geospatial Information, Fundamental
Grid Challenges, and the Role of
Standards Organizations
European Geoinformatics Workshop
Edinburgh, March 9, 2007
Dr. Craig A. Lee, lee@aero.org
The Aerospace Corporation
(a non-profit, federally funded R&D center)
© 2006 OpenGridForum
What’s the Motivation for All This?
• Geospatial data has immense practical value
• Claim that large percentage of all data is
geospatial in nature
• Applicability across many domains
• Service Architecture concept is gaining wide
momentum
• Natural concept for designing, deploying, using
distributed systems
• Managing access to data, machines -- resources
of all kinds -- for geographically distributed users
• A Service Architecture for Geospatial Data
and Tools is a Clear Win
• It is imperative to engage key stakeholders
© 2006 OpenGridForum
2
But What’s the Larger Context?
• Geospatial Systems part of larger Systems-of-Systems
•
•
•
•
•
•
Automatically detect, ingest, and disseminate input data events
Automatically analyze the events and known data
Automatically plan responses
Distributed execution of workflows to enact the response
Workflows dynamically respond to further events of interest
Secure, autonomous operation in an environment with only
partial control and observability
• Focus: Event-Driven Workflows or Dynamic Workflows
• Events delivered to decision-making elements that need to know
• Decision makers plan and modify responses according to policy
• Workflows executed with distributed control in a dynamic env.
• Dynamic, Data-Driven Application Systems
© 2006 OpenGridForum
3
Motivation: DDDAS
Experiment
Measurements
Field-Data
User
Frederica Darema, NSF
© 2006 OpenGridForum
Experiment
Measurements
Field-Data
User
Challenges:
Application Simulations Development
Algorithms
Computing
Systems Support
4
Examples of Applications
benefiting from the new paradigm
• Engineering (Design and Control)
• aircraft design, oil exploration, semiconductor mfg, structural eng
• computing systems hardware and software design
(performance engineering)
• Crisis Management
• transportation systems (planning, accident response)
• weather, hurricanes/tornadoes, floods, fire propagation
• Medical
• customized surgery, radiation treatment, etc
• BioMechanics /BioEngineering
• Manufacturing/Business/Finance
• Supply Chain (Production Planning and Control)
• Financial Trading (Stock Mkt, Portfolio Analysis)
DDDAS has the potential to revolutionize
science, engineering, & management systems
© 2006 OpenGridForum
5
Fire Model
• Sensible and latent heat fluxes
from ground and canopy fire ->
heat fluxes in the atmospheric
model.
• Fire’s heat fluxes are absorbed
by air over a specified
extinction depth.
• 56% fuel mass -> H20 vapor
• 3% of sensible heat used to dry
ground fuel.
• Ground heat flux used to dry and
ignite the canopy.
Coupled Models
• Sensible and latent heat
• Fire Propagation
• Atmospheric Dynamics
© 2006 OpenGridForum
6
Slide Courtesy of Cohen/NCAR
Kirk Complex Fire. U.S.F.S. photo
Forest Fires in the Context
of a Sensor Network
Atmospheric
Model
Fire Prop.
Model
Fire
Fighters
Kirk Complex Fire. U.S.F.S. photo
© 2006 OpenGridForum
7
Combustion
Model
Policy,
Planning,
Response
Economic Modeling and
Well Management
Production Forecasting
Well Management
Reservoir
Performance
Data
Analysis
Simulation Models
Visualization
Multiple Realizations
Data Management and Manipulation
Data Collections from Simulations and
Field Measurements
© 2006 OpenGridForum
8
Field
Measurements
Reservoir Monitoring
Field Implementation
The NGS Program developsTechnology for integrated feedback & control
Runtime Compiling System (RCS) and Dynamic Application Composition
Application
Model
Dynamic Analysis
Situation
Distributed
Programming
Model
Application
Program
Compiler
Front-End
Application
Intermediate
Representation
Compiler
Back-End
Launch
Application (s)
Dynamically
Link
&
Execute
Performance
Measuremetns
&
Models
Application
Components
&
Frameworks
Distributed Computing Resources
Distributed Platform
tac-com
alg accelerator
….
data
base
fire
cntl
SAR
9
© 2006 OpenGridForum
MPP
NOW
fire
cntl
data
base
SP
F. Darema,
NSF
A DDDAS Model
(Dynamic, Data-Driven Application Systems)
Discover, Ingest, Interact
Models
Discover,
Ingest,
Interact
Computations
Loads a behavior into
the infrastructure
sensors & actuators
sensors & actuators
Cosmological: Humans:
10e-20 Hz.
3 Hz.
sensors & actuators
Computational
Infrastructure
(grids, perhaps?)
Spectrum of Physical Systems
© 2006 OpenGridForum
10
Subatomic:
10e+20 Hz.
Top-Level Concept: Integration of Event
Notification and Workflow
Policy
Decision
Maker
Communication
Domain
Sensed
Events
Decision
Maker
Abstract Plan
Decision
Maker
discovery
Response
Resource Info
and
Mgmt Service
© 2006 OpenGridForum
Concrete Action
register
11
Top-Level Concept
Policy
Content-Based Routing Domain
Decision
Maker
Communication
Domain
Sensed
Events
Decision
Maker
Abstract Plan
Decision
Maker
discovery
Response
Resource Info
and
Mgmt Service
© 2006 OpenGridForum
Concrete Action
register
12
Top-Level Concept
Policy
Decision
Persistent
Maker
Decision-making
Computations
Determined
Decision
by Policy
Maker
Communication
Domain
Sensed
Events
Decision
Maker
Abstract Plan
discovery
Response
Resource Info
and
Mgmt Service
© 2006 OpenGridForum
Concrete Action
register
13
Top-Level Concept
Policy
Decision
Maker
Communication
Domain
Sensed
Events
Decision
Maker
Abstract Plan
Decision
Maker
discovery
Response
Resource Info
and
Mgmt Service
Concrete Action
register
Grid Information Service
© 2006 OpenGridForum
14
Top-Level Concept
Policy
Decision
Maker
Sensed
Events
Communication
Domain
Decision
Maker
Abstract Plan
Decision
Maker
Dynamic
Grid Workflow
Management
discovery
Response
Resource Info
and
Mgmt Service
© 2006 OpenGridForum
Concrete Action
register
15
Required Capabilities
• Events delivered to decision-making elements that need to know
• Event Notification Service Managed by Publish/Subscribe
• Pre-defined Topics
• Publication Advertisements
• User-defined Attributes
• Content-Based Routing – Topology-Aware Communication
• Decision makers plan responses as determined by policy
• Semantic analysis to determine the “meaning” of sets of events
• Planning - “path construction” from current state to goal state
• Classic topics in Artificial Intelligence
• Resource Information & Management Systems
• Distributed, Scalable, Timely
• Metadata schemas, Ontologies
• Responses executed as distributed workflows
• Workflow Engine independently manages
• Scheduling of Data Transfer
• Scheduling of Process Execution
• Centralized vs. Distributed
© 2006 OpenGridForum
16
General Architecture for Topology-Aware
Communication Services
Peer-to-Peer Network
Subscription “signals”
propagate through the
P2P Network
© 2006 OpenGridForum
Events are published to the
P2P network which are then
routed to subscribers
17
Many Types of Communication
Services Improved or Enabled
• Augmented Semantics
• Caching (web caching), filtering, compression, encryption,
quality of service, data-transcoding, etc.
• Collective Operations
• Accomplished “in the network” rather than using point-to-point
msgs across the diameter of the grid
• Communication Scope
• Named topologies can denote a communication scope to limit
problem size and improve performance
• Content and Policy-based networking
• Publish/subscribe, interest management, event services, tuple
spaces, quality of service
• Issues
• Topology management/construction
• Dynamic member join/departure
• Reliability
• Maintaining distributed state in the network
• Security
• Integrity, authentication, authorization of signaling messages
© 2006 OpenGridForum
18
Grid Workflow Management
• Organization of distributed computing services
• Rather than building applications with ad hoc, "hard-coded“ task
organization, workflow provides a general mechanism for
distributed task organization
• Independent scheduling of data transfer and process
execution
•
•
•
•
Key Capability for all Workflow tools
Subsequent task may not exist when previous task completes
Where subsequent task is to execute may not even be decided
Output data may have to be buffered until it is needed/can be
used
© 2006 OpenGridForum
19
Workflow Mgmt Considerations
• Representation
• Graphical (DAGs), Syntactic (code, XML)
• Creation
• Eager vs. lazy binding of service to physical resources
• Eager vs. lazy binding of workflow to service
• Co-Scheduling vs. Incremental Scheduling
• Data Transfer
• Streaming
• Buffered channel
• File Transfer
• Data Persistence and Lifetime
• How long does the data live where it is?
• Workflow Engine – executes the workflow
• Centralized? (“orchestration”)
• Decentralized? (“choreography”)
© 2006 OpenGridForum
20
Combining Events and Workflow:
Dynamic Event-Driven Workflows
• Besides events precipitating an initial response
workflow, subsequent events may alter an existing
workflow that is underway
• Current amount of workflow completed must be determined
• Current tasks on the “leading edge” of the workflow must be
terminated or allowed to complete
• Status and disposition of data referenced by tasks must be
determined
• “Classical” storage management issues reoccur
• Dangling references to no data or stale data
• Unaccessible data referenced by no one
• Such event-driven task mgmt is similar to fault tolerance
• Similar mechanisms could be used to detect and respond to
faults (failed servers, networks, etc.)
• Directly Supports DDDAS Concept
© 2006 OpenGridForum
21
Responding to Events under
Centralized Workflow Control
Event
Subscription
Event
Notification
What is State of Workflow
When Event Received?
Client
Making Decision
(Centralized
Control)
Possible Actions after Event:
• Do Nothing
• Cancel Entire Workflow
• Cancel Part of Workflow
• Conditional Workflow
© 2006 OpenGridForum
How is Workflow Executed?
• Client statically decides workflow
services and servers prior to start-time
• Client incrementally decides services
and servers during run-time
22
In General, Nested or Recursive
Workflows will be Possible
Event
Subscription
Event
Notification
What is State of Workflow
When Event Received?
Client
Making Decision
(Centralized
Control)
Even if Control is Centralized,
Client May Not Know Entire Workflow State
© 2006 OpenGridForum
23
Avoiding Single Point of Failure:
Decentralized Workflow Control
Event
Subscription
Event
Notification
Client
Making Decision
(Decentralized
Control)
• Workflow Representation passed among workflow services
• Initiating Client does not explicitly manage each service
• Nested, recursive workflows still possibly
© 2006 OpenGridForum
24
Responding to Events under
Decentralized Workflow Control
Event Notification
• Currently active workflow agents subscribe to
appropriate event topics
• Workflow agents may need to find and coordinate
with their active collaborators
25
© 2006 OpenGridForum
Programming Decentralized Workflows?
• “Process programming” in a distributed environment
• Example: Little-JIL
• Agent Coordination Language
• A coordination tree with four non-leaf operations
sequential, parallel, try, choice
• Other possibilities?
• Stream-based languages?
• Dataflow languages?
• Decentralized Workflows similar to
Active Networks, Active Agents and Active Messages
•
“Programming the message, not the node”
• Autonomic behavior
• If peer agent fails, agent will have to infer workflow repair to reach
goal state
© 2006 OpenGridForum
26
Use of A Priori Information in Grids
Knowable independently of experience
Task-Define-time
Task-Run-time
Start Time
• Expects the world to have certain
properties or be in a known state
• Semantic translation tools can be
used, i.e., compilers
• Entire code units can be
examined, analyzed, optimized
• Static information compiled-in
• Everything that can be statically
defined a priori takes complexity
out of the application and
improves performance
© 2006 OpenGridForum
• Increasing use of a posteriori
information learned from experience
• Capturing more information about a
running app and the environment
• More and more dynamic late binding
• “Smart” run-time
• “Smart back-end” of a compiler
• Limited control and imperfect
knowledge of the environment
• Must apply reasoning to what is
semantically understandable
27
Future Generation Grids:
We Are Being Pushed Into…
• Dynamic discovery, late binding
• How little a priori knowledge can be "compiled-in"?
• Resource virtualization
• Performance penalty for deciding everything dynamically
• Autonomic Control Cycle Occurs Everywhere
•
•
•
•
•
Monitor, Understand, Plan, Respond
Fault Tolerance/Recovery
Real-Time/Physical System Monitoring & Interaction
Dynamic configuration (late discovery, binding)
Anytime a goal state must be reached
• Planning is a classic AI capability
• Chaining of "moves" to get from current state to goal state
• Inferencing on known and discoverable facts
• Done in environment with imperfect knowledge and limited control
• If plan fails, replan and try again
• Declarative programming techniques
• Programming the “What”, not the “How”
• Geosemantics is an archetypal example of this fundamental
challenge to grid computing
© 2006 OpenGridForum
• Advances made in this field should be understood, and hopefully
generalized, for this wider context
28
• Interdisciplinary approach
How Do We Make Progress on
these Fundamental Challenges?
• Research
• Organized Research
• Governmental funding agencies
• Organized, Interdisciplinary Research
• Getting the right fields of expertise to collaborate
• Organized Adoption
• Open Grid Forum (OGF)
• World Wide Web Consortium (W3C)
• Organization for the Adv. of Structured Information
Standards (OASIS)
• Distributed Management Task Force (DMTF)
• Storage Networking Industry Association (SNIA)
• Tele Management Forum (TMF)
• Internet Engineering Task Force (IETF)
• International Telecommunication Union (ITU-T)
© 2006 OpenGridForum
29
Key Technical Areas
• Security
• How to manage grid identity and access
• Metadata and Ontologies
• How to define the relevant information architecture
• Data Discovery and Management
• How to manage the location and access to cached and
replicated data
• Semantics
• How to use the meaning of data to produce information
• Service Architectures
• How to integrate and manage all resources as a whole and
provide dynamic, transparent access
© 2006 OpenGridForum
30
Security
• Security Capabilities
• Authentication, Authorization, Privacy, Integrity, Non-Repudiation
• Authentication
• Evolving to combination of GSI, Kerberos and Shibboleth
• Authorization
• Databases (VOMS and Permis)
• Role-based (TeraGrid, OSG)
• WS-Security
• Performance is an issue
• Delegation of Trust -- Delegation of Identity
• Identity is also dependent on role in a Virtual Organization
• Identity has a structure
© 2006 OpenGridForum
31
Metadata and Ontologies
•
Metadata – data about data, e.g.,
•
•
•
Federal Geographic Data Committee, Content Standard on Digital Geospatial Metadata
GML 3.0 (Geographic Markup Language)
ISO Standards
•
•
•
•
•
ISO 19115:2003 Metadata
ISO 19115.2 Metadata-Part 2: Extensions for Imagery and Gridded Data (within two years)
ISO 19119:2005 Services
ISO 19130 Sensor Model and Data Model for Imagery (within two years)
Ontologies
•
Needed to capture process behavior, spatial/temporal characteristics, data and process
relationships
• Need to be more than just keyword lists for classification
•
OWL: Web Ontology Language
•
•
•
•
Semantic markup language for publishing and sharing ontologies on the web
OWL ontology: description of classes, properties and their instances
OWL-S: web service ontology
Are GML, ISO standards and OWL sufficient for geospatial representation and
reasoning?
•
(No!)
© 2006 OpenGridForum
32
Data Discovery and Management
• Data (and services) must be published in a registry to be
discoverable
• Metadata and Ontologies are essential
• UDDI is generally considered to be inadequate
• Not scalable, poor semantics for application data
• Combined catalogue and storage management
• Storage Resource Broker (SRB, SDSC)
• SRB MCAT (Metadata Catalogue) used to manage access across
multiple remote sites
• OGSA-DAI (Open Grid Forum)
• Open Grid Service Architecture-Data Access and Integration
• Web service access to files, databases
• Globus Data Replication Services
• Built to support high-energy physics projects
• Controls pushing of data closer to key consumers
• Enables user to choose “closest” replica
• Storage Networking community driving to storage virtualization
• E.g., Amazon S3 (Simple Storage Service)
© 2006 OpenGridForum
33
Semantics: Enabling Intelligence
• Automated Systems Only Possible with
Well-Known Semantics
• Environmental Decision Systems
• Emergency Decision Systems
• SWRL: Semantic Web Rule Language
•
•
•
•
Extension to OWL
Adds parts of RuleML into OWL
Extends OWL axioms to Horn-like clauses
Will this be sufficient?
© 2006 OpenGridForum
34
Service Architectures: Key Capability Areas
Covered by Core WS-* Specs
WS-* Specification Area
Examples
1: Core Service Model
XML, WSDL, SOAP
2: Service Internet
WS-Addressing, WS-MessageDelivery; Reliable
Messaging WSRM
WS-Notification, WS-Eventing (Publish-Subscribe)
3: Notification
4: Workflow and
Transactions
5: Security
BPEL, WS-Choreography, WS-Coordination
6: Service Discovery
WS-Security, WS-Trust, WS-Federation, SAML, WSSecureConversation
UDDI, WS-Discovery
7: System Metadata and State
WSRF, WS-MetadataExchange, WS-Context
8: Management
WSDM, WS-Management, WS-Transfer
9: Policy and Agreements
WS-Policy, WS-Agreement
10: Portals and User
Interfaces
WSRP (Remote Portlets)
Consensus
© 2006 OpenGridForum
Merging
Developing
35 Ho, Pierce – U. Indiana
B&W table courtesy of Fox,
Issues from an Organizational Perspective
• General Consensus Only on WS Basic Building Blocks
• Must avoid vendor-specific solutions – Adopt vendor-neutral approach
• Adoption Roadmap and Timetable?
• Much Work Remains to be Done – And It Is Underway
• Topics for Harmonization
• Merging of competing WS standards expected
• Service Component Architecture (SCA) and Open Grid Services Arch (OGSA)
• Service Data Objects (SDO) and Web Service Resource Framework (WSRF)
• Workflow Management (aka web service chaining)
• Triana, Taverna, Pegasus, BPEL
• Semantically-aware workflow engine
• SAGA: Simple API for Grid Apps – a basic grid programming model
• Common “look-and-feel” for programming in a distributed environment
• Appropriate use and cost
• Not everything needs to be a service in a service architecture
• Adoption of any new technology, e.g., SOA, is more expensive up front
© 2006 OpenGridForum
36
Driving Innovation
Slide Borrowed from Ulf Dahlsten, Director
‘Emerging Technologies and Infrastructures’
Phase 0
Research
Phase 1
Solution proposal
Phase 2
Prototype
Phase 3
Pre-commercial
product/service
Phase 4
Commercial
product/service
Innovation
Marketpull
Pull
Market
Research push
“no man’s
(and other
land” SDOs)
OGF
Managing the Technology Maturation Process
© 2006 OpenGridForum
37
OGF Technical Strategy/Stakeholder
Alignment Process
Application of Best Known
Practices and Current Standards
Uses Cases
Architectures
OGF
Events
Requirements
Requirements
Workshops
Technical Strategy
Committee
OGF
Technical
Strategy &
Roadmap
Milestones
Standards Groups
& Workshops
OGF
Document
Series
Best Practices
Analysis, Interpretation &
Prioritization of Requirements
© 2006 OpenGridForum
38
Specifications
A More Refined View
Standards
Groups
Best Practice
Workshops
Requirements
Solicitation
Best Practices
Best
Practices
EGR-RG
Applications
Req
Req
SN-CG
Architecture
Financial
Telco
Pharma
EDA
Vendors
© 2006 OpenGridForum
Req
Requirements
Rollup, Analysis
&
Prioritization
(EGR-RG)
Prioritized Req
and Req Patterns
Data
Infrastructure
What WGs are doing
i.e. WG roadmap
Req and Req
Patterns
Requirements
Compute
TSC
GAP Analysis
• Overall standards roadmap
• Gap analysis of WG
roadmap vs. prioritized Req
•Recommended actions
39
Management
Security
Specs
OGF Grid Requirements Roll-up
• INFORMATION ARCHITECTURE
• DATA MANAGEMENT
• Metadata Schemas, Ontologies & Semantics
• Data Profiling
• Data-tagging, including managing files
• DISCOVERY
• Detailed Asset Discovery
• API for Product Capability Discovery
• Extract information about a project or a product •
• Lets users grab the right data -- categorizing data
• Content/Data Discovery
• Catalogue-based Data Access
• RESOURCE VIRTUALIZATION
• Dynamic Provisioning
• Capacity on demand
• Capacity grows as available
• Content Provisioning
• Provisioning and Capacity Management
• JOB MANAGEMENT
•
•
•
•
Distributed Execution
Job Submission
Job control management
Job Migration
© 2006 OpenGridForum
•
•
•
•
•
•
Data Copy, Data Movement
Backup
Storage Policy Mgmt
Replica Mgmt
Caching (local disk, indexes, memory)
Data Grid APIs
GRID MANAGEMENT
•
•
•
•
•
• FAULT TOL. & ERROR MGMT
Mgmt Console GUI
Asset Management and Topology
Policy Management and Quotas
Mitigate management overhead
Transition/evolution models
• WORKFLOW
• Planning
• Management (Cent. & Dist.)
• SCHEDULING
• Meta scheduler, data aware
• MONITORING & EVENT NOTIF.
•
•
•
•
•
•
Deep Error Analysis
Error Audit
Verification & Audit
Root Cause Analysis
Job error management
Very high levels of uptime
• SECURITY
• Grid Identity Mgmt
• Strong Security
• Multiple domains
• ACCOUNTING & AUDITING
• Billing and Chargeback
• Chargeback models
• Business issues (charge back)
• Sarbanes-Oxley Support
• SYSTEM DEVEL & DEPLOY
• Simplify application development
• End-User Tools and Envs
• Monitoring, Auditing and Alert Mgmt • AUTONOMIC BEHAVIORS
• Monitoring
• Semantics
• Planning
• Action
40
Current OGF Standards Work
• Applications
• Infrastructure
Distributed Resource Mgmt App. API WG (drmaa-wg)
Grid Checkpoint Recovery WG (gridcpr-wg)
Grid Information Retrieval WG (gir-wg)
Grid Remote Procedure Call WG (gridrpc-wg)
Simple API for Grid Applications Core WG (saga-core-wg)
Grid and Virtualization Working Group (gridvirt-wg)
Network Mark-up Language Working Group (nml-wg)
Network Measurements Working Group (nm-wg)
• Management
Application Contents Service WG (acs-wg)
Configuration Description, Deployment, and Lifecycle
Management WG (cddlm-wg)
Glue Schema Working Group (glue-wg)
OGSA Resource Usage Service WG (rus-wg)
Usage Record WG (ur-wg)
• Architecture
OGSA Naming Working Group (ogsa-naming-wg)
Open Grid Services Architecture WG (ogsa-wg)
• Compute
Grid Resource Alloc. Agreement Protocol WG (graap-wg)
Job Submission Description Language WG (jsdl-wg)
OGSA Basic Execution Services WG (ogsa-bes-wg)
OGSA High Perf. Computing Profile WG (ogsa-hpcp-wg)
OGSA Resource Selection Services WG (ogsa-rss-wg)
• Security
OGSA Authorization WG (ogsa-authz-wg)
Trusted Computing Research Group (tc-rg)
• Data
Data Format Description Language WG (dfdl-wg)
Database Access and Integration Services WG (dais-wg)
Grid File System Working Group (gfs-wg)
Grid Storage Management WG (gsm-wg)
GridFTP WG (gridftp-wg)
Info Dissemination WG (infod-wg)
OGSA ByteIO Working Group (byteio-wg)
OGSA Data Movement Interface WG (ogsa-dmi-wg)
OGSA-Data Working Group (ogsa-d-wg)
© 2006 OpenGridForum
41
Gap Analysis:
OGF Technical Strategy & Roadmap doc
Category
Security
Operations
Capability
Error! Bookmark not defined.
Multiple Security Infrastructures
Error! Bookmark not defined.
Perimeter Security Solutions
Virtual OrganizationError! Bookmark not defined.
Error! Bookmark not defined.
Encryption
CertificationError! Bookmark not defined.
Error! Bookmark not defined.
Authentication
AuthorizationError! Bookmark not defined.
Web Service Protocol Security
Instantiate New ServicesError! Bookmark not defined.
DeploymentError! Bookmark not defined.
ProvisioningError! Bookmark not defined.
Service Level ManagementError! Bookmark not defined.
Notification]Error! Bookmark not defined.
MessagingError! Bookmark not defined.
Logging ServiceError! Bookmark not defined.
Error! Bookmark not defined.
Service and Resource Monitoring
Metering and AccountingError! Bookmark not defined.
PolicyError! Bookmark not defined.
Policy ManagementError! Bookmark not defined.
AdministrationError! Bookmark not defined.
Systems ManagementError! Bookmark not defined.
Aggregation of Services and ResourcesError! Bookmark
Working Group or Comment
OGSA Auth-Z
Firewall Issues RG
VOMS work applies
Existing technology is currently adequate
CA Ops WG
OGSA-AuthN
OGSA-AuthZ
OASIS/WSS, OGSA Secure Channel
CDDLM-WG, degenerate workflow
ACS-WG, CDDLM-WG
ACS-WG, CDDLM-WG
GRAAP-WG
OASIS/WS-Notification, WS-Eventing
OASIS/WS-Notification, WS-Eventing
Related to metering, see below
Grid Monitoring Architecture
UR-WG and RUS-WG More needed
WS-Policy
Management standards needed for policy
Community practices needed
Reference Model-WG
See OASIS WS-ServiceGroup
Maturity
Evolving
Research
Gap
Out of Scope
Evolving
Evolving
Evolving
Mature
Evolving
Evolving
Evolving
Evolving
Mature
Mature
Gap
Evolving
Evolving
Evolving
Gap
Gap
Evolving
Mature
OGSA-Naming
Resource DiscoveryError! Bookmark not defined.
Error! Bookmark not defined.
Resource Brokering
Job ManagementError! Bookmark not defined.
Choreography, Orchestration and WorkflowError! Bookmark not
WS-Naming-WG, GFS-WG
OASIS/WSDM
RSS-WG
OGSA-BES-WG, JSDL-WG.
OASIS/BPEL, OGSA Workflow Design Team
Evolving
Mature
On Hold
Mature
Mature
Resource VirtualizationError! Bookmark not defined.
Information ModelError! Bookmark not defined.
vii
CPU Scavenging
Legacy ProgramsError! Bookmark not defined.
ReservationError! Bookmark not defined.
Data Movement
Data Access
Data Integration
Data Management
Data ProvisioningError! Bookmark not defined.
MetadataError! Bookmark not defined.
Application DebuggingError! Bookmark not defined.
Application APIsi
Communication ProtocolsError! Bookmark not defined.
ArchitectureError! Bookmark not defined.
Grid SemanticsError! Bookmark not defined.
Grid Fabric Error! Bookmark not defined.
Error! Bookmark not defined.
Fault Tolerance
Error! Bookmark not defined.
Load Balancing
Error! Bookmark not defined.
Failure Recovery
GridVirt-WG, CDDLM-WG
DMTF/CIM, GLUE-WG
Proprietary Solutions Exist
ACS-WG
GRAAP-WG, GSA-RG
DMI-WG, Grid-FTP
GFS-WG and DAIS-WG
DAIS-WG
Storage Network-CG, OGSA-Data-WG
Continuation of EGA data work, OGSA-Data
OASIS/WSRF-RMD,
Evolving
Mature
Mature
Evolving
Evolving
Evolving
Mature
Evolving
Evolving
Gap
Evolving
Gap
Mature
Mature
Mature
Research
Mature
Mature
Mature
Evolving
not
defined.
Resource Management
Error! Bookmark not defined.
defined.
Data
Application Development
Foundations
System Properties
© 2006 OpenGridForum
SAGA-WG,GridRPC-WG,GridCPR-WG,DRMAA-WG
HTTP/SOAP,
Reference Model-WG, OGSA-WG
Semantic Grid-RG
OASIS/WSRF, NM-WG, NML-WG
Implementation Property
Implementation Property
Implementation Property
42
(Formerly) Competing Camps!
• IBM & Friends
• Open Grid Services
Architecture (OGSA)
• Built on top of Web
Services Resource
Framework (WSRF)
• Designed in
collaboration with the
Globus Alliance and
used in GT4
© 2006 OpenGridForum
• MS & Friends
• .NET
• Built on WS-Interoperability
(WS-I)
• Forms basis of MS’s Web
Service Extensions
(WSEs)
43
IBM, MS, HP, Intel Publicly Announce Intent
to Converge Web Service Standards
© 2006 OpenGridForum
44
Strategic Organizational Liaison
• Potential OGC-OGF Collaboration
• Workshop at OGF-20
• May 7, 2007, Manchester, UK
• Organized by Chris Higgins (Edinburgh)
• General Agenda
• Statements from key stakeholders & potential adopters
• Panel on Specific Goals
• Goal
• Memorandum of Understanding outlining concrete steps of
collaboration
• Potential Technical Directions
•
•
•
•
Integration of registry concepts with current standards
Integration of services (e.g., WMS, WFS) w/ emerging WS standards
Identification of suitable security (user identity) model
Integration of resource mgmt, workflow, notification, tools, …
© 2006 OpenGridForum
46
NSF Support for
Semantic Web Research
Frank Olken
National Science Foundation
CISE/IIS
folken@nsf.gov
Presentation to
SICOP Special Conference
Falls Church VA
Feb. 6, 2007
© 2006 OpenGridForum
Why does NSF care about
semantic web technologies?
•
•
•
•
Formalization of scientific knowledge
Facilitate sharing of scientific data
Facilitate access to scientific data and knowledge
Natural language processing
• Information extraction, digital libraries, ...
• Support for digital government
• Semantic rules languages, disaster support, ...
• Support for machine learning
• Support for math/science education
Bullets courtesy Frank Olken, NSF/CISE/IIS, folken@nsf.gov
© 2006 OpenGridForum
48
Debates about semantic
web research
• Skepticism about adoption of semantic tagging by the
masses (and the quality of the tagging)
• NSF is concerned about scientific/govt uses, not MySpace.
• Poor Quality Ontologies
• Ontology development and assessment remains difficult,
rare skill. Some progress (e.g., Ontoclean), clear need for
more research and more training of practitioners.
• Ontology Merging is very very hard:
• Currently subject of research, see Ontoclean work, also work
by Joslyn, et al. on use of partial orders.
• Skepticism of semantics by most of the database
research community:
• Still somewhat an issue, because semantic proposals often
go to to panels dominated by DB researchers. Progress in
adding more semantic web researchers to panels.
Bullets courtesy Frank Olken, NSF/CISE/IIS, folken@nsf.gov
© 2006 OpenGridForum
49
More debates about semantic
web research
• Description Logic vs. First Order Logic
• Heated debates in KR research community about whether
description logics are adequate or whether FOL or other logics
should be used.
• Scalability and structuring of rule bases
• Concerns about the software engineering of large rule bases (or
collections of logic axioms). Efforts to partition such large rule
bases / logic axiom collections (cf. Cyc's microtheories, etc.) This
remains an open research topic.
• Skepticism about scalability of semantic search and
inference engines
• Open research issue ...
Bullets courtesy Frank Olken, NSF/CISE/IIS, folken@nsf.gov
© 2006 OpenGridForum
50
• Which challenges
and priorities does
this group want to
put on their
research agenda?
© 2006 OpenGridForum
51
Final Bit of Wisdom: Why there
are no Penguins at the North Pole
Any further questions?
© 2006 OpenGridForum
52
Download