ogce-rt-projectreview

advertisement
Open Grid Computing
Environments: Advanced
Gateway Support Activities
RT Project Review
October 7th, 2010
Software
Description
OGCE Gadget
Container
Google Gadget/Open Social compatible
software for building Web-based user
interfaces.
XBaya
A visual user interface for composing,
launching and monitoring workflows
GFAC
An application factory service for wrapping
command-line tools as Web services
XRegistry
A service and workflow registry
Registry Gadget,
Experiment Builder
User interface for registering services and
creating online experiments with registered
workflows
Software Strategy
• We develop and package downloadable software
• Focus on gadget container and tools for running
science applications on grids and clouds.
• Provide a tool set that can be used in whole or in
part.
– If you just want GFac, then you can use it without
buying an entire framework.
• Outsource security, information services, data
and metadata, etc to other providers.
– MyProxy, TG IIS, Globus, Condor, XMC Cat, iRods, etc.
NSF SDCI Award: 2010-2013
• Indiana: Marlon Pierce (PI), Suresh Marru (CoPI), Raminder Singh, Rob Quick, Gregor von
Laszewski, Gerald Guo
• Purdue: Carol Song (Co-PI), David Braun, Lan
Zhao
• NCSA/UIUC: Sudhakar Pamidighantam (Co-PI),
Shaowen Wang, Yan Liu
• UTHSCSA: Emre Brookes (Co-PI), Borries
Demeler
Project Highlights
• Full Circle Development
– Directly fund both software developers and gateway
consumers.
• Directly supported (non-IU) gateways:
– UltraScan (UTHSCSA), GridChem (NCSA),
SimpleGrid/GISolve (UIUC), Purdue CCSM and
Environmental Data Gateways
– Among the most used TG gateways.
• Sustainability strategy: Apache Incubator for
workflow suite of tools
– XBaya, GFac, and supporting services.
UltraScan High Level Overview
User
Web Server
MySQL DB
US LIMS
UTHSCSA Jacinto
Terascale storage
GridControl
TeraGrid
TIGRE/Globus
Network
High Performance
Computing Clusters
UltraScan Middleware
• Immediate Goals: Use GFAC as a
replacement job submission
service.
– GRAM 2, 4, 5 independence
– Significant effort into GRAM5
testing on Ranger.
• Longer term goals
– Integrate with TG information
services to provide better job
scheduling.
• OGCE Resource Prediction Service
– Support UNICORE job
management.
Current Architecture
Gram5 Testing Steps
• Developed Testing harness to run different
UltraScan cases.
• Started with small number of jobs and increased
the concurrency later
• Watched job behavior of the job on resource and
monitored the gram log
– There were lot of issue which we found from
the logs and working with Globus team to fix
them
• Recorded all the job run data to create a google
gadget to create graph for different runs on
different resources.
TG Resources and Patterns
Version
Resource
Endpoint
GT 5.0.2
QueenBee
queenbee.loni-lsu.teragrid.org:2120/jobmanager-pbs
GT 5.0.2
Ranger
login5.ranger.tacc.teragrid.org:2120/jobmanager-sge
GT 5.0.2
Lonestar
gatekeeper.lonestar.tacc.teragrid.org:2120/jobmanagerlsf
Patterns:
Concurrent jobs
Batch Size
Total jobs
Job Status Pass : Fail
1
10
10
10:0
3
10
30
30:0
5
10
50
50:0
10
10
100
20:0
20
10
200
40:0
50
10
500
100:0
100
10
1000
200:0
200
5
1000
Not tested (Need allocation)
500
2
1000
Not tested (Need allocation)
Learning from this exercise on Ranger
• There were problems with Sun Grid Engine job manager
(sge.pm) not loading system environment and user
environment settings.
• You can't load modules using Gram parameters.
– On Ranger all the libraries are modules and user may want to load
some special module version for his job.
– Example is if you have different version of your code using different
modules.
• On Ranger user default shell is C shell and Gram runs using
bash shell. This is not a problem but users need to be
informed where to add user environment.
• You can look at detailed results on following
http://teragridforum.org/mediawiki/index.php?title=GRAM_5_Testing
GridChem Science Gateway
• A chemistry/material Science Gateway for running
computational chemistry codes, workflows, and parameter
sweeps.
• Integrates molecular science applications and tools for
community use.
• 400+ users heavily using TeraGrid. One of the consistent top5
TeraGrid Gateway users.
• Supports all popular Chemistry applications including
Gaussian, GAMESS, NWChem, QMCPack, Amber and MolPro,
CHARMM
• ParamChem is a follow-on project to develop workflows for
chemical parameter studies and provide the infrastructure to
execute them.
Cyberenvironments for Parameterization
Computational Reference Data Generation
GridChem , ParamChem
Integration
• Immediate Goals:
integrate XBaya workflow
composer with GridChem
user interface and
middleware.
– Target SC10
• Longer term goals:
provide part of GridChem
next generation
middleware; support
large scale ParamChem
workflows.
SimpleGrid (UIUC)
SimpleGrid is a Gateway teaching tool that is also
the core of GISolve
SimpleGrid Gadgets
Yan and Gerald’s work from our Oct 6th hacking session.
SimpleGrid, GISolve
• Short term goal: develop SimpleGrid Gadgets
deployable into gadget container.
– Must meet security requirements
– Support PHP development
– Support interactivity requirements
• Integrate YUI JavaScript libraries with Gadget
JavaScript.
• Longer term goals: investigate workflow, job
management tools. Apply to GISolve
Purdue CCSM and Data Portals
• Short terms goals: Develop CCSM and data
management gadgets and necessary backing
middleware.
– Support GWT development
– Interactivity and security requirements.
– Significant requirements overlap with SimpleGrid
• Longer term goals: Build gateways out of
gadgets hosted by multiple containers;
examine workflow and other tools.
Conclusion
• We focus initially on one component per
gateway.
– SimpleGrid, CCSM, Data Portal: gadgets
• Other gadget based gateways at UC
– GridChem: Xbaya
– UltraScan: GFac
• Goal is to establish an Apache-style
meritocracy for contributed code.
• Making distributed teams work: hacking
retreats, SourceForge, Jira, Google apps
Acknowledgments
• The following student interns have
contributed to the project over the last year
– Ye Fan
– Patanachai Tangchaisin
– Suresh Kumar Deivasigamani
– Roland Mai
– Sashikiran Challa
Backup Slides
OGCE Gateway Tool Adaption & Reuse
LEAD
LEAD
Experiment Builder, XRegistry Interface
GFac, XBaya,
XRegistry, FTR
Eventing System
GridChem
Xbaya, GC Middleware
GridChem
Resource
Discovery Service
TeraGrid
User Portal
GPIR, File
Browser
OGCE Team
Gadget Container,
GTLab, Javascript Cog,
XRegistry Interface,
Experiment Builder,
Axis2 Gfac, Axis2
Eventing System,
Resource Prediction
Service, Swarm
Ultrascan
OGCE
Re-engineer,
Generalize,
Build, Test and
Release
GFac, Eventing
System
OVP/
RST/
MIG
BioVLab
XBaya, GFac
ODI
Workflow Suite, Gadget Container
Bio Drug Screen
Swarm->GFac
EST Pipeline
Swarm->GFac
Future Grid
GFac, Xbaya, …
21
Putting It All Together
Software Strategy
• Focus on gadget container and tools for running
science applications on grids and clouds.
• Provide a tool set that can be used in whole or in
part.
– If you just want GFac, then you can use it without
buying an entire framework.
• Outsource security, information services, data
and metadata, etc to other providers.
– MyProxy, TG IIS, Globus, Condor, XMC Cat, iRods, etc.
Advanced Support Scenarios
• GridChem/ParamChem workflow support
• UltraScan Job Submission (GFAC)
• EST Pipeline
– Bioinformatics pipeline for managing mass job
submission.
More Information
• This is downloadable, packaged software.
– Apache Maven build system provides everything
you need to to build the gadget container,
gadgets, workflow composer, and backing
services.
– Get code by anonymous SVN checkout.
• Email: mpierce@cs.indiana.edu,
smarru@cs.indiana.edu, ogcediscuss@googlegroups.com
• OGCE Web Site: www.collab-ogce.org
• Blog/News Feed: http://collabogce.blogspot.com/
Acknowledgements and People
• Funding by TeraGrid GIG, RP and by OCI SDCI
• IU: Marlon Pierce, Suresh Marru, Raminder
Singh, Archit Kulshrestha, Zhenhua Guo
• TACC: Maytal Dahan, Rion Dooley
• SDSC: Nancy Wilkins-Diehr, Jeff Sale
• SDSU: Mary Thomas
GridChem Advanced Support
• IU Personal – Suresh Marru, Marlon Pierce,
Raminder Singh, Ye Fan
• Gateways Area Director – Nancy Wilkins-Diehr
• GridChem Team – Sudhakar Pamidigantam,
Rion Dooley, Vikram Gazula & Rest of
GridChem team.
Empirical ForceFields Parameterization
Need
Process
Lack of Accurate Force Fields Produce
Erroneous Property Estimation
Fig. 1. Errors (V) in electrostatic potential on a surface at 1.8 times van der Waals radii around N-methyl
propanamide for two models. (Left) Point charges; (right) charge, dipole, and quadrupole on C, N, and O; charge and
dipole on H. The errors are much reduced in the multipole approach
A. J. Stone Science 321, 787 -789 (2008)
Published by AAAS
Vanommeslaeghe et al.
J. Comp.Chem 2010, 31,
671-690
Cyberenvironments for Parameterization
Computational Reference Data Generation
Molecular Force Field Cyberenvironments
Parameter Initialization and optimization Workflow
Workflow
Manager
Parameter definitions
Optimization
Monitor
Model/Reference Data
Definition
Merit Function
Specification
Optimization
Methods Choice
Consistency Checker
Optmization Job
Launcher
Optimization
Incomplete?
Expert
Interface
Optimization Job
Completed?
Paramater testing Model
Successful Testing
Paramater Sensitivity
Analysis
Update Parameter Database
with new set
Notification of End
of Workflow
OGCE Gateway Tool Adaption & Reuse
LEAD
LEAD
Experiment Builder, XRegistry Interface
GFac, XBaya,
XRegistry, FTR
Eventing System
GridChem
XBaya
GridChem
Resource
Discovery Service
TeraGrid
User Portal
GPIR, File
Browser
OGCE Team
Gadget Container,
GTLab, Javascript Cog,
XRegistry Interface,
Experiment Builder,
Axis2 Gfac, Axis2
Eventing System,
Resource Prediction
Service
Ultrascan
OGCE
Re-engineer,
Generalize,
Build, Test and
Release
Gfac, Eventing
System
OVP/
RST/
MIG
BioVLab
XBaya, GFac
ODI
Workflow Suite
Bio Drug Screen
Workflow Suite
EST Pipeline
Workflow Suite
Future Grid
???
31
Cyberinfrastructure Layers
User
Interfaces
Gateway
Software
Web/Gadg
et
Container
Web/Gadge
t Interfaces
Application
Abstractions
Fault
Tolerance
Workflow
System
Auditing &
Reporting
Resource
Middleware
Cloud Interfaces
Compute
Resources
Computational
Clouds
Color Coding
Gateway
Abstraction
Interfaces
Web Enabled
Desktop
Applications
Monitoring
Registry
User
Management
Information
Services
Security
Provenance
& Metadata
Management
Grid Middleware
SSH & Resource
Managers
Computational
Grids
Local Resources
OGCE Gateway Components
Complimentary Gateway Components
Dependent resource provider components
LEAD/OGCE Layered Workflow
Architecture
Workflow
Interfaces (Design
& Definition)
XBaya GUI
(Composition,
Deploying, Steering
& Monitoring)
BPEL 2.0
Flex/Web
Composition
Gadget Interface for
Input Binding
Python
Scufl
Workflow
Specification
BPEL 1.0
Workflow
Execution & Control
Engines
Apache
ODE
GBPEL
Java Code
Pegasus DAG
Dynamic
Enactor
Condor
DAGMan
Jython Interpreter
Taverna
GFac Current & Future Features
Globus
Apache Axis2
Input
Handlers
Output
Handlers
Auditing
Registry Interface
Fault
Tolerance
Checkpoint
Support
Scheduling
Interface
Monitoring
Interface
Data Management Abstraction
Job Management
Abstraction
Campus
Resources
Amazon
Eucalyptus
Unicore
Condor
Color Coding
Existing Features
Planned/Requested Features
Gateway Computing Environments (GCE10)
Download