2004_07_cern_talk - UM Personal World Wide Web Server

advertisement
A Grid Enabled Collaboratory for
Scientific Research (GESCR)
Charles Severance
University of Michigan
Sakai Project, NEESGrid Project, OGCE Project
www.dr-chuck.com
csev@umich.edu
Outline
•
•
•
•
•
•
•
Collaborative Software
Historical Context
The Sakai Project
NEESGrid Experience
Comparing Civil Engineering and HENP
The Concepts of GESCR
Going Forward
Collaboration Happens
• As individuals, we are parts of many
groups and have many roles in those
groups
WLAP
Sakai
Architecture
Board
Dec MTG
Tools
Board
Database
UM Sakai
June MTG
Next Grant
Support
OGCE
Support
Next Ver
uPortal
JSR-168
NEESGrid
Data Model
One View of
Chuck’s
Context Map
Data Repo
Post Oct.
Minnesota
Texas
UM Issues
Version 3
FusionGrid
HENP
GESCR
Another
View
Another
View
Maintaining the Map
•
•
•
•
Read E-Mail and move to proper folders
Copy attachments into folders
Searching for information
Making calendar entries from E-Mail
Imagine Software
• That could create a new “context” in a few clicks
– Enroll/invite others to the context as necessary in a few more clicks
• Context capabilities
– E-Mail list (automatically extracts attachments and places them in
folders which appear on your desktop)
– Schedule (you can either see a “federated” schedule across all
contexts or look at one context)
– Persistent browser-based chat - quite useful during meetings when
the Polycom or VRVS messes up :)
– Resource area where anyone can upload files which appear on
everyone’s desktop at the same time (WebDAV)
– Threaded discussion area for the context
• Problem: There are literally hundreds of solutions to portions of
this problem.
More Software
• A single place to see new activity in your “contexts”
• These contexts are stored on backed-up production
servers rather than your desktop for many years
• A search across your contexts - that would be really
cool
• The ability to customize each context in terms of look,
feel, and capabilities
• The ability to build unique domain specific tools and
interfaces to extend the mechanism using Portlets,
Servlets, or Applets
A 10-Year Collaborative Mission @ UM
Portal Technology
Jetspeed 2.0
uPortal 3.0
Websphere Й
Java
Swing
JSR-168 Technology
Legacy
Channels,
Teamlets
CHEF
Services
JSR-168
Portlets
Sakai GUI
Sakai GUI
Sakai
Teamlet
Sakai
Teamlet
OKI
Services
Sakai
Other
Services
OGCE Grid Portal
NEESGrid
CHEF 1
CHEF 2
Science of Collaboratories
Worktools (Notes Based)
WTNG
CTNG
Coursetools (Notes Based)
SPARC
1991 - 1997 1998
1999
2000
2001
2002
2003
2004
2005
SPARC
2/2001 600 users 800 data sources
CourseTools
Over 42,000 users at the end of 2003
WorkTools
Over 9000 users (2000 active) at the end of 2003
Science of Collaboratories
people-to-people
Communication,
Collaboration
Services
groups-toinformation
Distributed,
media-rich
information
technology
Digital libraries &
documents
http://www.scienceofcollaboratories.org/
NSF Funded ITR
groups-tofacilities
Remote
instruments
CHEF 1.0
•
Fall 2001: CHEF Development begins
– Generalized extensible framework for building collaboratories
– “Best-of” CourseTools, SPARC, WorkTools
•
•
•
Integrate across current UM projects and adopt relevant standards
Funded internally at UM as replacement for CourseTools
All JAVA - Open Source
– Jakarta Jetspeed Portal
– Jakarta Tomcat Servlet Container
– Jakarta Turbine Service Container
•
Build community of developers through workshops and outreach
Not “just” a portal
• Portals are a framework to deploy tools (aka
rectangles) and focus on how the user wants to
arrange their own “rectangles”
• While CHEF technically is a portal, the goal is for the
tools to work together closely and seem to really be
parts of a larger “tool”
• CHEF has a lot of features, (services, presence,
notification, etc..) which bridge the gap between
portal and application framework
CHEF Applications
•
•
•
•
CourseTools Next Generation
WorkTools Next Generation
NEESGrid
NSF National Middleware Grid Portal
CourseTools Next Generation
Over 5000 users at the end of 2003
http://coursetools.ummu.umich.edu/
Worktools Next Generation
New WorkTools Sites being created in WTNG as of 12/2003
Run on the same servers as CTNG.
NEESGrid - The Equipment
Network for
Earthquake
Engineering
Simulation
NSF Funded.
NCSA, ANL, USC/ISI, UM, USC, Berkeley, MSU
CHEF-Based NEESGrid
Software
Grid
Service
Stubs
Portal
Portlets
And
Teamlets
Service
API
Grid
Protocols
Grid
Services
NMI / OGCE
Local
Portal
Services
Remote
Content
Services
HTTP
Remote
Content
Servers
Jetspeed
Internal
Services
Figure 4: The revised portal architecture will provide a unified interface for
portal services.
NSF National Middleware Iniative
Indiana, UTexas, ANL, UM, NCSA
www.ogce.org
What we learned in 10 years.
• Portal technology is a good idea - forces component
approach - functionality does not “smear”
• Portals are not just aggregators of independent
information - but can be an application framework
• Many (but not all) tools can be used for both teaching
and learning and research collaboration
• Separating functionality into lightweight GUI
components and pluggable services with strong and
well-specified APIs allows significant reusability
• GUI elements program to abstract service interfaces not databases, file systems, LDAP, etc. - this allows
great flexibility.
While we were building
collaboratories…
• The Open Knowledge Initiative (OKI) at MIT was developing
APIs for learning management systems - involving many
universities (UM, Indiana, Stanford, and MIT were strong
participants)
• Indiana, Stanford, MIT all developed learning management
system
• Java Community Process (JCP) produced JSR-168 - The
“unified” portal standard API
• Oasis developed the Web Services for Remote Portals (WSRP)
standard
• The open-source uPortal portal project had quietly moved into
the #1 open source portal (#4 including commercial vendors)
So we got together and drew an über
collaboration picture…
July 04
Jan 04
May 05
Activity:
Maintenance &
Transition from a
project to
a community
Michigan
•CHEF Framework
•CourseTools
•WorkTools
Indiana
•Navigo Assessment
•Eden Workflow
•Oncourse
MIT
•Stellar
Stanford
•CourseWork
•Assessment
OKI
•OSIDs
Dec 05
SAKAI 1.0 Release
•Tool Portability Profile
•Framework
•Services-based Portal
•Refined OSIDs
& implementations
SAKAI Tools
•Complete CMS
•WorkTools
•Assessment
SAKAI 2.0 Release
•Tool Portability Profile
•Framework
•Services-based Portal
SAKAI Tools
•Complete CMS
•Assessment
•Workflow
•Research Tools
•Authoring Tools
Activity: Ongoing implementation work at local institution…
uPortal
Primary SAKAI Activity
Architecting for JSR-168 Portlets,
Refactoring “best of” features for tools
Conforming tools to Tool Portability Profile
Primary SAKAI Activity
Refining SAKAI Framework,
Tuning and conforming additional tools
Intensive community building/training
KYOU / sakai
Boundary, Situation
Sakai Core Members
• Universities
–
–
–
–
Indiana
Michigan
MIT
Stanford
• Projects
– Open Knowledge Initiative (OKI)
– uPortal - JaSIG
• Funding ($6.8M - 2 Years)
–
–
–
–
Mellon Foundation
Hewlett Foundation
Partners Program
Core member match
Sakai Concepts
• It is neither research nor teaching, it is
all “collaboration” - many common tools
– Teaching: Courses, tools, drop-boxes
– Research: Putting the user interface on the
Grid and Virtual Organizations
Teaching and
Learning
Collaborative
Research
Collaboration and
Learning Environment
What we agreed to build…
• A Collaborative Learning Environment
– Open Source
– Uses OKI (Open Knowledge APIs)
– Uses uPortal as its portal framework
• Similar to
– Blackboard
– WebCT
• And all four core institutions would deploy the
commonly developed software
Collaboration and Learning
Environment
• Learning management systems are
really just a form of collaboration
– Freshman Calculus
– Chess Club
– Group of 5 faculty members working on
curriculum
– 2000 physics researchers collaborating
across the world on a 15-year physics
experiment
Sakai 1.0
• Site based collaboration environment
–
–
–
–
–
–
–
Worksite management
E-Mail Lists
Threaded Discussion
Resources (folders) with WebDav support
Chat
No search yet :(
Many other tools
• Beta Release July 15, 2004
• Production site available at ctools.umich.edu
More Sakai Beta Tools
Admin: Alias Editor (chef.aliases)
Admin: Archive Tool (chef.archive)
Admin: Memory / Cache Tool
(chef.memory)
Admin: On-Line (chef.presence)
Admin: Realms Editor (chef.realms)
Admin: Sites Editor (chef.sites)
Admin: User Editor (chef.users)
Announcements (chef.announcements)
Assignments (chef.assignment)
C. R. U. D. (sakai.crud)
Chat Room (chef.chat)
Discussion (chef.discussion)
Discussion (chef.threadeddiscussion)
Dissertation Checklist (chef.dissertation)
Dissertation Upload
(chef.dissertation.upload)
Drop Box (chef.dropbox)
Email Archive (chef.mailbox)
Help (chef.contactSupport)
Membership (chef.membership)
Message Of The Day (chef.motd)
My Profile Editor (chef.singleuser)
News (chef.news)
Preferences (chef.noti.prefs)
Recent Announcements
(chef.synoptic.announcement)
Recent Chat Messages (chef.synoptic.chat)
Recent Discussion Items
(chef.synoptic.discussion)
Resources (chef.resources)
Sample (sakai.module)
Schedule (chef.schedule)
Site Browser (chef.sitebrowser)
Site Info (chef.siteinfo)
Web Content (chef.iframe)
Worksite Setup (chef.sitesetup)
WebDAV
Sakai Going Forward
• Focus on the”Learning” of Collaborative Learning
Environment through 2Q05
– Getting ready for production deployment at the four partner
sites
– Improving the look and feel of the software
– Many feature enhancements (to satisfy four + 60 schools)
– New GUI Programming Environment based on Java Server
Faces
– Building new set of Sakai APIs (Java)
• Based on OKI - Enabling RDF
• Move into OGCE and NEESGrid starting 3Q04
• Release 2.0 - 2Q04
The George E. Brown, Jr. Network
for Earthquake Engineering
Simulation (NEES)
NEES Founding
• George E. Brown, Jr. Network for Earthquake
Engineering Simulation (NEES).
• Funded in 1999 - > $100M
• Goal: Transform the nation’s ability to carry out
earthquake engineering research, to obtain
information vital to develop improved methods for
reducing the nation’s vulnerability to catastrophic
earthquakes, and to educate new generations of
engineers, scientists and other specialists committed
to improving seismic safety.
• To be Completed: October 2004
• NEESgrid facilitates research capabilities previously unavailable
• NEESgrid links earthquake researchers across the U.S. with
leading-edge computing resources and research equipment and
allowing collaborative teams (including remote participants) to
plan, perform, and publish their experiments
• NEESgrid is a coordinated and secure architecture/environment
• NEESgrid is a modular and extensible environment with a
customizable user interface
• NEESgrid provides common tools that allow leveraging resources
and experiences
• Rather than having to worry about the required cyber
infrastructure, NEESgrid allows researchers to focus on the
earthquake engineering challenges at hand
• The goal of the System Integrator (SI) is to develop NEESgrid as
the Cyber Infrastructure that will facilitate this next generation of
experimentation/simulation in earthquake engineering
NEES Components
• New experimental facilities (15)
– Oregon State University, Rensselaer Polytechnic Institute,
University of Buffalo, University of Colorado at Boulder,
University of Minnesota, University of Nevada at Reno,
University of Texas at Austin, and the University of California
campuses at Berkeley, Davis and Los Angeles
• Collaborative Software System: NEESGrid
–
–
–
–
–
Collaboration
Data capture and sharing
Tele-presense and Tele-operation
Simulation
Support for Hybrid Simulation and Physical Experiments
Shake table: Nevada, Reno
Reaction wall: Minnesota
Centrifuge: UC Davis
Wave basin: Oregon State
Field structural: UCLA
Field geotechnical: Texas
If we build it, they will
collaborate
• Data and access to data represent fundamental
barriers to dispersed collaboration
• Efficient movement of vast amounts of data is a prime
rationale for cyberinfrastructure
• Federating, visualizing and mining data are principle
challenges
The collaboratory concept
Synchronous
communication
Asynchronous communication
Researchers
Synchronized
data
Synchronized data and images
Data discovery
Teleoperation
Teleobservation
Facilities
Automatic
archiving
Simulation codes
Hybrid experiments
Data
Remote Users
(Faculty,
Students,
Practitioners)
Instrumented
Structures
and Sites
NEES Resources
Simulation
Tools
Repository
Laboratory
Equipment
Field Equipment
Curated Data
Repository
Leading Edge
Computation
Global
Connections
Remote Users:
Laboratory Equipment
(K-12 Faculty and
Students)
The Grid in NEESgrid
Grid
Operations
Center
Experimental Component
Grid Data
Repository
Campus Net Component
NEESgrid Component
Internet Fabric and Operations
Hub A
Hub C
Hub B
NEESpop A
Teleobservation
Equipment
Experimental
Equipment
Telepresence
Equipment
Video I/O
Active PI
Passive co-PI
Audio I/O
Data Cache
Data Cache
Site A: Experimental
Data Producer
Site B: Remote
Lead Investigator
Site C: Passive
Collaborator
NEESGrid Software
• Founding NMI Technologiess
– Globus Toolkit
– OGCE Collaboration Toolkit
• New Work
–
–
–
–
Data and Metadata Repository - NCSA
Data Acquisition, Storage, and Visualization
Simulation Portal
Synchronized data and video (live and stored)
NEESgrid Experiment Data Flow
SiteSpecific
ExperimentalSetup
ProjectRelated
Data
Model
Project
Browser
Data
Ingestion
Experiment
Control
ExperimentalElement
DataElement
NEESGrid
Data
Repository
Stored
Viewer
Streaming
Viewer
C
Data
Turbine
DAQ
D
DAQ
Disk
Capturing Video and Data
PTZ/
USB
BT848
DAQ
Still
Capture
DT Client
Camera Control
Gateway
Video
Frames
DT Client
DT Main System
Data
Capture
DT Client
Site A
Simulation
Coordinator
Site B
Data Monitoring Tools
Camera Control
Gateway
Still Image / Camera Control
^
< >
Still image
camera
control
^
DT Main System
Thumbnail
~ < >
Creare
viewers
Video and
Data Tivo
Thumbnail + Audio + Data
<
>
+
Sample
Experiment
Setup
NTCP
Server
Comp
Sim
Simulation
Coordinator
DAQ
Live
Extractor
DAQ Data
Quicktime
Data
Repository
NTCP
Server
DAQ
Data Models
• Data models are developed in RDF
• Local repository supports multiple simultaneous data
models with cross-model linkages
• Metadata browser (aka Project browser) becomes
the Project Browser, Notebook Browser, Site
Specification Database Browser
• Metadata browser can federate multiple sources of
Metadata
Specimen
EquipmentSetup
CalibrationSet
Instrumentation
Setup
Equipment
SensorGroup
DataEquipment
Camera
Sensor
Overall Data Modeling Efforts
NEES
Site
Specifications
Database
Site A
Equipment
Project
Description
Domain
Specific
models
Common
Elements
Data /
Observations
Site B
Equipment
People
Experiments
Trials
Tsnumai
Specimen
Shake Table
Specimen
Units
Data
Ref. Source: Chuck Severance
Site C
People
Experiments
Geotech
Specimen
Sensors
Data
Trials
Centrifuge
Specimen
Descriptions
Data
Models + Data Model
Data
RDF Load
<owl:ObjectProperty rdf:ID="hasPublications">
<rdfs:domain>
<owl:Class>
<owl:unionOf rdf:parseType="Collection">
<owl:Class rdf:about="#Project"/>
<owl:Class rdf:about="#Task"/>
</owl:unionOf>
</owl:Class>
</rdfs:domain>
<rdfs:range rdf:resource="#Publications"/>
</owl:ObjectProperty>
Models
RDF/
OWL
Configure
Repo
Models + Data Model
Data
RDF Load
Repo
Models
RDF/
OWL
Protégé - 2K
Configure
DOE ELN / Example
NEESgrid: Simulation
Lessons Learned in NEESGrid
(Chuck’s views)
• A collaborative environment *must* have a user interface
• Just asking about requirements is a tiny part of the problem - CS
developers must learn to “walk in the shoes” of the scientists.
• Scientists know a lot about Computer Science - listen to them
and involve them
• The real work begins after software is “delivered” - Version 1.0 is
usually just a conversation starter - but a very important step
• There are some things that are useful across fields - but the
most valuable elements are field-unique
• Data models, data repositories, and long-term curation is
difficult!
HENP/Civil Similarities
HENP/Civil: Differences
• Willingness to collaborate
– CE: Not too keen on the idea - NSF forcing the
issue using “carrot and stick”
– HENP: It is part of the fabric of the field (at least
within Atlas and CMS :) )
• Technology Savvy
– CE: It is all about the “within lab” electronics
– HENP: In lab electronics is difficult, important and
different. The internet is a tool to be used both for
human communication and data manipulation have tried everything - use simple reliable stuff
because the work cannot wait.
NEES Going Forward
• Release 3.0 complete July 31, 2004
– Improve documentation and bug fixes through
September 30, 2005
• As of October 1, 2004, NEESGrid will
transition from development lead by NCSA to
maintenance led by SDSC
– Focus on deployment and support
– UM will continue involvement - move to Sakai improving data modeling, internationalization
(Japan)
GESCR* - The Vision
* A Grid Enabled Collaboratory for Scientific Research
HENP: Unique Aspects
• Already highly collaborative as a
scientific field
• Globally distributed in space and time
(both time-zones and years)
• Technology savvy but wary :)
• Solutions underway (iVDGL, GriPhyn,
etc) without significant user interface
component
GESCR in a sentence..
….. will combine the best open-source applications from
within the HENP communities (and from allied Grid and
National Middleware efforts) within a common portal
interface.
Put another way: There is a lot of work going on in HENP
that does not really have much of a user interface yet. We
want to put a user interface on the Grid and other activities.
GESCR in a sentence..
….. will combine the best open-source applications from
within the HENP communities (and from allied Grid and
National Middleware efforts) within a common portal
interface.
What I tell people:
GESCR will be a tri-corder for the Atlas
and CMS collaborations. Starting with
inter-human communication and
moving towards data processing and
analysis.
Elements of GESCR
• Sakai with Grid Support (from OGCE /
NEESGrid)
• Sakai Collaboration Tools
• Well funded analysis of application,
requirements, and challenges throughout the
project
• Additional new capabilities
– VRVS Integration (rooms, record, playback)
– Web Lecture Archive (WLAP)
– VNC support
GESCR Elements (cont)
•
•
•
•
MonaLisa integration
Collaborative LaTeX editing environment
Peer-to-Peer file sharing
Language of Access to handle “overcollaboration”
• Adaption of HEPBook
• GESCR support office (Y3)
• Education and outreach activities
GESCR Project
• Planned: Four years - Medium ITR (NSF)
• Partners: Caltech, Fermilab, Florida International
University, Lawrence Berkeley, Maryland, University
of Michigan, University of Texas Austin, University of
Iowa
• Joint between Atlas and CMS
• Has been submitted twice to the NSF ITR program at
the urging of the Physics directorate to create a
CMS/Atlas joint IT effort
GESCR - Going Forward
• Waiting for funding decision on most recent
submission
– If funded, work on unique physics capabilities
(VRVS, MonaLISA, etc) will begin immediately
– If not funded, leverage NEESGrid / Open Grid
Computing Environment / Sakai - focus on the
generic tools - deploy in a physics context
• Immediately, we can begin some
experimental usage for interested groups
using the University of Michigan CTools
service
University of Michigan: Ctools
• University of Michigan runs a large scale professional
server cluster for teaching and learning and research
collaboration
– Research collaboration simply needs a single UMich
member of each group - the rest of the members have nocharge “friend” accounts based on E-Mail addresses.
– For multiple accounts, it is best to have a UM person to act
as your “problem solver” familiar with your group. If there is
interest, we can provide staff from the MGrid
(www.mgrid.umich.edu) organization to support HEP
collaborative activities.
• ctools.umich.edu
Sakai-Based CTools @ UM
In production August 1, 2004.
Summary
• We will deploy collaboration software for Atlas
/ CMS sooner or later
• If / when we get funding it will accelerate the
process and provide Physics specific
capabilities
• Collaborative portals is a well-funded and
well-coordinated activity (Sakai 30+FTE,
NEESGrid 10+FTE, OGCE 5+ FTE)
• It is (almost) a no-lose situation - the longer
we wait, the more mature the software will be
Questions
• Thank you for your time…
• Questions to csev@umich.edu
Download