Security-oriented Portals for the Life Sciences Prof. Richard O. Sinnott

advertisement
Security-oriented Portals for the Life Sciences
Prof. Richard O. Sinnott
National e-Science Centre
University of Glasgow, Scotland
r.sinnott@nesc.gla.ac.uk
IWPLS 2009,
Edinburgh, 14th September 2009
Overview
NeSC Glasgow
Security and Grid security
X
X
X
X
Compute-oriented,
Data-oriented,
User-oriented,
…
Life Sciences at NeSC Glasgow
Data Management through e-Social Science (DAMES)
Disorders of Sex Development (EuroDSD)
Demonstration/videos
Conclusions
IWPLS 2009,
Edinburgh, 14th September 2009
NeSC Glasgow
E-Science Hub
Externally
X
Glasgow end of NeSC
– Involved in numerous UK wide activities/projects
Internally
X
X
Focal point for e-Science research/activities at Glasgow
Work closely with foundation departments
– Department of Computing Science
J. Jiang
Chris
Bayliss
» Established first UK Grid Computing course
X
– Department of Physics & Astronomy
Also working with other groups including
– Bioinformatics Research Centre,
– Biostatistics
– Electronics and Electrical Engineering
– Clinicians & numerous hospitals across Scotland,
– Arts & Humanities,
– University Services …
NeSC GU now part of University IT Services
IWPLS 2009,
Edinburgh, 14th September 2009
J.Mohammad
(PhD)
Gordon
Stewart
C. Millar
T.Doherty
VPman
S. Hussain
(PhD)
David Martin
(ScotGrid
sys-admin)
Camera
Shy
H. Sarwar
NeSC Glasgow Projects
Running/Due to Start
Completed
National e-Science Centre (NeSC-I, NeSC-II, NeSC-III)
Dynamic Virtual Organisations for e-Science Education
(DyVOSE)
Biomedical Research Informatics Delivered by Grid
Enabled Services (BRIDGES)
Grid Enabled Microarray Expression Profile Search
(GEMEPS)
GridNet
Glasgow early adoption of Shibboleth (GLASS)
Joint Data Standards Survey (JDSS)
ESP-Grid
GridNet-2
HPC Compute cluster award
Sun industrial sponsorship
OGC Collision
OMII-Security Portlets
OMII-RAVE
Grid Enabled Occupational Data Environment (GEODE)
Towards an e-Infrastructure for e-Science Digital
Repositories
Grid enabled Biochemical Pathway Simulator
Virtual Organisations for Trials and Epidemiological
Studies (VOTES)
Towards a European e-Infrastructure for e-Science
Repositories
Modelling, Inference and Analysis for Biological Systems
up to the Cellular Level
IWPLS 2009,
Edinburgh, 14th September 2009
Scottish Bioinformatics Research Network (SBRN)
Generation Scotland Scottish Family Health Study
Meeting the Design Challenges of nanoCMOS
Electronics (nanoCMOS)
Integrating VOMS and PERMIS for Superior Grid
Authorization (VPman)
Drug Discovery Portal
EU FW7 Avert-IT
EU FW7 EuroDSD
Breast Cancer Tissue Biobank
Data Management through e-Social Science (DAMES)
NeSC Research Platform (NRP)
NeSC Information Network (NIN)
Advanced Grid Authorisation through Semantic
Technologies (AGAST)
ShinTau (Supporting Multiple Shibboleth Attribute
Authorities)
Pharming of Therapeutic RNA
NCeSS Technical Management
CESSDA PPP
Scottish Health Informatics Platform for Research (SHIP)
National E-Infrastructure for Social Simulation (NeISS)
Enhancing Repositories for Language and Literature
Researchers (ENROLLER)
SECURITY
IWPLS 2009,
Edinburgh, 14th September 2009
Populations
Organisms
Physiology
Organs
Tissues
Cell signalling
Cell
Protein-protein interaction (pathways)
Protein functions
Protein Structures
Gene expressions
Nucleotide structures
Nucleotide sequences
The e-Health Future…
e-Security
Security
Key is that should support
X
X
seamless access to a heterogeneous variety of “distributed” compute and data
(and other) resources
– Often domain specific – especially data!
single sign-on
– Authenticate once and access numerous distributed resources
AAAA (
+privacy, confidentiality, integrity…)
– Authentication
» (know who “they” are)
– Authorisation
» (decide what “they” can do and enforce it)
– Auditing/accounting
» (keeping track of who did what/when for security checks/charging etc)
Portals good fit here…
Interface to targeted resources
X
Services, data resources, tools, …
First line of security…
IWPLS 2009,
Edinburgh, 14th September 2009
Authentication
A_ _ _
Authentication – knowing your punters
X
X509-based Public Key Infrastructures (PKI)
– Hated by all non X509-PKI Grid folk
» (=99.9% of people academics/researchers)
– Should never have to be made visible to users!
– $> openssl pkcs12 -in cert.p12 -clcerts -nokeys -out usercert.pem
Ouch!
IWPLS 2009,
Edinburgh, 14th September 2009
More User Oriented Security
A_ _ _
Federated Authentication, e.g. through Shibboleth
LDAP
4.
Identity Provider
AuthN
Ho
m
e
Home Institution
3.
U
ho ser
m se
e
in lect
st s
itu th
tio eir
n
sit
e
au
th
en
tic
at
es
Federation
us
er
Service provider
Log-in once and roam
W.A.Y.F.
User
1.
2
us . Sh
er ib
to bo
W let
.A h
.Y red
.F ir
. s ec
er ts
vic
e
User points browser at Grid
resource/portal (or non-Grid
resource)
IWPLS 2009,
Edinburgh, 14th September 2009
5. User accesses resource
Web site/e-Journal
_A__
Authorisation
Defining what they can do and define and enforce rules
X
Each site will have different rules/regulations
Also known as Virtual Organisations (VO)
X
Collection of distributed resources shared by collection of users from one or more
organizations typically to work on common research goal
– Provides conceptual framework for rules and regulations for resources to be
offered/shared between VO institutions/members
– Different domains place greater/lesser emphasis on expression and
enforcement of rules and regulations (policies)
VO
Org1
{Resources}
{Users}
...
{Resources}
IWPLS 2009,
Edinburgh, 14th September 2009
Orgn
{Users}
Shibboleth-based Federated VOs
uid
Identity Provider
AuthZ
LDAP
AuthZ
Service provider
LDAP
Identity Provider
Shib
Frontend
AuthN
LDAP
LDAP
W.A.Y.F.
2
us . S
er hib
to bo
W let
.A h
.Y red
.F ir
. s ec
er ts
vic
e
1. User points browser at Grid
resource/portal
6. Make final AuthZ decision
ion
licat
p
p
A
Grid
5. Pass authentication info and
attributes to authZ function
Ho
m
pu u e s
sh se ite
e r
a
se s at and uth
en
rv tri
ice bu
tic
t
at
pr es
es
ov to
ide th
r e
3.
U
ho ser
m se
e
in lect
st s
itu th
tio eir
n
Home Institution
4.
Home Institution
4.
H
pu om
sh e s
es ite
at au
tri
bu the
te nti
s
to cate
th s
e us
se e
rv r a
ice nd
Federation
pr
ov
ide
r
User points browser
Grid Portal
at Grid resource/
portal
User
IWPLS 2009,
Edinburgh, 14th September 2009
User
AuthN
Privileges, Resources, Access
Control and Trust
uid
Identity Provider
AuthZ
LDAP
Home Institution
3.
U
ho ser
m se
e
in lect
st s
itu th
tio eir
n
Shib
Frontend
AuthN
4.
H
pu om
sh e s
es ite
at au
tri
bu the
te nti
s
to cate
th s
e us
se e
rv r a
ice nd
Federation
pr
ov
ide
r
W.A.Y.F.
User
Service provider
LDAP
2
us . Sh
er ib
to bo
W let
.A h
.Y red
.F ir
. s ec
er ts
vic
e
6. Make final AuthZ decision
tion
plica
p
A
Grid
5. Pass authentication info and
attributes to authZ function
1. User points browser at Grid
resource/portal
IWPLS 2009,
Edinburgh, 14th September 2009
Grid Portal
SPAM-GP Portlets
‰ Family of JSR-168 compliant portlets for VO admins:
‰ scoped attributed management portlet (SCAMP)
¾ Who do you trust?
‰ dynamic portal configuration management (CCP)
¾ What you can see is what you can do.
‰ attribute certificate portlet (ACP)
¾ securely push attributes out to collaborators and use them when
accessing protected services
IWPLS 2009,
Edinburgh, 14th September 2009
Demonstration
IWPLS 2009,
Edinburgh, 14th September 2009
Data, Data Everywhere…
Data Management through e-Social Science
(DAMES – www.dames.org.uk)
Various data management themes
X
X
X
X
Occupational data theme
– builds on GEODE project (www.geode.stir.ac.uk)
Education data theme (GEEDE)
Minorities/Ethnicity theme (GEMDE)
E-Health theme (GEHDE)
– Initial focus on depression, self-harm and suicide
» Does the number of people in a household have any effect on suicide rates?
» Is there a correlation between age, sex, marital status, history of drug use (including
prescribing drugs / anti-depressants) on suicide?
» What is relation with access to parkland/green fields on depression?
» What is optimal way to treat different forms of depression, e.g. drug treatments,
therapists, …?
» …
IWPLS 2009,
Edinburgh, 14th September 2009
Data, Data Locked Away Everywhere…
GEHDE
Linking clinical data with social science data for research into self
harm, depression and suicide
X
Scottish Morbidity Records (SMR)
– Aggregated clinical records from last 30+ years across Scotland
» SMR01A
» SMR04A
discharges
» SMR06A
» SMR99A
General acute inpatient and day case discharges (3,719,206 records)
Psychiatric and mental handicap hospitals and units: admissions, residents and
(241,599 records)
Scottish cancer registrations (171,167 records)
Deaths (173,615 records)
– All SMR data contain various geographical indicators for instance postcode, regional
areas, council areas, census output areas
X
General Practitioners Administration System for Scotland (GPASS)
– E-Pharmacy
» Prescriptions issued for drugs across Scotland, e.g. anti-depressants
X
X
Census health related data
– CasWeb / MIMAS
Geospatial data
– www.edina.ac.uk
IWPLS 2009,
Edinburgh, 14th September 2009
Demonstration
IWPLS 2009,
Edinburgh, 14th September 2009
Example of EuroDSD
Investigation of the molecular pathogenesis and
pathophysiology of Disorders of Sex Development
(DSD) - EuroDSD
3-year project started May 2008
X
X
X
Builds on initial software prototypes built for ESPE by NeSC
Support for research into disorders of sex development
– Currently quite topical
Clinical Contributors across Europe
– Germany, France, UK, Sweden, Italy, Netherlands, …
IWPLS 2009,
Edinburgh, 14th September 2009
EuroDSD Work
WP1 - Virtual Research Environment (VRE) for DSD research
More later
WP2 - Identification of novel genetic markers for DSD
Design and validate a DSD GeneChip
Identify and confirm novel genetic markers of DSD
WP3 - Functional assessment of androgen receptor (AR) mutant
analysis
Europe-wide study of patients with partial androgen insensitivity syndrome
(PAIS) to correlate phenotype, AR mutation and function in vitro with
pubertal outcome
WP4 - Characterization of the “androgen-memory”
high throughput methylation analyses related to phenotype, genotype,
molecular androgen receptor (AR) function
WP5 - Steroid Metabolomics
steroid profiling as discovery tool for patients with 46,XY DSD
WP6 - DSD e-learning webportal
interactive learning environment for up to date program on DSD
WP7 – Project Management
IWPLS 2009,
Edinburgh, 14th September 2009
EuroDSD::WP1
At the heart of the VRE is the Registry for registering cases
“…we expect that around 1800 primary cases will fulfil the requirements for
data entry, approximately 300 from each centre”
Other tools
Collaborative tools, e.g. wikis, Bioinformatics tools
registration
SECURE PORTAL / VRE
upload
Registry
Bioinformatics
tools
consent
query
Core Data
BLAST
edit
delete
Gene
Screening
OtherData
discuss
IWPLS 2009,
Edinburgh, 14th September 2009
Biochemical
simulators
Demonstration
IWPLS 2009,
Edinburgh, 14th September 2009
Registry Statistics
As of last night
… 302 cases and moving in the right direction!
IWPLS 2009,
Edinburgh, 14th September 2009
Conclusions
Continuing to work in this space
Many domains have different demands and
expectations
X
X
nanoCMOS electronics did not like portal based solutions
Life scientists/other researchers accept them more readily
Scottish Health Informatics Platform for Research
3 year project £3.5m funded by Wellcome Trust, EPSRC,
ESRC, MRC
X
Started April 2009
– Includes ~£1m for the NHS
» Essential to support collaboration
IWPLS 2009,
Edinburgh, 14th September 2009
Download