Ontologies & Ontology tools for the CCLRC Neutron & Muon Facility e-Science

advertisement
Ontologies & Ontology tools for the
CCLRC Neutron & Muon Facility
Louisa Casely-Hayford
e-Science
ISIS a CCLRC Neutron & Muon Facility
• ISIS is the worlds leading pulsed Neutron & Muon source situated at
the CCLRC Rutherford Appleton Laboratory. ISIS supports an
international community of around 1600 scientists in a range of
scientific disciplines.
• Currently ISIS produces about 700GB of combined Neutron & Muon
data each year and this figure is set to rise with the addition of a new
target station.
• The ISIS Metadata Catalogue (ICAT) is a twenty year back
catalogue of experiments conducted at ISIS it contains
approximately 3GB of metadata which references 3TB of data.
• In order to maximise the value of data produced from the facility, it
must be fully searchable.
• To address this problem, e-Science is developing numerous
software solutions and ontologies are seen one of these useful
approaches.
Presenter Name
Louisa Casely-Hayford
Facility
Name
e-Science
Why Ontologies are a useful solution?
• Ontologies offer a powerful means to formally express the nature of
a domain.
• To share common understanding of the structure of information
among people
• To enable reuse of domain knowledge
• To make domain assumptions explicit
• They provide central controlled vocabularies that can be integrated
into catalogues, databases, web publications and knowledge
management applications
• Ontologies will facilitate searching of data by category and grouping
of data into keywords across studies
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Building an Ontology
• Defining terms in a domain and relations between them.
– Defining concepts in the domain (classes)
– Arranging the concepts in a hierarchy (subclass-superclass
hierarchy)
– Defining which attributes and properties (slots) classes can have and
constraints on their values
– Defining individuals and filling in slot values
• Involves collaboration between domain experts and ontology builders.
• Ontologies are expressed in a formal language and developed within an
editing environment.
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
A Protégé-OWL Ontology
• Classes
• Individuals
• Properties
Italy
livesIn
America
Gemma
England
hasSibling
A class is a concept in the domain
- a class of People
- a class of Pets
- a class of Countries
A class is a collection of elements with
similar properties.
Instances of classes
- America can be an instance of the
class Country.
Class Country
Mathew
hasPet
Class Person
Fluffy
Fido
Class Pet
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Building of the ISIS Facilities Ontology
of keywords
in these five
categories
are:
•Examples
The ISIS
facilities ontology
is based
on keywords
in the ISIS
Metadata catalogue (ICAT).
• HRP00145.RAW - a datafile name.
•
Over 10,000 keywords housed in ICAT and many are synonyms.
- a High
Resolution
Powder into
Diffractometer
one of the many
•• HRPD
Keywords
in ICAT
were grouped
5 main categories:
instruments used in experiments at the ISIS facility.
1. Datafile name
2. Instrument
• Hydrazinium
- an investigation title, chemical names and
3. Investigation
compounds
were used astitle
investigation titles of experiments in
ICAT. 4. Investigator
5. Year
• 1986 - the year in which a particular experiment was conducted
• JINR (Joint Institute for Nuclear Research) - the name of an
investigator.
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
ISIS Facilities Ontology Hierarchy
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
ISIS Facilities Ontology
Class ISISExperiment
hasTitle
Hydrazinium
Protein Crystallography
GroupExperiment
Class InvestigationTitle
wasConductedIn
Class CrystallographyGroupExperiment
1986
hasInvestigator
hasDataFileName
Class Year
hasUsedInstrument
Pete Jones
HRPD
HRP00145.RAW
Class Investigator
Class Instrument
Class DataFile
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
ISIS Online Proposal System
• Scientists can submit applications for beamtime at ISIS through an
online application form which is known as the ISIS Online Proposal
System
• The ICAT(Metadata catalog) not only holds the 20 year back catalog
of data, but will also hold data from approved proposals and data
generated from experiments conducted at ISIS
• Three separate modular ontologies for Sample, Investigator and
Experiment are being developed to mark up the Proposal system
• These ontologies are partly based on the proposal system database
schema
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Sample, Investigator and Experiment Ontologies
Sample
Investigator
Experiment
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
OntoMaintainer
• Consensus on Concepts modelled in the ISIS Facilities ontology,
was achieved through a series of interviews with domain experts.
• During the design and creation process, there was a difficulty in
sharing current versions of the ontology with our collaborators at
ISIS.
• This is because to view the hierarchical structure of the ontology,
scientists would have to download and install Protégé locally.
• The Ontology Maintainer was developed to facilitate the community
in remotely viewing current versions of the ontology.
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Screen Shot of OntoMaintainer
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Benefits of OntoMaintainer
• It is easily accessible because it is available over the web
• Allows domain experts to contribute towards the maintenance of the
ontologies
• Encourages collaboration between domain experts (scientists) and
ontology builders by allowing members of the community to be involved
in the development and maintenance of ontologies
• Makes collaboration between domain experts and ontology builders more
efficient
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Future Work
• Completion of Sample, Investigator, Experiment and ISIS Facilities
Ontologies
• Ontology Maintainer will be improved through the addition of properties
to enable relationships between individuals in classes to be shown.
• Graphical view of hierarchies of the ontology will be added to the user
interface of the Ontology Maintainer.
• Tree hierarchy will be made more dynamic through automatic updating of
classes.
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Conclusion
• Ontologies to mark up the ICAT back catalogue and new approved
studies submitted through Online Proposal System to improve the search
and navigation of data and search of concepts across scientific
disciplines.
• Ontology Maintainer will facilitate the process of creating and maintaining
ontologies by providing a means of getting feedback directly from domain
experts.
• Major challenge scope, modularity and integration of ontologies.
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Question Time
Presenter
Name
Louisa Casely-Hayford
Facility
Name
e-Science
Download