RightField Rich Annotation of Experimental

RightField
The Semantic Annotation of Experimental Data using Spreadsheets
Katy Wolstencroft, Stuart Owen, Matthew Horridge, Olga Krebs, Wolfgang Mueller, Carole Goble
RightField
A tool for embedding ranges of ontology terms into spreadsheets, so that the users of those spreadsheets can add semantic annotations from simple drop-down lists
Why?
- Makes annotation quicker and more efficient
- Standardises annotation
- Hides the ontology complexity from the users

Managing Biological Data
Describe experiments and results of experiments
Minimal Information Models: guidelines, checklists, vocabularies
Necessary for publication, submission to public databases and sharing
MIACA: Minimal Information About a Cellular Assay
MIAME: Minimum Information About a Microarray Experiment
MIAPE: Minimum Information About a Proteomics Experiment
MIARE: Minimum Information About an RNAi Experiment
MIASE: Minimum Information About a Simulation Experiment
MIBBI: more than 30 minimum information checklists
Ontologies and Vocabularies for Annotation
Gene Ontology
ChEBI
MGED
SBO
BioPortal: more than 270 biomedical ontologies
Data | MIBBI Model | Ontologies
Microarray | MIAME: Minimum Information About a Microarray Experiment | MGED
Proteomics | MIAPE: Minimum Information About a Proteomics Experiment | PSI-MI, PSI-MS, PSI-MOD
Interaction experiments | MIMIx: Minimum Information about a Molecular Interaction Experiment | PSI-MI Protein-Protein Interaction
Systems Biology Models | MIRIAM: Minimal Information Required In the Annotation of biochemical Models | SBO: Systems Biology Ontology
Systems Biology Model Simulation | MIASE: Minimum Information About a Simulation Experiment | KiSAO: Kinetic Simulation Algorithm Ontology
SysMO: Systems Biology of MicroOrganisms
SysMO Consortium
Pan-European consortium
> 100 research groups
> 320 scientists
Distributed, interdisciplinary projects
Expected to pool data and results and disseminate
Microbiologists, molecular biologists, biochemists, mathematicians... not many informaticians
SysMO-DB
SysMO-SEEK: a platform for systems biology data sharing
Web-based environment for sharing in the consortium and disseminating to the community
Used in other consortia: Virtual Liver, EraSysBio+, UNICELLSYS and more
Associating Experiments
[Diagram: Investigation, Study, Assay hierarchy with associated SOPs; stages labelled Construction and Validation]
http://isatab.sourceforge.net/
Data Templates and Vocabularies
[Diagram labels: Metabolomics, Proteomics, Mass Spec, Fluxomics, Transcriptomics, SOP; stages labelled Construction and Validation]
Fitting in with Laboratory practices
Scientists can continue to do what they have always done
Embedding semantics into the tools already in use
Excel, Excel, Excel...
The End Result
Ontology terms for marked-up cells in drop-down boxes
How it Works
Informaticians/ontologists load an Excel workbook and an ontology into the RightField client (the RightField application)
A “portion” of ontology terms is embedded into the Excel workbook
The marked-up workbook is saved in plain Excel
End users fill in the marked-up workbook
Loading Ontologies
Published ontologies
Multiple versions
You can also load local ontologies from file or URL
[Screenshot of the RightField application, with callouts:]
Excel workbook loaded into RightField with multiple worksheets
Class hierarchies of loaded ontologies
Selected parent term from the ontology
Methods for specifying ontology terms
Term lists for selected cells
Excel workbook with marked-up cells
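How a selected parent term turns into a term list for a cell can be sketched roughly as follows. This is a minimal Python illustration, not RightField's Java implementation: the toy hierarchy, the function names `direct_subclasses` and `all_subclasses`, and the two selection modes shown are assumptions made for the example.

```python
from collections import deque

# Hypothetical toy hierarchy (parent -> direct children); not a real ontology.
HIERARCHY = {
    "Assay type": ["Omics", "Imaging"],
    "Omics": ["Proteomics", "Metabolomics", "Transcriptomics"],
    "Proteomics": ["Mass spectrometry proteomics"],
}


def direct_subclasses(parent):
    """Return only the immediate children of the parent term, sorted."""
    return sorted(HIERARCHY.get(parent, []))


def all_subclasses(parent):
    """Return every descendant of the parent term (breadth-first), sorted."""
    found = set()
    queue = deque(HIERARCHY.get(parent, []))
    while queue:
        term = queue.popleft()
        if term not in found:
            found.add(term)
            queue.extend(HIERARCHY.get(term, []))
    return sorted(found)


# The resulting list is what a drop-down for the marked-up cells would offer.
print(direct_subclasses("Omics"))
print(all_subclasses("Omics"))
```

Choosing direct children keeps the drop-down short, while choosing all descendants lets end users pick the most specific term available.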
Marking-up Columns or Rows
The User View
Ontology terms for marked-up cells in drop-down boxes
Ontology Information
Ontologies encapsulated
Scientists can work offline
Ensures the same versions of ontologies are used for a series of experiments
No special macros or plugins required, just Excel or OpenOffice
Versions and URIs captured in hidden worksheets
Provenance
Comparisons between sheets
Linking back to the vocabularies
Provenance
Term Label: The human-readable term label
Term IRI: The (unique) term identifier
Ontology IRI: The ontology that defines the term
Ontology Version: The version of the ontology
Physical Location: The (web) location of the ontology
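The five provenance fields above can be pictured as one record per embedded term. The sketch below is a hedged Python illustration only: the class name `TermProvenance`, the helper `to_hidden_sheet_rows`, the CSV text standing in for a hidden worksheet, and the example.org values are all assumptions, not the layout RightField actually writes.

```python
import csv
import io
from dataclasses import astuple, dataclass, fields


@dataclass(frozen=True)
class TermProvenance:
    """One row of provenance for an embedded ontology term."""
    term_label: str         # the human-readable term label
    term_iri: str           # the (unique) term identifier
    ontology_iri: str       # the ontology that defines the term
    ontology_version: str   # the version of the ontology
    physical_location: str  # the (web) location of the ontology


def to_hidden_sheet_rows(records):
    """Serialise records as CSV text: a header row, then one row per term."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([f.name for f in fields(TermProvenance)])
    for record in records:
        writer.writerow(astuple(record))
    return buffer.getvalue()


# Placeholder values only (example.org); no real ontology is referenced.
record = TermProvenance(
    term_label="Proteomics",
    term_iri="http://example.org/onto#Proteomics",
    ontology_iri="http://example.org/onto",
    ontology_version="1.0",
    physical_location="http://example.org/onto.owl",
)
sheet_text = to_hidden_sheet_rows([record])
print(sheet_text)
```

Keeping the term IRI and ontology version next to each label is what makes later comparisons between sheets and links back to the vocabularies possible.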
RightField Technologies
Java: platform independent
OWL API: loading ontologies and reasoning
Apache POI HSSF libraries: loading and saving of Excel spreadsheets
Ontology Languages
RDFS - RDF Schema
OWL - Web Ontology Language
OBO - Open Biomedical Ontologies
RightField in Use
SysMO: Systems Biology of MicroOrganisms
E-Lico: a virtual laboratory for interdisciplinary collaborative research in data mining and data-intensive sciences. Case studies in kidney research
BioBanking in the Netherlands
Outside Biology
Oil and Gas industry
Egyptology specimen classification
Using RightField Spreadsheets
Populate
Extract
RDF Graph
Store / Reuse
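The extract step can be sketched as turning the chosen labels in annotated cells into RDF triples. This is a minimal Python sketch under stated assumptions, not the extraction tooling used with RightField: the predicate `annotatedWith`, the base IRI, the function names and the sample cells are hypothetical; only `rdfs:label` is a standard term. Output is written as N-Triples lines.

```python
from urllib.parse import quote

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
# Hypothetical predicate and base IRI, used only for this illustration.
ANNOTATED_WITH = "http://example.org/vocab#annotatedWith"
BASE = "http://example.org/workbook/"


def escape_literal(text):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def cells_to_ntriples(sheet, cells, label_to_iri):
    """Turn {cell address: chosen label} into N-Triples lines.

    Cells whose label has no known term IRI are skipped.
    """
    lines = []
    for address, label in sorted(cells.items()):
        term_iri = label_to_iri.get(label)
        if term_iri is None:
            continue
        subject = f"<{BASE}{quote(sheet)}/{address}>"
        lines.append(f"{subject} <{ANNOTATED_WITH}> <{term_iri}> .")
        lines.append(f'<{term_iri}> <{RDFS_LABEL}> "{escape_literal(label)}" .')
    # Drop repeated label triples while keeping the original order.
    return list(dict.fromkeys(lines))


# Hypothetical sample data: cell B3 holds free text with no matching term.
cells = {"B2": "Proteomics", "B3": "free text"}
label_to_iri = {"Proteomics": "http://example.org/onto#Proteomics"}
triples = cells_to_ntriples("Sheet1", cells, label_to_iri)
print("\n".join(triples))
```

The label-to-IRI lookup is the piece that the provenance stored with the workbook would supply in practice.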
Future Developments
Auto-complete
Validation of annotation
Populating ontology content - Populous
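The slides name auto-complete and validation as future work without giving details. One possible approach, shown purely as an assumption-laden Python sketch, is a sorted label index with prefix lookup plus an exact-match check. The function names and sample labels are hypothetical and do not describe a RightField feature.

```python
import bisect


def build_index(labels):
    """Sort unique labels case-insensitively so prefix matches are contiguous."""
    return sorted(set(labels), key=str.lower)


def autocomplete(index, prefix, limit=5):
    """Return up to `limit` labels that start with `prefix`, ignoring case."""
    wanted = prefix.lower()
    keys = [label.lower() for label in index]
    start = bisect.bisect_left(keys, wanted)
    matches = []
    for label in index[start:]:
        if not label.lower().startswith(wanted):
            break
        matches.append(label)
        if len(matches) == limit:
            break
    return matches


def is_valid_annotation(index, value):
    """Accept a cell value only if it exactly matches an allowed label."""
    return value in index


# Hypothetical term list for one marked-up cell.
index = build_index(["Proteomics", "Metabolomics", "Transcriptomics",
                     "Fluxomics", "Mass Spec"])
print(autocomplete(index, "m"))
print(is_valid_annotation(index, "Proteomics"))
```

Validation here is deliberately strict (exact label match), so a value typed in a different case would be rejected rather than silently corrected.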
Populous
http://www.e-lico.eu/populous
Generic tool for populating ontology templates
Supports validation at the point of data entry
Expressive pattern language for OWL ontology generation
Helps biologists with ontology design patterns
Simon Jupp, Robert Stevens, University of Manchester
Availability
Open source
http://www.rightfield.org.uk
Acknowledgements
Stuart Owen
Matthew Horridge
Katy Wolstencroft
Carole Goble
Wolfgang Mueller
Olga Krebs