Novel High
Resolution tools at
the HRPPC
Dr Xavier Sirault1
Dr Bob Furbank1
1: CSIRO Plant Industry, Black Mountain
Cnr Clunies Ross St & Barry Drive
Canberra, ACT 2601
Xavier.sirault@csiro.au
An Ontology-centric
Architecture for
Extensible
Scientic Data
Management Systems
Gavin Kennedy1,2
Dr Yuan-Fang Li3
2: School of ITEE, University of Queensland,
St Lucia, QLD
3: Clayton School of IT, Monash University,
Clayton, VIC
Gavin.kennedy@csiro.au
What is Plant Phenomics?
Phenome = Genome X Environment
Genomics is accelerating gene
discovery but how do we capitalise
on these data sets to establish gene
function and development of new
genotypes for agriculture?
High throughput and high
resolution analysis capacity now
the factor limiting discovery of new
traits and varieties
“ In the next 50 years we must produce more food
than we have consumed in the history of mankind”
Megan Clarke, CSIRO CEO 2009
Phenomics from the Leaf to the Field
Imagine a plant breeder walking his trials logging plant performance distributed sensors
with his mobile phone or logging on to Phenonet from home to view his wheat in real time
HRPPC: Canberra node of the Australian
Plant Phenomics Facility
Role
 Deep phenotyping
 Development of next generation tools to probe plant
function and performance (come and see us)
Infrastructure:
1500 m2 lab space
 245 m2 greenhouse
 260 m2 growth cabinets
Analytical tools packaged in:
Brachypodium
distachyon
Arabidopsis thaliana
Triticum and Hordeum species,
Vigna unguiculata (cowpea),
Cicer arietinum (chickpea),
Zea mays (maize),
Sorghum bicolor, …
1- Model Plant Module (HTP)
2- Crop-Plant Shoot Module (MTP)
3- Crop-Plant Root Module (MTP)
4- Crop-Plant Field Module (HTP)
Gossypium species
Capitalising on new imaging technologies
Plant Morphology
Visible imaging
•Plant area, biomass, structure
•Senescence, relative chlorophyll
content, pathogenic lesions
Plant Function
Far Infrared imaging
•Canopy / leaf temperature
•Water use / salt tolerance
Chlorophyll Fluorescence
imaging
•Physiological state of
photosynthetic machinery
Near IR imaging
•Tissue water content
•Soil water content
FTIR Imaging Spectroscopy / Hyperspectral imaging
•Cellular localisation of metabolites (sugars, protein, aromatics)
•Carbohydrates, pigments and proteins
PlantScan: next generation phenotyping
platform for n-dimensional Models
•
•
•
Light Detection and Ranging (LiDAR)
Micro-bolometer sensors (FarInfrared)
4-CCD line scanner (NIR and visible
split)
Addressing issues with fluorescence
and environmental control
Automated features extraction and
quantification of n-dimensional models
Jurgen Fripp CSIRO ICT E-Health Brisbane
Automated segmentation – extracted stem
Bounding box extraction and Delauney
triangulation for convex 3D hull
Volume over time
Height and total
volume extraction
Sirault, Fripp and Furbank (in preparation)
An integrated phenotyping platform for Model
Plants
•PAM Fluorescence imaging
•Far Infrared imaging
•Visible imaging for growth
•Climate controlled in equilibration
chamber and imaging chambers
2500 plants per day
Applications:
•1001 genomes project - 65 re-sequenced Arabidopsis thaliana ecotypes under analysis - with
Detlef Weigel
•USDA Brachypodium distachyon project
www.phenonet.com
Distributed Sensor Network for Phenomics
Measure and log range of
environmental factors on
field trials.
Zigby wireless transmitters:
Thermopile Temp Sensor
Humidity
Imaging: Estimate biomass; greeness index for
Ambient Temp
fertilization; detect flowering; estimate yield.
Soil Moisture
Imaging constrained: Develop smarter portable
platforms.
Ontologies
Ontologies are a set of formalised terms that allow us to represent
knowledge about concepts and relationships in a domain.
Annotating with ontologies means describing a domain object or
process.
This image shows the wheat plant on
the left has increased “salt tolerance
(TO:0006001)”
Modelling with ontologies means classifying a domain object or
process, and its relationship to other domain concepts.
OBI:0000050 : “platform”
“A platform is an object_aggregate that is
the set of instruments and software needed
to perform a process. “
Ontologies
Evolutionary
Changes in Domain, Model & Data
Expressed in OWL (& RDF Schema)
Provides syntax & semantics - enables reasoning
Expressivity vs decidability
Validation via reasoning
Designed to be open & interoperable
Facilitates sharing, reuse & Integration
Maturing technology stacks
APIs, reasoners, triple stores, query engines
PODD
PlantScan
The Phenomics Ontology Driven Data
repository
Phenonet
Data
Phenomobile TrayScan
Metadata
PODD
Metadata
Repository
Data
PODD
Data Stores
Metadata
A research data and metadata
repository.
Managing Phenomics Data from
Multiple Heterogeneous High Volume
High Resolution Data Generation
Platforms
A methodology for managing and
publishing research data outputs.
A semantic web data resource.
Putting the OD in PODD
Basics: Ontologies as domain models for research
data
Model domain objects as ontological objects
Base ontology: domain independent
Phenomics ontology: domain specific
Organizes data logically
Represented as metadata objects
Parent-child relationship
Referential relationship
Drives all operations in the data lifecycle
Domain Concepts
OWL Classes
Attributes and relations
OWL Predicates
Domain Objects
OWL Individuals
Comments, descriptions
OWL Annotations
The PODD Ontology
Project
Project Plan
Treatment
Material
Investigation
Material
Platform
Container
Analysis
Event
Data
Environment
Genotype
Design
Gene
Sex
Observation/
Phenotype
Treatment
Archive
Data
Sequence
Measurement
Measurement
Parameter
PODD Architecture
Objects represented semantically
Semantics (metadata) captured in RDF
Repository operations on RDF:
Ingestion, retrieval, update, query & search, export
Backend Object Management:
Fedora Commons
Fedora objects mapped to Java objects for:
Business Logic Layer
Interface Layer
Future Work
Annotation Services
Ontological tagging of PODD objects
Annotation tools, search/discovery tools, browsers, etc.
Virtual Laboratory Environment
Support Phenome to Genome (and back) discovery processes
Analyse linkages across data resources
Workflows for statistical inferences & mathematical modelling.
Visualisation tools
etc...
Resources
Plant Phenomics Test Instance: http://poddtest.plantphenomics.org.au/
Plant Phenomics Production Instance: http://podd.plantphenomics.org.au/
Mouse Phenomics Production Instance: http://podd.australianphenomics.org.au
PODD Project Website: http://projects.arcs.org.au/trac/podd
Contact: Gavin.Kennedy@csiro.au
Ph: +61413 337 819
This work is part of a National eResearch Architecture Taskforce (NeAT) project, supported
by the Australian National Data Service (ANDS) through the Education Investment Fund
(EIF) Super Science Initiative, and the Australian Research Collaboration Service (ARCS)
through the National Collaborative Research Infrastructure Strategy Program.
The Team
PODD Project Manager
Gavin Kennedy
University of Queensland eResearch Lab:
Faith Davies (Developer)
Simon McNaughton (Developer)
Jane Hunter (eResearch Lab Leader)
APN
Philip Wu (Developer)
Martin Hamilton (Developer)
Adrienne McKenzie (APN Head of Network
Services)
Monash Univesity
Yuan-Fang Li (Designer)
APPF/HRPCC/CSIRO
Xavier Sirault (Science Leader, HRPPC) NeAT
Xueqin Wang (Tester, Documentor)
Andrew Treloar (Deputy Director ANDS)
Bob Furbank (APPF HRPPC Leader)
Paul Coddington (Projects Manager, ARCS)
APPF/Plant Accelerator/Uni of Adelaide
Bogdan Masznicz (Bioinformatician)
Mark Tester (APPF TPA Leader)
ALA
Donald Hobern (Director, ALA)