Semantic Vernacular for Fungi - Tetherless World Constellation

Scientific Names and Descriptions for Organisms on the Semantic Web
Nathan Wilson1, Han Wang2, and Deborah McGuinness3
1 Marine Biological Laboratory, 7 MBL St., Woods Hole, MA 02556, USA
nwilson@mbl.edu
2 Tetherless World Constellation, Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY 12180, USA
wangh17@rpi.edu
3 Tetherless World Constellation, Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY 12180, USA
dlm@cs.rpi.edu
For Example…
Candidate Names:
• Chroogomphus vinicolor
• Chroogomphus rutilus
• Chroogomphus ochraceus
• Chroogomphus
• Pine Spike
[Photo: Pine Spike, California, USA. © 1993 Nathan Wilson, CC-BY]
[Photo: Chroogomphus vinicolor, California, USA. © 2007 Darvin DeShazer, CC-BY-NC]
11/12/2012
Motivation
• Take advantage of the crowd to
understand the world’s biodiversity.
• Clarify connection between
observations and scientific literature.
• Provide accurate and machine-interpretable definitions for groups of
organisms.
• Build a central repository for semantic
descriptions of groups of organisms.
What is a Species?
• Concept: “A group of organisms
capable of interbreeding and producing
fertile offspring”
• Definition: A type specimen, a name, and
a circumscription believed to describe
the species to which that specimen
belongs
Definition of a Scientific Name
• Latin Name
• Reference
• Type
• Circumscription, e.g. "Cap: overlapping, wavy; multicolored concentric"
Problem: Circumscriptions change frequently
Definition of a Scientific Name
• Latin Name
• Reference
• Type
Multiple Circumscriptions
• Cap: overlapping, wavy; multicolored concentric
• Cap: 2-10 cm wide; overlapping, also rosette; can be flat to wavy; multicolored concentric; zones alternate
• Cap: 2-10 cm wide; overlapping or in a row or a rosette; kidney shaped, also described as fan shaped; can be flat to wavy; multicolored concentric
• Cap: 2-10 cm wide; usually overlapping or in a row or a rosette; kidney shaped, also described as fan shaped, sometimes fused laterally; can be flat to wavy; multicolored concentric
Scientific Names & Observations
[Diagram labels: People, Observations, Scientific Names, Type, Circumscription(s), Genetic Barcode]
Semantic Vernacular Description (SVD)
• Identifier, e.g. SV1234
• Name, e.g. PineSpike
• Scientific Name(s), e.g. Chroogomphus rutilus, Chroogomphus vinicolor, Chroogomphus ochraceus
• Description, e.g.
EquivalentTo:
    Fungus
    and (hasOverallShape some StipitateAgaric)
    and (hasHymenophoreShape some Gilled)
    and ((hasPileusDiscColor some Brown)
    ...
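The four parts of an SVD listed on this slide can be pictured as a small record type. The following is an illustrative Python sketch only, not the project's data model: the class name `SemanticVernacularDescription` and its field names are assumptions made for this example. Only the two restrictions that are fully visible on the slide are encoded; the color restriction is truncated there and is left out rather than guessed.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SemanticVernacularDescription:
    """Hypothetical record mirroring the four SVD parts on the slide."""
    identifier: str                               # e.g. "SV1234"
    name: str                                     # e.g. "PineSpike"
    scientific_names: Tuple[str, ...] = ()        # associated Latin names
    # Each entry is (property, value), read as "property some value".
    description: Tuple[Tuple[str, str], ...] = ()


pine_spike = SemanticVernacularDescription(
    identifier="SV1234",
    name="PineSpike",
    scientific_names=(
        "Chroogomphus rutilus",
        "Chroogomphus vinicolor",
        "Chroogomphus ochraceus",
    ),
    description=(
        ("hasOverallShape", "StipitateAgaric"),
        ("hasHymenophoreShape", "Gilled"),
        # The slide's color restriction is truncated, so it is omitted here.
    ),
)

print(pine_spike.name, "->", ", ".join(pine_spike.scientific_names))
```

The record is frozen so that a published description cannot be changed in place, which matches the slide deck's emphasis on definitions that stay stable for observers.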
Definition of a Scientific Name
• Latin Name
• Reference
• Specimen
• Multiple Circumscriptions: SV2345, SV3456, SV4567, SV5678
SVDs
[Diagram labels: People, Observations, SVDs, Scientific Names, Type, Circumscription(s), Genetic Barcode]
Fungal Ontology
• Provides a controlled vocabulary for
describing an observation.
• Associates an observation with one or
more scientific names.
• Starts with macroscopic features.
• Moving into microscopic, chemical, and
molecular features.
• Generated by an open collaborative
process.
Observational Features
ObjectProperty: 'has surface color'
    Annotations:
        label "has surface color"^^Literal
    SubPropertyOf:
        'has color'
    Domain:
        Fungus
        and (('has overall shape' some earthstar)
        or ('has overall shape' some gasteroid))
    Range:
        'Color Value Partition'
Descriptions
Class: SV1112
    Annotations:
        hasID "1112"^^positiveInteger
    EquivalentTo:
        Fungus
        and (('has surface color' some white)
        or ('has surface color' some gray)
        or ('has surface color' some off-white))
        and ('has hymenophore shape' some 'spore mass')
        and ('has overall shape' some gasteroid)
        and ('has substrate attachment' some pileate-sessile)
    SubClassOf:
        'proposed at' value "2012-07-03T12:00:00-05:00"^^dateTime,
        'Vernacular Feature Description',
        'proposed by' value SV1090
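To make the EquivalentTo definition concrete, here is a minimal Python sketch of how an observation might be tested against SV1112. It is an approximation under stated assumptions: the dictionary encoding and the `matches` helper are hypothetical, every observation is assumed to already be a Fungus, and the check is closed-world (a missing feature counts as a failure), whereas an OWL reasoner works under the open-world assumption.

```python
# Hypothetical encoding of the SV1112 definition: every property must be
# satisfied ("and"), and any one of its listed values satisfies it ("or").
SV1112 = {
    "has surface color": {"white", "gray", "off-white"},
    "has hymenophore shape": {"spore mass"},
    "has overall shape": {"gasteroid"},
    "has substrate attachment": {"pileate-sessile"},
}


def matches(definition, observation):
    """Return True if the observation satisfies every restriction."""
    return all(
        bool(allowed & set(observation.get(prop, ())))
        for prop, allowed in definition.items()
    )


# Illustrative observations; the feature values reuse terms from the slide.
puffball = {
    "has surface color": {"white"},
    "has hymenophore shape": {"spore mass"},
    "has overall shape": {"gasteroid"},
    "has substrate attachment": {"pileate-sessile"},
}
brown_specimen = dict(puffball, **{"has surface color": {"brown"}})

print(matches(SV1112, puffball))        # all four restrictions are met
print(matches(SV1112, brown_specimen))  # fails the surface color restriction
```

Keeping the definition as plain data means the same helper could be reused for other descriptions, as long as they have the same and-of-ors shape.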
SVDs
Class: SV1012
    Annotations:
        hasID "1012"^^positiveInteger
    SubClassOf:
        'has SVD name' value WhitePuffball,
        'Semantic Vernacular Description',
        'has definition' some SV1112,
        'has associated scientific name' some 'Bovista pila',
        'has associated scientific name' some 'Lycoperdon perlatum'
Fungal Ontology
Highlights
• Explicit, fixed collection of observational
features
• ‘Duck’ Typing: If it ‘looks’ like a
PineSpike, it is a PineSpike
• Amenable to peer review/codification
• Inherently unique
• Stable for observers
Peer-review Process
• Every SVD needs review before use
• Alternative names and definitions can
be proposed
• Discussion/voting happens where there
are alternatives
• Votes are weighted according to users’
past contributions to the process
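The weighted voting step can be sketched as follows. The slides do not say how weights are derived from past contributions, so this Python example simply takes a precomputed weight per user. The `tally` function, the user names, the weights, and the alternative name "PineSpikeMushroom" are all hypothetical.

```python
from collections import defaultdict


def tally(votes, weights, default_weight=1.0):
    """Sum weighted votes per proposal and return (winner, totals).

    votes   -- iterable of (user, proposal) pairs
    weights -- mapping of user to weight; unknown users get default_weight
    """
    totals = defaultdict(float)
    for user, proposal in votes:
        totals[proposal] += weights.get(user, default_weight)
    if not totals:
        return None, {}
    # On a tie, max() keeps the proposal that was voted for first.
    winner = max(totals, key=totals.get)
    return winner, dict(totals)


votes = [
    ("alice", "PineSpike"),
    ("bob", "PineSpikeMushroom"),
    ("carol", "PineSpikeMushroom"),
]
weights = {"alice": 5.0, "bob": 1.0, "carol": 2.0}

winner, totals = tally(votes, weights)
print(winner, totals)
```

In this made-up example a single experienced contributor outweighs two newer ones, which is the effect the weighting rule is meant to have.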
Implementation Prototype
• A Ruby on Rails application
• Triple store powered by Jena TDB
• RESTful web service ready for use
http://mushroomobserver.org/semantic_vernacular
Questions?
Comments?
Acknowledgements
Katie Dunn
Jason Hollinger
Encyclopedia of Life
Tetherless World Constellation
Marine Biological Laboratory
Rensselaer Polytechnic Institute
Mushroom Observer Community