Applications of Semantic Web Technology in the Life Sciences

advertisement
Applications of Semantic Web
Technology in the Life Sciences
Susie Stephens
Principal Product Manager, Life Sciences, Oracle
Coordinator BioRDF Subgroup, HCLSIG, W3C
susie.stephens@oracle.com
Outline
•
•
•
•
Oracle’s RDF Data Model
Applications of Oracle’s Technology
BioRDF Overview and Tasks
Challenges & Future Work
Oracle’s RDF Data Model
Oracle RDF Data Model
•
•
•
•
Support for RDF and RDFS
Object-relational implementation
Subjects and objects are re-used
Links represent complete RDF triples
RDF Triples:
P1
S1
O1
P2
S2
P2
• {S1, P1, O1}
• {S1, P2, O2}
O2
• {S2, P2, O2}
SPARQL-like Query Capability
• A table function allows a graph query to be
embedded in a SQL query
• Searches for an arbitrary pattern against the
RDF data
• Includes inferencing based on RDF, RDFS, and
user-defined rules
Enterprise Functionality
• Real Application Clusters (RAC), Security
• Multi-threaded, parallel processing, indexed, etc.
• Performance testing with UniProt
Units in seconds
Source: Chong et al. VLDB 2005
Image Search
“Find me all DICOM images that contain the
term ‘Jaw’”
• Map relationships to
terms using RDF triples
- ‘Mandible’, sameAs’, ‘Jaw’
- ‘Maxilla’, ‘partOf’, ‘Jaw’
Text Search
“Find me all papers that contain the
term ‘Jaw’”
• Map relationships to
terms using RDF triples
- ‘Mandible’, sameAs’, ‘Jaw’
- ‘Maxilla’, ‘partOf’, ‘Jaw’
Data Integration
• SQL / RDBMS
–
–
Concise, efficient transactions
Transaction metadata is embedded or implicit in
the application or database schema
• XQuery / XML
–
–
Transaction across organizational boundaries
XML wraps the metadata about the transaction
around the data
• SPARQL / RDF
–
–
Information sharing with ultimate flexibility
Enables semantics as well as syntax to be
embedded in documents
Applications of Oracle’s
Technology
Seamark Navigator Use Case
Aggregation of Data
Source: Stephens et al. J Web Semantics 2006
Seamark Navigator Use Case
Faceted Browser Interface
Source: Stephens et al. J Web Semantics 2006
Pathways Use Case
Mapping Pathway to BioPAX Format
Source: http://www.symbionicproject.org/Data/Events/Course_2005/Persico.pdf
Stanford University Use Case
Web Interface
Source: http://pkb.stanford.edu/
Eli Lilly Use Case
Source: http://www.olsug.org/wiki/images/d/df/AWL.pdf
University of Texas Health
Science Center Use Case
Image Source: Semantic Technologies Conference 2006
BioRDF Overview
BioRDF Charter
•
•
•
Build a life sciences demo that spans from bench
to bedside using RDF & OWL to help scientist
better understand the value of the Semantic Web
Explore the effectiveness of current tools for
making data available as RDF
Document our finding to help accelerate the
adoption of the Semantic Web by others
Neuroscience Focus
•Heterogeneous Data
• Molecules to nervous system
• Numerous Web resources
• Effective data sharing &
integration needed
Disease Focus
• Huntingtons
• Alzheimers
• Parkinsons
The BioRDF Meadow
BioRDF Tasks
Challenges & Future Work
Challenges
• Semantic Web & Database communities differ
• View that moving to the ‘Semantic Web’ is a
huge jump
• Lack of awareness that the database can be
downloaded for free
• Maturity of standards and tools
• Assigning URIs to public data sources
• Gluing ontologies together
• Not much data is available in RDF
Future Work
•
•
•
•
•
Continue to enhance Oracle’s products
Explore hybrid approaches
Further work to explore scalability
BioRDF to convert more data sets to RDF
Build the BioRDF demo
Summary
• Oracle has a scalable, secure, highlyavailable RDF Data Model
• Oracle continues to enhance technology for
the Semantic Web
• Many organizations are already using
Semantic Web solutions
• BioRDF is a community effort to solve many
of the challenges
Further Information
• Oracle’s Semantic Technologies Web sites
–
–
http://www.oracle.com/technology/industries/life_sciences
http://www.oracle.com/technology/tech/semantic_technologies
• BioRDF
–
–
http://w3.org/2001/sw/hcls
http://esw.w3.org/topic/HCLSIG_BioRDF_Subgroup
Download