Functional requirements for bibliographic linked data: the FR family

advertisement
Functional requirements for
bibliographic linked data: the FR
family and the Semantic Web
Gordon Dunsire
Presented to the ALCTS FRBR Interest
Group, ALA Annual, 24 June 2011
Overview
Background
Resource Description Framework
Representing Functional Requirements for the
Semantic Web
Some issues
Legacy catalogue records
FRAD
FRBR
Does not cover
“authorities”
Does not cover
subjects
FRSAD
FR
Consolidated
model
Semantic
recapitulate
1998Web versions
2009
2010 this sequence:
2012?
*
*
*
*
Earlier models in use before later models published
Later models re-use elements from earlier models
Reflects semantic development over time
Provides learning curve for FRBR Review Group
Background
RDA: resource description and access is based
on FRBR and FRAD
Bibliographic metadata content guidelines
DCMI RDA Task Group asked to develop an
RDF representation of RDA
Following the “London” meeting in April 2007
Stimulated creation of “FRBR namespaces
project”
To develop RDF representation of FRBR
RDF
Resource Description Framework
Basis of the Semantic Web
Metadata expressed as “atomic” statements
A simple, single, irreducible statement
The creator of this work is R. Shankar
Constructed in 3 parts
“Triple”
RDF triples
The creator of this work is R. Shankar
Subject of the statement = Subject: This work
Nature of the statement = Predicate: has creator
Value of the statement = Object: R. Shankar
This work – has creator – R. Shankar
subject – predicate – object
RDF requires the subject and predicate of a triple
to have machine-readable identifiers
Uniform Resource Identifier (URI)
Object of triple can be a URI, or a literal string like a
title
RDF properties and classes
A predicate is called a property
Properties link two things
A specific work with a specific creator
This work <-> R. Shankar
A generic type of thing is called a class
Works, Persons, Places, etc.
Properties can specify which class the subject
and object of a triple should belong to
Subject “domain”; object “range”
Methodology (1)
 Open Metadata Registry used for basic representation
in RDF
Same approach as RDA
Provides URI assignment; labels, definition, scope note,
property range and domain
 FR family are entity-relationship models
Entity => RDF Class
E.g. FRBR Work
Entity attribute => RDF property
E.g. FRBR has-intended-audience
Entity-entity relationship => RDF property
E.g. FRBR is-realized-through (Work-Expression)
Classes
Properties
FRBR Groups are not (super-)classes [FRBR Review Group]
Group 1 entities are not classes [some ontologists]
3.2.5 Person
The fifth entity defined in the
model is person: an individual.
The entity defined as person
encompasses individuals that are
deceased as well as those that are
living.
Entity => Class
4.4.12 Capture mode
Capture mode is the means used to
record notation, sound, or images
in the production of a
manifestation (e.g., analogue,
acoustic, electric, digital, optical
etc.).
Attribute => Property
Label
Definition
Scope note
Label
Definition
Scope note
Verbalised label
URI
URI for Manifestation
Spanish translation of label
- URI remains the same
Methodology (2)
 Terminology of labels, definitions and scope notes
based as closely as possible on source documentation
E.g. Property label = “has” + attribute name
 Classes and properties declared for prior model(s) reused in current model
E.g. FRAD re-uses URIs from FRBR namespace
E.g. FRBR Expression; FRBR has-key
 No re-use of RDF resources from external community
namespaces such as Dublin Core Terms
Equivalences likely to be established later
Methodology (3)
FR attribute properties have domain but no
range
Required by the model for extensibility
E.g. FRBR has-key (domain = Work)
Relationship properties have domain and
range
E.g. FRBR is-realized-through (domain = Work;
range = Expression)
General, non-technical issues
Consistency within documentation
Written for human consumption; evidence of variable
phrasing to make it more readable
E.g. “prior”, “preceding”, “first”
Consistency of labels of RDF resources
E.g. “has a reproduction” (regular) vs “has
reproduction” (irregular)
Documentation refers to sub-types of entity
E.g. “musical work”, “serial”
First pass: sub-type => sub-class
But wrong due to semantic overlap
Opaque URIs
http://iflastandards.info/ns/fr/frbr/frbrer/P1001
frbrer:P1001
Not http://iflastandards.info/ns/fr/frbr/frbrer/Work
IFLA operates in a multilingual environment
Anglophone bias avoided
Labels, etc. in English (@en), but no problem in
accommodating translations
Allows subsequent changes to alternative and
preferred labels without causing confusion
URI must not change!
Linking the FR models
Relationships between separate FR model
classes and properties are declared with RDF
equivalence and hierarchical properties
E.g. owl:sameAs, rdfs:subClassOf
E.g. FRAD class Corporate Body seems to be a
sub-class of FRBR Corporate Body
Will inform the consolidated model
Which may also require additional classes and
properties
RDA issues
DCMI RDA Task Group has declared parallel
FRBR classes within RDA namespace
Could not wait for FRBR RDF resources to be
approved
JSC to decide whether to substitute FRBR namespace
resources, or declare equivalence
Conflict with FRBR in RDA implementation
scenarios?
RDA Manifestation “embodies” Work and
Expression; FRBR allows only Expression
Legacy catalogue records
Domain and range of FR (and RDA) properties
inhibit use for creating triples from nonFRBRized catalogue records
Need separate URIs for each specific Work,
Expression, Manifestation, and Item
Solution: equivalent properties with no
domain or range
Need just a URI for the specific resource described
RDA has such unbounded properties
Thank you
gordon@gordondunsire.com
 FRBR Review Group
http://www.ifla.org/en/frbr-rg
 Open Metadata Registry
http://metadataregistry.org/
 Library Linked Data Incubator Group
http://www.w3c.org/2005/Incubator/lld/
Sponsors
ALA
Cataloging & Classification Quarterly
MARCIVE, Inc.
Download