Lifecycle Metadata for Digital Objects October 23, 2006 Creation Metadata

advertisement
Lifecycle Metadata for
Digital Objects
October 23, 2006
Creation Metadata
What happens at file creation?

Consciously



You place into the file everything you think it needs in order
to be useful
You save it with a file name that will help you find it again
(you hope)
Unconsciously (or unwillingly)


Metadata is added using environmental information
Metadata is added using information elicited from you
Metadata from creating app
(Word 2000 Statistics)
Metadata from creating app
(Word 2000 General)
Metadata controlled by the
user (Word 2000 Summary)
Metadata controlled by the
user (Word 2000 Custom)
Viewing Word Metadata in XML





Add relevant metadata to your Word document as
outlined above
If you have Word 2003, save the document as XML
Open the document in an XML editor
If not, save the document as HTML
View the document in Notepad or another ASCII
editor or view source from the HTML document
displayed in a browser
The future of XML in Word?



Word has already provided XML markup of its
Document Properties and Custom Document
Properties metadata for several versions
back
In Word 2003 a native (and patented) XML
schema was used
In Office 2007 XML will be systematically
included in the form of a configurable RM
template
Future of XML at Microsoft




XML and the whole WC3 model are becoming part of the
emerging Microsoft operating environment from servers to
desktop
Note move to a standard IE interface for system functions
Steps ongoing to build 5015.2-like records management
functions into office suite as part of the Enterprise Content
Management architecture based on the SharePoint Portal
Server, so as to get business from governments
Creation metadata thus vital to this whole scheme
Open Office metadata



Note Open Office metadata itself is just as
complex as the new Microsoft standard
OO object is packaged as a group of files
(see specification, pp. 541-545: the package
is a zip file with an XML manifest file
containing a list of files and their types)
This means also being concerned with zip file
format standard, compression, etc. etc.
[non-Microsoft] Uses of creation
metadata






Establishing prior art for an invention
Identifying who knew what and when
Showing how an object fits into the larger scheme of
things (preserving the “archival bond”)
Keeping track of versions of an object
Providing assurance of reliability: that the object is
what it purports to be
Anchoring the object in the place and time of its
origin
Placement of creation
metadata




Same options as for all metadata
Embedded within the object (Word metadata)
Wrapped around the object (object is embedded in
metadata document: Word document containing
metadata embedded in XML document extracting
reliability metadata)
Captured, communicated, or kept separately from
the object (non-text objects but not only them)
UBC Creation Metadata I

The notion of a complete record







Medium
Content
Form
Persons (author, writer, addressee, creator)
Acts
Archival bond
Transmission (intent, capability, success)
UBC Creation Metadata II

“Elements of intellectual form” inside the record:





Date (time of transmission and receipt; place of
transmission)
Superscription or attestation (author/originator)
Inscription (all addressees and receivers)
Title and/or subject
Disposition/purpose (the intention of the record)
UBC Creation Metadata III


The notion of reliable record; must in addition to
completeness have:
“Document profile” as container for the object






Date available (created or received)
Time available (created or received)
[Date and time of further transmission]
Author
Addressee
Subject [classification code, registry number]
DoD 5015.2 Recordkeeping
Standard: Assumptions


Note that 5015.2 assumes an entire detailed
recordkeeping system that fully accounts for
all records at the series, folder, and individual
level
The “file plan” defines the recordkeeping
system; the “schedule” is applied to entities
defined in the file plan
DoD 5015.2 metadata,
*=mandatory








Unique identifier*
Supplemental marking list
Subject/Title*
Media type*
Format*
Date filed*
Publication date*
Date received








Author/Originator*
Addressee*
Other addressees*
Originating organization*
Location
Vital record indicator
Vital record review/update
cycle*
User-defined fields
DoD 5015.2 email metadata
mapping






Sender (Author/Originator)
Primary addressees (Addressee)
Other addressees (Other addressee)
Date/time sent (Publication date)
Date/time received (Date received)
Subject (Subject/Title)
Date/time and persons: vital to
records’ reliability




Without date/time, not possible to manage records
by date: cutoffs, retention, destruction
Without persons (author, recipient, creator), nobody
would care
Without hierarchical set of data categories, no
structure
Note dependence on systems in which records
creation are embedded
What about non-text objects?

Creation metadata for non-text objects covers
much the same ground:



Information about occasion of creation, including
technical information (version of coftware,
settings)
Information about creator, intention, receiver
Information about the object itself
Connecting metadata to a nontext object



Object is kept in specifically-defined file structure
File name/ID is crucial to the connection
XLink is used to connect the two using a series of
XML attributes:
<graphic
xmlns:xlink=“http://www.w3.org/1999/xlink”
xlink:type=“simple”
xlink:href=“usr/local/coll1/object9.tif”
xlink:show=“new”
xlink:actuate=“onRequest”
/>
Download