2009
Backyard Botanicals Metadata Dictionary
Digital Underground
11/1/2009

Digital Underground Data Dictionary
Descriptive, Technical, and Administrative Metadata for Digital Images pertaining to the Backyard Botanicals Collection

This data dictionary provides a complete reference to the metadata used for the Backyard Botanicals Collection in DSpace. The element list was compiled from the Dublin Core (DC) and NISO standards. For the complete set of Dublin Core metadata elements, and for clarification, refer to the Dublin Core website at http://dublincore.org/documents/dces/. For the complete NISO standard, refer to ANSI/NISO Z39.87 - Data Dictionary - Technical Metadata for Digital Still Images, which can be found at http://www.niso.org/kst/reports/standards?step=2&gid=None&project_key=b897b0cf3e2ee526252d9f830207b3cc9f3b6c2c.

The PREMIS Data Dictionary for Preservation Metadata was used as a reference for this dictionary, specifically 2.1 eventIdentifier and 3.1 agentIdentifier. These two elements were tailored to meet the specific needs of the Backyard Botanicals Collection. The DC and NISO standards fulfill the provenance requirements of the collection, so PREMIS was needed only as a reference.

Usage
This dictionary should be used by anyone submitting to the collection, as well as by anyone maintaining the collection for preservation. Permission to update this dictionary will be granted for the purpose of maintenance and preservation.

Document Administration
This dictionary is authored, administered, and maintained by the Digital Underground.
Karen Davis
Jon-Erik Gilot
Mystye Gorgan
Forest Rushay
Gregory Stall

Section 1: Dublin Core
  4.1 Title
  4.8 Creator
  4.2 Subject
  4.3 Description
  4.17 Provenance
  4.4 Type
  4.7 Coverage
  4.10 Contributor
  4.11 Rights
  4.12 Date
  4.13 Format
  4.16 Audience
  4.18 Rights Holder
Section 2: NISO
  6.2 fileSize
  7.1.1 imageWidth
  7.1.2 imageHeight
  7.1.3.1 colorSpace
  8.2.1 dateTimeCreated
  8.2.2 imageProducers
  8.2.3 captureDevice
  8.4.1 digitalCameraManufacturer
  8.4.2.1 digitalCameraModelName
  8.4.2.2 digitalCameraModelNumber
  10.1.4 processingRationale
  10.1.5 Processing Software
  10.1.6 processingActions
  10.2 Previous Image Metadata
Section 3: DUMD (Digital Underground Metadata)
  agentIdentifier
  eventIdentifier

Section 1: Dublin Core

4.1 Title
Label: Title
Element Description: The name given to the resource. Typically, a Title will be a name by which the resource is formally known.
Guidelines: If in doubt about what constitutes the title, repeat the Title element and include the variants in second and subsequent Title iterations. If the item is in HTML, view the source document and make sure that the title identified in the title header (if any) is also included as a Title.
Ex: Title="A Pilot's Guide to Aircraft Insurance"
Notes: A second property with the same name as this property has been declared in the dcterms: namespace (http://purl.org/dc/terms/). See the Introduction to the document "DCMI Metadata Terms" (http://dublincore.org/documents/dcmiterms/) for an explanation.
Mandatory/Optional: M
Repeatable: No

4.8 Creator
Label: Creator
Element Description: An entity primarily responsible for making the content of the resource. Examples of a Creator include a person, an organization, or a service. Typically, the name of the Creator should be used to indicate the entity.
Guidelines: Creators should be listed separately, preferably in the same order that they appear in the publication. Personal names should be listed surname or family name first, followed by forename or given name. When in doubt, give the name as it appears, and do not invert. In the case of organizations where there is clearly a hierarchy present, list the parts of the hierarchy from largest to smallest, separated by full stops and a space. If it is not clear whether there is a hierarchy present, or unclear which is the larger or smaller portion of the body, give the name as it appears in the item.
Ex: Creator="Shakespeare, William"
Notes: If the Creator and Publisher are the same, do not repeat the name in the Publisher area. If the nature of the responsibility is ambiguous, the recommended practice is to use Publisher for organizations, and Creator for individuals. In cases of lesser or ambiguous responsibility, other than creation, use Contributor.
Mandatory/Optional: M
Repeatable: No

4.2 Subject
Label: Subject and Keywords
Element Description: The topic of the resource. Typically, the subject will be represented using keywords, key phrases, or classification codes. Recommended best practice is to use a controlled vocabulary. To describe the spatial or temporal topic of the resource, use the Coverage element.
Guidelines: In general, choose the most significant and unique words for keywords, avoiding those too general to describe a particular item. Subject might include classification data if it is available (for example, Library of Congress Classification Numbers or Dewey Decimal numbers) or controlled vocabularies (such as Medical Subject Headings or Art and Architecture Thesaurus descriptors) as well as keywords. When including terms from multiple vocabularies, use separate element iterations. If multiple vocabulary terms or keywords are used, either separate terms with semi-colons or use separate iterations of the Subject element.
Ex: Subject="Aircraft leasing and renting"
Notes: A second property with the same name as this property has been declared in the dcterms: namespace (http://purl.org/dc/terms/). See the Introduction to the document "DCMI Metadata Terms" (http://dublincore.org/documents/dcmiterms/) for an explanation.
Mandatory/Optional: M
Repeatable: Yes

4.3 Description
Label: Description
Element Description: An account of the content of the resource. Description may include but is not limited to: an abstract, table of contents, reference to a graphical representation of content, or a free-text account of the content.
Guidelines: Since the Description field is a potentially rich source of indexable terms, care should be taken to provide this element when possible. Best practice recommendation for this element is to use full sentences, as description is often used to present information to users to assist in their selection of appropriate resources from a set of search results. Descriptive information can be copied or automatically extracted from the item if there is no abstract or other structured description available. Although the source of the description may be a web page or other structured text with presentation tags, it is generally not good practice to include HTML or other structural tags within the Description element. Applications vary considerably in their ability to interpret such tags, and their inclusion may negatively affect the interoperability of the metadata.
Ex: Description="Illustrated guide to airport markings and lighting signals, with particular reference to SMGCS (Surface Movement Guidance and Control System) for airports with low visibility conditions."
Notes: A second property with the same name as this property has been declared in the dcterms: namespace (http://purl.org/dc/terms/). See the Introduction to the document "DCMI Metadata Terms" (http://dublincore.org/documents/dcmiterms/) for an explanation.
Mandatory/Optional: O
Repeatable: No

4.17 Provenance
Label: Provenance
Element Description: A statement of any changes in ownership and custody of the resource since its creation that are significant for its authenticity, integrity and interpretation.
Guidelines: The statement may include a description of any changes successive custodians made to the resource.
Ex: Provenance="This copy once owned by Benjamin Spock."
    Provenance="Estate of Hunter Thompson."
    Provenance="Stolen in 1999; recovered by the Museum in 2003."
Notes: Under the DC element Description.
Mandatory/Optional: O
Repeatable: Yes

4.4 Type
Label: Resource Type
Element Description: The nature or genre of the content of the resource. Type includes terms describing general categories, functions, genres, or aggregation levels for content. Recommended best practice is to select a value from a controlled vocabulary (for example, the DCMIType vocabulary). To describe the physical or digital manifestation of the resource, use the Format element.
Guidelines: If the resource is composed of multiple mixed types, then multiple or repeated Type elements should be used to describe the main components.
Ex: Type="Image"
Notes:
Mandatory/Optional: O
Repeatable: Yes

4.7 Coverage
Label: Coverage
Element Description: The extent or scope of the content of the resource. Coverage will typically include spatial location (a place name or geographic co-ordinates), temporal period (a period label, date, or date range) or jurisdiction (such as a named administrative entity). Recommended best practice is to select a value from a controlled vocabulary (for example, the Thesaurus of Geographic Names [Getty Thesaurus of Geographic Names, http://www.getty.edu/research/tools/vocabulary/tgn/]). Where appropriate, named places or time periods should be used in preference to numeric identifiers such as sets of co-ordinates or date ranges.
Guidelines: Whether this element is used for spatial or temporal information, care should be taken to provide consistent information that can be interpreted by human users, particularly in order to provide interoperability in situations where sophisticated geographic or time-specific searching is not supported. For most simple applications, place names or coverage dates might be most useful.
Ex: Coverage="Boston, MA"
Notes:
Mandatory/Optional: O
Repeatable: No

4.10 Contributor
Label: Contributor
Element Description: An entity responsible for making contributions to the resource. Examples of a Contributor include a person, an organization, or a service. Typically, the name of a Contributor should be used to indicate the entity.
Guidelines:
Ex:
Notes: The same general guidelines for using names of persons or organizations as Creators apply here. Contributor is the most general of the elements used for "agents" responsible for the resource, so should be used when primary responsibility is unknown or irrelevant.
Mandatory/Optional: O
Repeatable: Yes

4.11 Rights
Label: Rights Management
Element Description: Information about rights held in and over the resource. Typically, a Rights element will contain a rights management statement for the resource, or reference a service providing such information. Rights information often encompasses Intellectual Property Rights (IPR), Copyright, and various Property Rights. If the Rights element is absent, no assumptions can be made about the status of these and other rights with respect to the resource.
Guidelines: The Rights element may be used for either a textual statement or a URL pointing to a rights statement, or a combination, when a brief statement and a more lengthy one are available.
Ex: Rights="Access limited to members"
    Rights="http://cs-tr.cs.cornell.edu/Dienst/Repository/2.0/Terms"
Notes:
Mandatory/Optional: O
Repeatable: No

4.12 Date
Label: Date
Element Description: A date associated with an event in the life cycle of the resource. Typically, Date will be associated with the creation or availability of the resource. Recommended best practice for encoding the date value is defined in a profile of ISO 8601 [Date and Time Formats, W3C Note, http://www.w3.org/TR/NOTE-datetime] and follows the YYYY-MM-DD format.
Guidelines: If the full date is unknown, month and year (YYYY-MM) or just year (YYYY) may be used. Many other schemes are possible, but if used, they may not be easily interpreted by users or software.
Ex: Date="1998-02-16"
    Date="1998-02"
    Date="1998"
Notes:
Mandatory/Optional: M
Repeatable: Yes

4.13 Format
Label: Format
Element Description: The file format, physical medium, or dimensions of the resource.
Guidelines: In addition to the specific physical or electronic media format, information concerning the size of a resource may be included in the content of the Format element if available. In resource discovery, the size, extent, or medium of the resource might be used as a criterion to select resources of interest, since a user may need to evaluate whether they can make use of the resource within the infrastructure available to them. When more than one category of format information is included in a single record, they should go in separate iterations of the element.
Ex: Title="Dublin Core icon"
    Identifier="http://purl.org/metadata/dublin_core/images/dc2.gif"
    Type="Image"
    Format="image/gif"
    Format="4 kB"
Notes: Recommended best practice is to select a value from a controlled vocabulary (for example, the list of Internet Media Types [http://www.iana.org/assignments/media-types/] defining computer media formats).
Mandatory/Optional: O
Repeatable: Yes

4.16 Audience
Label: Audience
Element Description: A class of entity for whom the resource is intended or useful. A class of entity may be determined by the creator or the publisher or by a third party.
Guidelines: Audience terms are best utilized in the context of formal or informal controlled vocabularies. None are presently recommended or registered by DCMI, but several communities of interest are engaged in setting up audience vocabularies. In the absence of recommended controlled vocabularies, implementers are encouraged to develop local lists of values, and to use them consistently.
Ex: Audience="elementary school students"
    Audience="ESL teachers"
    Audience="deaf adults"
Notes:
Mandatory/Optional: O
Repeatable: No

4.18 Rights Holder
Label: Rights Holder
Element Description: A person or organization owning or managing rights over the resource. Recommended best practice is to use the URI or name of the Rights Holder to indicate the entity.
Guidelines: Since, for the most part, people and organizations are not typically assigned URIs, a person or organization holding rights over a resource would be named using a text string. People and organizations sometimes have websites, but URLs for these are not generally appropriate for use in this context, since they do not clearly identify the person or organization, but rather the location of a website about them.
Ex: RightsHolder="Stuart Weibel"
    RightsHolder="University of Bath"
Notes: Under the DC element Rights.
Mandatory/Optional: O
Repeatable: No

Section 2: NISO

6.2 fileSize
Definition: A data element that designates the size in bytes of the image file.
Type: Positive integer
Mandatory/Optional: M
Repeatable: N
Values Ex: 618 72839
Notes: The file size must record the number of bytes as recorded by the system.
Do not attempt to record file sizes in terms of KB, MB, or other notation.
Use: System

7.1.1 imageWidth
Definition: A data element that specifies the width of the digital image, i.e., the horizontal or x dimension, in pixels.
Type: Positive integer
Mandatory/Optional: M
Repeatable: N
Values Ex: 1330
Notes: The image width may be the shorter or longer dimension of the image, depending upon the orientation of the camera or scanner during image capture. For multiple resolution image file formats, the value shall specify the highest resolution.
Use: System, Manager, User

7.1.2 imageHeight
Definition: A data element that specifies the height of the digital image, i.e., the vertical or y dimension, in pixels.
Type: Positive integer
Mandatory/Optional: M
Repeatable: N
Values Ex: 2130
Notes: The image height may be the shorter or longer dimension of the image, depending upon the orientation of the camera or scanner during image capture. For multiple resolution image file formats, the value shall specify the highest resolution.
Use: System, Manager, User

7.1.3.1 colorSpace
Definition: A data element that designates the color model of the decompressed image data.
Type: String
Mandatory/Optional: O
Repeatable: N
Values Ex:
    WhiteIsZero
    BlackIsZero
    RGB
    PaletteColor
    TransparencyMask
    CMYK
    YCbCr
    CIELab
    ICCLab
    DeviceGray
    DeviceRGB
    DeviceCMYK
    Other (key in appropriate text name)
Notes:
Use:

8.2.1 dateTimeCreated
Definition: A data element that designates the Date or Date Time the image was created.
Type: Date Time
Mandatory/Optional: MA
Repeatable: N
Values Ex:
    YYYY-MM-DD (2004-07-15)
    YYYY-MM-DD HH:MM:SS (2004-07-15 19:20:00)
Notes: Use ISO 8601 numeric representations of date and time. See section 10.1.1 dateTimeProcessed for images created by processing image data (digital to digital).
Use: Manager

8.2.2 imageProducers
Definition: A data element that identifies the organization-level producers of the image.
Type: String
Mandatory/Optional: O
Repeatable: Y
Values Ex:
    Luna Imaging
    JJT, Inc
    University of Michigan
Notes: The image width may be the shorter or longer dimension of the image, depending upon the orientation of the camera or scanner during image capture. For multiple resolution image file formats, the value shall specify the highest resolution.
Use: Manager

8.2.3 captureDevice
Definition: A data element that designates the classification of device used to create the image data.
Type: Enumerated type (restricted to list)
Mandatory/Optional: O
Repeatable: N
Values Ex:
    Transmission scanner
    Reflection print scanner
    Digital still camera
    Still from video
Notes: When image processing is used to generate the image data from a digital source, see section 10.1 ImageProcessing.
Use: Manager

8.4.1 digitalCameraManufacturer
Definition: A data element that identifies the manufacturer of the digital camera used to create the image.
Type: String
Mandatory/Optional: O
Repeatable: N
Values Ex: Phase One
Notes:
Use: Manager

8.4.2.1 digitalCameraModelName
Definition: A data element that identifies the camera model name of the digital camera used to create the image.
Type: String
Mandatory/Optional: O
Repeatable: N
Values Ex: H_20
Notes: When image processing is used to generate
the image data from a digital source, see section 10.1 ImageProcessing.
Use: Manager

8.4.2.2 digitalCameraModelNumber
Definition: A data element that identifies the model number of the digital camera used to create the image.
Type: String
Mandatory/Optional: R
Repeatable: N
Values Ex: C2520B
Notes:
Use: Manager

10.1.4 processingRationale
Definition: A data element that provides the rationale for image editing decisions or describes the trigger event for image migration.
Type: String
Mandatory/Optional: O
Repeatable: N
Values Ex:
    Processing action in 10.1.6 undertaken to best match to source image
    TIFF to JPEG 2000 undertaken as TIFF format no longer supported
Notes:
Use: Manager

10.1.5 Processing Software
Definition: A container of information detailing the processing software used to edit or transform the image data. It comprises four data elements: 10.1.5.1 processingSoftwareName, 10.1.5.2 processingSoftwareVersion, 10.1.5.3 processingOperatingSystemName, 10.1.5.4 processingOperatingSystemVersion.
Type: Container
Mandatory/Optional: O
Repeatable: Y
Values Ex:
    Adobe Photoshop (Software Name)
    CS (Software Version)
    Windows or Win (Operating System Name)
    XP (Operating System Version)
Notes: Containers may be repeated but individual elements may not.
Use: Manager, System

10.1.6 processingActions
Definition: A data element that designates an ordinal listing of the image processing steps performed by way of 10.1.5 ProcessingSoftware.
Type: String
Mandatory/Optional: R
Repeatable: Y
Values Ex:
    Editing ex: Rotate 90 degrees cw; ICC profile added
    Transformation ex: Format migration from TIFF 6.0 to lossless JPEG 2000
Notes: Whenever possible, script or action files should be supplied for this element.
Use: Manager

10.2 Previous Image Metadata
Definition: A data element of technical metadata from previous generations of the image file, recorded to document provenance and change history and to provide essential metadata that could be used to simulate a return to the original image data.
Type: (retains previous data types)
Mandatory/Optional: M (each time a new generation of the image is created)
Repeatable: Y
Values Ex:
Notes: The metadata for the source file is recorded as a block in this section. Each subsequent generation's metadata is appended.
Use: Manager, User

Section 3: DUMD (Digital Underground Metadata) *please see note

agentIdentifier
Definition: A data element that, for this purpose, identifies the owner (the name of the person who owns the flowers in the photo) and the photographer (the name of the person who took the photo).
Type: String
Mandatory/Optional: O
Repeatable: Y
Values Ex: String
Notes:
Use:

eventIdentifier
Definition: A designation used to uniquely identify the event within the preservation repository system.
Type: String
Mandatory/Optional: Mandatory; *detail is optional
Repeatable: Yes; *individual elements are not
Values Ex: For this dictionary, three eventIdentifiers exist that cover, separately, the DSpace date and time, the DSpace detail, and the DSpace name. Values:
    dspacedatetime: Date/time uploaded to DSpace, in YY-MM-DD HH:MM:SS format
    dspacedetail: More information about what occurred during the DSpace session
    dspacename: Name of the person uploading to DSpace
Notes: *The fields used here are modeled on 2.1 and 3.1 from PREMIS and tailored to meet the specific needs of the group.
Use:
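
Several entries in this dictionary state value rules that can be checked mechanically before a record is submitted: Date (4.12) accepts YYYY-MM-DD, YYYY-MM, or YYYY; fileSize (6.2) is a positive integer number of bytes; captureDevice (8.2.3) is restricted to a list; dateTimeCreated (8.2.1) accepts YYYY-MM-DD or YYYY-MM-DD HH:MM:SS; and dspacedatetime is written as YY-MM-DD HH:MM:SS. The Python sketch below shows one possible way to test those rules. It is an illustration only, not part of the dictionary or of DSpace: every function and constant name is hypothetical, and it assumes values arrive as plain Python strings or integers.

```python
import re
from datetime import datetime

# Allowed captureDevice values, copied from entry 8.2.3.
CAPTURE_DEVICES = {
    "Transmission scanner",
    "Reflection print scanner",
    "Digital still camera",
    "Still from video",
}

# (regex shape, strptime format) pairs; all names here are hypothetical.
DC_DATE_FORMS = [
    (r"\d{4}-\d{2}-\d{2}", "%Y-%m-%d"),
    (r"\d{4}-\d{2}", "%Y-%m"),
    (r"\d{4}", "%Y"),
]
DATE_TIME_CREATED_FORMS = [
    (r"\d{4}-\d{2}-\d{2}", "%Y-%m-%d"),
    (r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}", "%Y-%m-%d %H:%M:%S"),
]
# Two-digit year, exactly as the eventIdentifier entry writes it.
DSPACE_DATETIME_FORMS = [
    (r"\d{2}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}", "%y-%m-%d %H:%M:%S"),
]


def _matches_any(value, forms):
    """True if value has one of the exact shapes and is a real date/time."""
    if not isinstance(value, str):
        return False
    for pattern, fmt in forms:
        # The regex enforces zero-padding, which strptime alone does not.
        if re.fullmatch(pattern, value):
            try:
                datetime.strptime(value, fmt)
                return True
            except ValueError:
                return False  # right shape, impossible date such as 02-30
    return False


def is_dc_date(value):
    """Date (4.12): YYYY-MM-DD, YYYY-MM, or YYYY."""
    return _matches_any(value, DC_DATE_FORMS)


def is_date_time_created(value):
    """dateTimeCreated (8.2.1): YYYY-MM-DD or YYYY-MM-DD HH:MM:SS."""
    return _matches_any(value, DATE_TIME_CREATED_FORMS)


def is_dspace_datetime(value):
    """dspacedatetime: YY-MM-DD HH:MM:SS."""
    return _matches_any(value, DSPACE_DATETIME_FORMS)


def is_file_size(value):
    """fileSize (6.2): a positive integer count of bytes, not '4 kB'."""
    return isinstance(value, int) and not isinstance(value, bool) and value > 0


def is_capture_device(value):
    """captureDevice (8.2.3): must be one of the listed values."""
    return value in CAPTURE_DEVICES
```

For example, is_dc_date("1998-02") and is_file_size(72839) both hold, while is_file_size("4 kB") does not, which matches the fileSize note against KB or MB notation. The two-digit year in is_dspace_datetime follows the format as written in the eventIdentifier entry; if the collection actually records four-digit years, that pattern would need to change.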