EMC Kazeon-eDiscovery IS1200 Metadata 4.8 Reference Guide

Title Page
EMC® Kazeon-eDiscovery
Version 4.8.0
IS1200 Metadata Reference Guide
EMC Corporation
Corporate Headquarters:
Hopkinton, MA 01748-9103
1-508-435-1000
www.EMC.com
Copyright © 2007 - 2015 EMC Corporation. All rights reserved.
Published September 2015
EMC believes the information in this publication is accurate as of its publication date. The information is subject to
change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED "AS IS." EMC CORPORATION MAKES NO
REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS
PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR
FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. Adobe and
Adobe PDF Library are trademarks or registered trademarks of Adobe Systems Inc. in the U.S. and other countries. All
other trademarks used herein are the property of their respective owners.
The IS1200 software is based in part on software licenses from the following:
Outside In® Content Access © 1991-2015, Chicago, Inc.
Open Source code from www.java2s.com called the itext.asian.jar available at:
http://www.java2s.com/Code/Jar/GHI/itext-asian.jar.htm
Copyright 2009 - 12 Demo Source and Support. All rights reserved
In part on the work of the Independent JPEG Group.
Code from Inxight Software, Inc. Copyright © 1996-2015. All rights reserved. www.inxight.com.
Certain icons used by the Kazeon Web applications come from the Silk Icon set
(http://www.famfamfam.com/lab/icons/silk/)
licensed under the Creative Commons Attribution 2.5 license
(http://creativecommons.org/licenses/by/2.5/).
ii
IS1200 Metadata Reference Guide— v4.8.0
Contents
Preface ............................................................... ......................................vii
Chapter 1
The Metadata Schema and Tag Management
What is Metadata? .............................................................................. 2
About the Metadata Schema ............................................................. 4
The fulltext Metadata Field ......................................................... 4
Default Metadata Schema Fields ................................................ 5
Viewing the Metadata Schema ................................................... 7
How Tags Are Defined in Metadata Schema.................................. 7
Metadata Tag Attributes.............................................................. 7
Metadata Tag Types ...................................................................10
Date Format Requirements .......................................................10
Fully Qualified Tag Names .......................................................11
Metadata Tag Namespaces .......................................................11
Using ‘set schema’ to Add Tags to the Metadata Schema........... 15
Deleting a Field from the Metadata Schema...........................16
Other CLI Commands for Tag Maintenance................................. 16
Creating New Namespaces .......................................................16
Editing Tags.................................................................................16
Un-hiding Tags ...........................................................................17
Listing Name Spaces ..................................................................17
Listing Tags in a Specific Name Space ....................................17
Listing the Indexed Tags in a Name Space .............................17
Listing the Details of a Specific Fully Qualified Tag .............18
Synchronizing Tag Management with Database ...................18
Best Practices ..................................................................................... 18
Customizing the Metadata Schema for Web-Search Preview .... 19
Customizing the Search Schema for DICOM Data ...................... 19
IS1200 Metadata Reference Guide— v4.8.0
iii
Contents
Chapter 2
The Default Schema and Metadata Tags
How Metadata Tags and Namespaces are Organized ................ 22
Namespace Tables Legend .............................................................. 23
System Default Namespaces ........................................................... 24
Calendar Namespace ................................................................. 24
Contact Namespace.................................................................... 25
CTA Namespace ......................................................................... 27
Detected Namespace.................................................................. 28
DICOM Namespace ................................................................... 28
Digest Namespace ...................................................................... 29
Directory Namespace................................................................. 29
Document Namespace............................................................... 30
EnvCommon Namespace.......................................................... 33
Hierarchy Namespace ............................................................... 35
InheritedMail Namespace ......................................................... 36
IPTC Namespace ........................................................................ 38
Kazcollection Namespace.......................................................... 39
Kazeon Namespace .................................................................... 40
Legalapp Namespace................................................................. 41
Legal Namespace........................................................................ 42
LegalExport Namespace............................................................ 44
MailDerived Namespace........................................................... 45
Mail Namespace ......................................................................... 46
MDB Namespace ........................................................................ 52
Objcommon Namespace............................................................ 52
OriginalSource Namespace....................................................... 53
Persistentnamespace Namespace............................................. 53
Processingservice Namespace .................................................. 54
Retention Namespace ................................................................ 54
System Namespace..................................................................... 56
Systemusertags Namespace...................................................... 58
Task Namespace ......................................................................... 58
Tiff Namespace ........................................................................... 60
Userdefined Namespace............................................................ 62
Userextracted Namespace......................................................... 63
Version Namespace.................................................................... 63
Namespaces Populated by Rulesets .............................................. 64
Amex Namespace....................................................................... 65
Centera Namespace.................................................................... 65
DinersClub Namespace ............................................................. 66
Discover Namespace.................................................................. 66
EIN Namespace .......................................................................... 67
Entity Namespace....................................................................... 67
iv
IS1200 Metadata Reference Guide— v4.8.0
Contents
GLBA Namespace.......................................................................70
ITIN Namespace .........................................................................72
JCB Namespace ...........................................................................72
Master Namespace......................................................................73
NIN Namespace ..........................................................................73
NPI Namespace...........................................................................74
PCI Namespace ...........................................................................75
SIN Namespace ...........................................................................77
SSNCC Namespace.....................................................................77
SSN Namespace ..........................................................................78
Visa Namespace ..........................................................................78
Namespaces Populated by Optional Module Connectors .......... 79
Atmoscloud Namespace ............................................................80
Centera Namespace ....................................................................80
CSVC Documentum Namespace..............................................81
CSVC SharePoint Namespace ...................................................85
Directory Namespace .................................................................87
Domino Namespace ...................................................................87
Dumpster Namespace ................................................................88
EnterpriseVault Namespace......................................................88
Exchange Namespace.................................................................90
NSF Namespace ..........................................................................90
Sourceonearchive Namespace ..................................................91
Alphabetic Listing of Indexed Default Metadata Tags: ............... 92
Chapter 3
Metadata Mapping
Background ........................................................................................ 96
Creating a Mapping File to Capture OMR Custom Attributes .. 97
Identifying the OMR Custom Attributes.................................97
Identifying and Creating the IS1200 Metadata Tags to Map
To...................................................................................................97
Obtaining the Mapping Template from the IS1200 ...............98
Editing and Saving the Mapping File ......................................98
Mapping File Naming and Syntax ...........................................99
Default Connector Metadata Mappings ...................................... 101
NFS Default Metadata Mappings...........................................102
CIFS Default Metadata Mappings ..........................................103
Centera Default Metadata Mappings.....................................104
Enterprise Vault (EV) Default Metadata Mappings ............106
SharePoint Default Metadata Mappings ...............................108
Documentum Default Metadata Mappings ..........................110
IS1200 Metadata Reference Guide— v4.8.0
v
Contents
Chapter 4
Legal Export Default Tags
Legal Export Tags Default profile................................................. 116
Post Legal Export Tagging for Rolling Exports .......................... 118
Rolling Export Metadata Tags ................................................ 118
Appendix A
NSF Metadata Tags
How NSF Metadata is Mapped to IS1200 Tags .......................... 120
Appendix B
Using DICOM Tags
Using DICOM File Attributes as Metadata................................. 124
Enabling DICOM File Recognition ........................................ 124
Selecting the DICOM Properties to Extract .......................... 125
Adding DICOM File Attributes to the Search Schema ....... 125
DICOM Properties That Can Be Extracted As Extended
Attributes ......................................................................................... 126
Glossary ........................................................... ......................................147
vi
IS1200 Metadata Reference Guide— v4.8.0
Preface
As part of an effort to improve its product lines, EMC periodically releases
revisions of its software and hardware. Therefore, some functions described
in this document may not be supported by all versions of the software or
hardware currently in use.The product release notes provide the most
up-to-date information on product features.
Contact your EMC technical support professional if a product does not
function properly or does not function as described in this document.
Note: This document was accurate at publication time. Go to EMC Online
Support (https://support.emc.com) to ensure that you are using the latest
version of this document.
Audience
Related
Documentation
This is a reference guide for advanced users, such as IS1200
administrators, that need detailed information about the metadata
fields in the IS1200 Metadata Schema, how to map files between the
IS1200 and repositories accessed through optional modules (such as
Microsoft Exchange servers), what metadata fields are exported
through the Legal Export function, and other information about
using IS1200 metadata.
IS1200 Installation and Quickstart Guide
- describes installing and configuring the IS1200 server software.
IS1200 Web-Admin User and Configuration Guide
- describes using Web-Admin to setup and manage Kazeon clusters.
IS1200 Web-Search User Guide
- describes using Web-Search to perform basic and advanced searches.
IS1200 Metadata Reference Guide— v4.8.0
vii
IS1200 Web-Reports User Guide
- describes using Web-Reports to create and use basic and advanced reports.
IS1200 eDiscovery Case Manager Administrators and Supervisors Guide
- for legal representatives, a primer of all the web-based Interfaces
above for performing eDiscovery.
IS1200 Command Line Interface Reference Guide
- describes the IS1200 Command Line Interface and all its commands.
Follow these steps to download IS1200 document from the web:
1. Go to https://support.emc.com and click the SUPPORT BY
PRODUCT option in the home page.
2. In the Find a Product field, enter Kazeon. From the product
selection list, choose one of the sub-headers (such as Kazeon
ECS) and click the Find button.
3. Kazeon ECS window is displayed. Click the link for
Documentation.
4. In the left-navigation menu, choose a version level to display the
available documents.
Conventions used in
this document
EMC uses the following conventions for special notices:
DANGER indicates a hazardous situation which, if not avoided, will
result in death or serious injury.
WARNING indicates a hazardous situation which, if not avoided,
could result in death or serious injury.
CAUTION, used with the safety alert symbol, indicates a hazardous
situation which, if not avoided, could result in minor or moderate injury.
NOTICE is used to address practices not related to personal injury.
viii
IS1200 Metadata Reference Guide— v4.8.0
Note: A note presents information that is important, but not hazard-related.
IMPORTANT
An important notice contains information essential to software or
hardware operation.
Typographical conventions
EMC uses the following type style conventions in this document.
Normal
Used in running (nonprocedural) text for:
• Names of interface elements (such as names of windows, dialog boxes, buttons, fields, and menus)
• Names of resources, attributes, pools, Boolean expressions, buttons, DQL statements, keywords,
clauses, environment variables, functions, utilities
• URLs, pathnames, filenames, directory names, computer names, filenames, links, groups, service
keys, file systems, notifications
Bold
Used in running (nonprocedural) text for:
• Names of commands, daemons, options, programs, processes, services, applications, utilities,
kernels, notifications, system calls, man pages
Used in procedures for:
• Names of interface elements (such as names of windows, dialog boxes, buttons, fields, and menus)
• What user specifically selects, clicks, presses, or types
Italic
Used in all text (including procedures) for:
• Full titles of publications referenced in text
• Emphasis (for example a new term)
• Variables
Courier
Used for:
• System output, such as an error message or script
• URLs, complete paths, filenames, prompts, and syntax when shown outside of running text
Courier bold
Used for:
• Specific user input (such as commands)
Courier italic
Used in procedures for:
• Variables on command line
• User input variables
<>
Angle brackets enclose parameter or variable values supplied by the user
[]
Square brackets enclose optional values
|
Vertical bar indicates alternate selections - the bar means “or”
{}
Braces indicate content that you must specify (that is, x or y or z)
...
Ellipses indicate nonessential information omitted from the example
IS1200 Metadata Reference Guide— v4.8.0
ix
Where to get help
EMC support, product, and licensing information can be obtained as
follows.
EMC product, and licensing information can be obtained as follows.
Product information — For documentation, release notes, software
updates, or for information about EMC products, licensing, and
service, go to the EMC Online Support at:
https://support.emc.com
Technical Support — Go to EMC Online Support and click Service
Center. You will see several options for contacting EMC Technical
Support. Note that to open a service request, you must have a valid
support agreement. Contact your EMC sales representative for details
about obtaining a valid support agreement or with questions about
your account.
Documentation
Feedback
Your suggestions help us continue to improve the accuracy,
organization, and overall quality of the user publications. Please send
your comments or opinions on this document to:
ECD.Documentation.Feedback@emc.com
Revision History
Table 1
x
Revision History Details
Revision Date
Description
September 2015
Updated the Deduplication section in “Glossary”
December 2013
Initial Publication
IS1200 Metadata Reference Guide— v4.8.0
1
The Metadata Schema and Tag Management
This chapter discusses metadata tag creation and management using
the Command Line Interface (CLI) to edit the metadata schema,
originally called the Kaz Schema. Most tag management tasks can
only be performed from the CLI. See the
IS1200 Command Line Interface Reference Guide for complete
descriptions of the CLI commands described here.
Topics include:
◆
◆
◆
◆
◆
◆
◆
◆
◆
◆
◆
◆
What is Metadata? ............................................................................... 2
About the Metadata Schema .............................................................. 4
The fulltext Metadata Field ................................................................ 4
Default Metadata Schema Fields ....................................................... 5
Viewing the Metadata Schema........................................................... 7
How Tags Are Defined in Metadata Schema ................................... 7
Metadata Tag Namespaces ............................................................... 11
Using ‘set schema’ to Add Tags to the Metadata Schema............ 15
Other CLI Commands for Tag Maintenance.................................. 16
Best Practices....................................................................................... 18
Customizing the Metadata Schema for Web-Search Preview ..... 19
Customizing the Search Schema for DICOM Data ....................... 19
The Metadata Schema and Tag Management
1
The Metadata Schema and Tag Management
What is Metadata?
Data is information recorded in files on data repositories (file servers).
Metadata is data that describes data. Generally, metadata refers to the
information that describes the file, or is contained in the file.
File type, file path, and creation date, as well as any textual content
found in a file, are all examples of metadata. IS1200 classifications
extract this metadata and store it in centrally located metadata
repositories to allow comprehensive searches of all your registered
repositories from one place.
You can also use metadata to group files and to assign them to policy
groups. See the About Policies and Policy Groups chapter of
IS1200 Web-Admin User and Configuration Guide for complete details
on using Policies.
The IS1200 works with many types of metadata including; file system
or standard metadata; application-specific metadata; and fullText, custom,
or user-defined metadata.
File system metadata or standard metadata consists of standard file
attributes taken from data repository directories. It includes attributes
such as file type, file size, creation and modification dates, file path, etc.
These attributes are extracted from the file’s data repository directory
listing and are attributes all files have.
2
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Application-specific metadata is information that is specific to certain
types of files. For example in an Microsoft Word document, the Properties
dialog shows application-specific standard-document properties such as
Title, Subject, and Author, and application-specific custom-document
properties such as Checked By.
Figure 1
Standard or Application Specific Metadata Examples
FullText metadata is probably the most commonly searched metadata
field. fullText is extracted from files with textual content, see
“fullText” on page 156 for more details. All words found in file text
content (except stop words, see “stop words” on page 166 for details)
are individually indexed and may be searched for as “keywords” in
basic and advanced searches.
Custom metadata or user-defined metadata, is also extracted and indexed
from file content, but it is indexed according to guidelines found in
extraction rules, see “Extraction Rules” on page 155 for more details.
For instance, custom metadata can be defined to identify all files
containing the word “confidential”. Custom metadata is also used to
identify files that contain patterns such as social security numbers,
drivers license numbers, or part numbers. Custom metadata search
expressions may contain boolean operators to specify multiple
parameters.
The Metadata Schema and Tag Management
3
The Metadata Schema and Tag Management
About the Metadata Schema
The Metadata Schema (originally called the Kaz Schema) defines the
set of metadata fields used to build a Search Index for registered data
repositories (file systems). The Search Index is built whenever a
classification is performed, and the information extracted is stored in
the Search Index. This allows the IS1200 to parse the Search Index for
search query terms rather than directly accessing the files on all
registered data repositories.
Basic classifications store standard metadata like file size; file name;
and file creation, modification, and access dates in the Search Index.
Deep classifications add even more information based on extractions
rules which define custom metadata to locate and index. Custom
metadata is defined by extraction rules that locate information like
social security numbers, part numbers, or key words like
“confidential” or “private”.
The fulltext Metadata Field
The custom metadata field called “fulltext” deserves special
attention. This field is defined and contained in an extraction rule
called “the fulltext rule” and this rule constitutes the default rule set
applied to new deep classifications when they are created. The
fulltext extraction rule looks at the body of a file (for example, the text
in the body of a word processing or email document) and indexes
each word in the document body in the Search Index and includes the
full textual content of the object in a metadata field called “fulltext”.
Simple searches, for example looking for a company name like
“ACME Inc.”, actually check the fulltext metadata field in the Search
Index for the string “ACME Inc.”.
Note: Fulltext does not necessarily index all the “text” in a file, so simply
searching for a string like “the acme project” will not return a file, like an
email, if the search string is not in the email’s body, even though the string is
contained in the email’s subject. Likewise, searching for an “author” named
“john smith” will not return a Word file authored by John Smith (and duly
recorded in the file’s directory attributes) unless the name “john smith” is
actually a part of the text body of the file.
HOWEVER, information in metadata fields like “author” and “mailsubject”
can be made part of the fulltext search index, and thus searchable with simple
keyword searches, by manually configuring the Search Index to include the
information from those metadata fields. See “Using ‘set schema’ to Add Tags
4
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
to the Metadata Schema” on page 15 for more details.
Fulltext is extracted separately for file objects and sub-objects. For
example for container objects such as ZIP, TAR, JAR, and PST or NSF
files, individual fulltext is extracted and indexed for the container
object (the parent) and each of it’s sub-objects (children).
It is important to note that files with embedded objects, for example
Microsoft Word files containing embedded OLE objects (spreadsheets
or graphics), are not considered to be container objects and
sub-objects. Consequently, the fulltext of embedded objects (for
example the text of an embedded spreadsheet) is extracted and
included in the fulltext of it’s parent object.
Default Metadata Schema Fields
The default Search Schema contains the following types of metadata
fields:
File system fields: These metadata fields consist of file properties
such as FileType and Owner extracted during basic classification.
Do not delete these.
Kazeon-configured fields: The IS1200 provides an extraction rule
set named sampleruleset that contains several pre-defined
extraction rules. The Metadata Schema contains metadata fields
added for these sample extraction rules to populate. These
metadata fields can be deleted if you choose to not use these
rules.
For more information on the sampleruleset, see the section Sample
Extraction Rule Set from the Policies: Classification, Extraction, and
Assignment Rules chapter of IS1200 Web-Admin
User and Configuration Guide.
Internal fields: The IS1200 uses internal metadata fields to
classify files and perform other operations. You cannot delete
these fields.
With the appropriate optional modules installed, the following are
available:
Centera fields: Centera-specific metadata is extracted during
basic classification.
Snapshot fields: These Snapshot-specific metadata fields are
extracted during basic classification.
The Metadata Schema and Tag Management
5
The Metadata Schema and Tag Management
SnapLock fields: These SnapLock-specific metadata fields are
extracted during basic classification.
Not all fields are populated because field values depend on file
contents and document type. For example, the Kazeon-defined field
called “Company” is a property of Microsoft Word files. So, the
system populates this field with values extracted from Microsoft
Office files. But PDF files do not contain a “Company” field, so the
system leaves “Company” unpopulated in that metadata field for pdf
files. You can use extraction rules to populate empty fields. For
information on extraction rules, see the section Using Extraction Rules
from the Policies: Classification, Extraction, and Assignment Rules
chapter of IS1200 Web-Admin User and Configuration Guide.
The Metadata Schema can be customized for any kind of meta field
needed. For information, see “Using ‘set schema’ to Add Tags to the
Metadata Schema” on page 15.
Note: When classifying PST files, the IS1200 maps the file type to “MS
Outlook Personal File Folder” and extracts the following metadata fields:
MailCc, MailFrom, MailTo, MailSubject. Add these fields to the Search index
to use them in search. For more information, see “Using ‘set schema’ to Add
Tags to the Metadata Schema” on page 15.
Typically, the Search schema is set only once, when the IS1200 is
originally configured. If a new keyword is added to the Search
schema later, the Search index must be rebuilt by doing a deep
classification to update the metadata fields. This ensures the index
contains the appropriate metadata to retrieve information from the
corresponding file systems.
Note: See “System Default Namespaces” on page 24 for the fields included in
the default search schema.
6
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Viewing the Metadata Schema
To view the current contents of the Metadata Schema use the CLI
command:
show schema
The system responds:
Figure 2
CLI Command: show schema
How Tags Are Defined in Metadata Schema
The ‘set schema’ command adds new tags to the Metadata Schema.
Tags have a variety of parameters and attributes that must (or can) be
set when using this command. These parameters and attributes must
be understood before using the command.
Metadata Tag Attributes
The following types of attributes may be specified when issuing the
set schema command.
◆
attributes (space separated list)
◆
search-attributes (comma separated list)
◆
display-attributes (comma separated list)
◆
report-attributes (comma separated list)
◆
delimit-attributes (comma separated list)
Each is set using a specific CLI command keyword.
The set schema attribute keywords, and their usage, follows:
The Metadata Schema and Tag Management
7
The Metadata Schema and Tag Management
Table 2
8
Types of Attributes for ‘set_schema’ CLI Command
set schema keyword
Attribute Description
attributes
A list of one or more attributes separated by spaces within single quotes. If set schema is in
edit mode, new attributes are added to existing ones. Attributes include:
• Keep: The tag must be populated during classification and persists during IS100
software upgrades.
• Indexed: The tag must be populated in the search index
• Usertag: The tag is cumulative across classifications
• Multivalued: The tag can have a set of values
• Property: Extracted from a predefined property of the document type
• Alias: The tag is an alias of another tag, this attribute may not be used with any other
attribute.
Example:
set schema NewTag type string namespace foo attributes 'keep
indexed'
defines the fully qualified tag fooNewTag that is a string and has the attributes keep and
indexed.
search- attributes
A list of one or more attributes separated by commas within single quotes. These are
search-specific system configuration parameters. When set schema is used in edit mode,
new search-attributes are added to existing ones. Search-attributes include the following:
• Text: Specifies that the field contains full form text such as sentences, paragraphs and
long bodies of text.
• Kaztext: Specifies that Kazeon specific aggressive tokenization be used on the value of
the tag while indexing and search.
• String: Tag contains a complete string and can only be searched for as a complete
string.
• Date: The tag contains date/time values that are parsed using all internationally
accepted date formatters (MM/dd/yyyy, dd/MM/yyyy, yyyy-MM-dd are defaults).
• Uri: The value for this tag is tokenized with <slash> and <space> as a universal resource
indicator
• Email: The tag value should be tokenized with as an e-mail.
• Saved: Specifies that the field value should be stored in the search index repository,
instead of fetching from metadata store. Generally this should not be used as it may
result in slow retrieval performance.
• Stemmed: Specifies that Porter stemming analyzer should be used for the field while
indexing and searching.
• Content: Specifies the tag is populated in both fullText and the search index.
Example:
set schema NewTag type string namespace foo
search-attributes ‘saved, stemmed'
defines the fully qualified tag fooNewTagName that is a string and has the search-attributes
saved and stemmed.
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Table 2
Types of Attributes for ‘set_schema’ CLI Command
set schema keyword
Attribute Description
display- attributes
A list of one or more attributes separated by commas within single quotes. Provides a way to
control display screen clutter so only tags of interest are displayed. By setting
display-attributes to ‘hidden’, tags are not displayed in some displays. Only ‘hidden’ and
‘extractable’ are currently supported. When set schema is used in edit mode, new
display-attributes are added to existing ones.
Example:
set schema NewTag type string namespace foo
display-attributes ‘hidden'
defines the fully qualified tag fooNewTagName that can be classified (works in the search
index and the database) but is hidden on the screen. To un-hide, see “Un-hiding Tags” on
page 17.
report- attributes
A string containing report-specific system configuration parameters. Used for internal
purposes, not for general use.
delimit- attributes
A string containing multi-valued tokenization system configuration parameters.
These attributes can be used to increase classification performance when populating
multi-valued tags with long lists of values, e.g., when adding lengthy recipient lists to eMail
metadata tags for PST files.
To increase performance, use:
delimit-attributes 'semantics=fifoset'
Some properties like MailTo, MailCC, DocumentAuthor etc have this setting by default.
The Metadata Schema and Tag Management
9
The Metadata Schema and Tag Management
Metadata Tag Types
Each metadata tag has a type which can also be set using the set
schema command. Types include: string, url, date, integer,
decimal, boolean, and email.
When the set schema command is issued without specifying a type,
set schema looks for an existing tag with the name specified and
attempts to enter edit mode to modify the existing tag.
Date Format Requirements
Date information stored in metadata values (the value side of a
metadata tag-value pair) must match one of the following formats to
be indexed properly and be searchable:
◆
“yyyy-MM-dd'T'HH:mm:ssZZZZZ”
◆
"EEE MMM dd HH:mm:ss yyyy"
◆
"EEE, d MMM yy HH:mm:ss Z"
◆
"F, d MMM yy HH:mm:ss Z"
◆
"EEE, d MMM yy HH:mm:ss z"
◆
"yyyyMMddHHmmss"
◆
"yyyy-MM-dd'T'HH:mm:ss"
◆
"yyyy-MM-dd"
◆
"ddMMMyyyy"
◆
"MMMyyyy"
◆
"MM/dd/yy"
◆
"MM/dd/yy"
◆
"dd/MM/yyyy"
◆
"d MMM yy HH:mm:ss"
◆
"d MMM yy HH:mm:ss zzzzz"
New metadata tags (such as dates) can be added to the search schema
via Command Line Interface or Web-Admin. When adding classification
rules to via Web-Admin, the Set Search Schema dialog box appears
allowing the new tag to be added to the search schema. Metadata
dates added via this dialog must use the formats above.
10
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Fully Qualified Tag Names
A fully qualified tag name contains both its namespace and the tag
name. For example the default namespace document contains a tag
called author.
This tag’s fully-qualified name is documentauthor
and will appear in GUI-generated search queries as
‘DocumentAuthor’
and in GUI metadata field drop-down menus as
‘Author[Document]’.
Metadata Tag Namespaces
A namespace must be declared whenever a new metadata field (tag) is
added to the Metadata Schema using the set schema command.
IS1200 software versions 4.0 and higher, organizes metadata tags into
a hierarchy defined by namespaces. Namespaces group similar sets of
tags, for example all the file level tags, like FileType, FileSize,
aTime, and cTime are grouped together under the System namespace.
The standard IS1200 software installation provides a standard set of
active namespaces. Additional namespaces may be created as
needed. Additionally, some standard namespaces are not normally
enabled at installation, but can be activated if needed. See “Using
DICOM Tags” on page 123 for more details.
Standard Installation Name Spaces
At the time this chapter was written, the standard IS1200 installation
provided the standard namespaces the following table. A complete,
updated, list of all namespaces can be obtained at anytime using the
Command Line Interface command:
show schema namespaces
Table 3
Standard or Default Name Spaces
Namespace
Description
Amex
Tags specific to American Express rule set
AtmosCloud
Tags specific to Atmos Cloud rule set
Calendar
The Metadata Schema and Tag Management
11
The Metadata Schema and Tag Management
Table 3
Namespace
Standard or Default Name Spaces
Description
Centera
Contact
CTA
Tags specific Cloud tiering appliance
Deduplicate
Tags specific to Deduplication
Detected
Digest
Dinersclub
Tags specific to Dinersclub rule set
Directory
Tags specific to Directory Reporting Rule set
Discover
Tags specific to Discover rule set
Document
Tags specific to Microsoft Office documents, PDF, HTML, etc
Documentum
Domino
Dumpster
Tags specific to objects in the dumpster folder
Ein
Tags specific to EIN rule set
Enterprisevault
Tags specific to data from enterprise vault files
Entity
Envcommon
Exchange
12
Glba
Tags specific to GLBA rule set
Hierarchy
Internal name space for Hierarchical objects like, zip, tar, pst files.
InheritedMail
Internal name space for managing inheritance of properties under a given hierarchy
IPTC
Tags specific to JPEG and GIF file properties
Itin
Tags specific to ITN rule set
Jcb
Tags specific to JCB rule set
Kazcollection
Internal name space managed by quick rule builder
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Table 3
Standard or Default Name Spaces
Namespace
Description
Kazeon
Tags specific to some prepackaged default rule sets
Legal
Tags specific to Legal service
Legalapp
Tags specific to Legal application, like Interactive tagging.
LegalExport
Mail
Tags specific to the Email properties of exchange, pst, and other email objects
MailDerived
Master
Tags specific to Master Card rule set
MDB
Tags specific to Access database files
Nin
Tags specific to NIN rule set
NSF
Objcommon
OriginalSource
PersistentNameSpace
ProcessingService
Retention
SharePoint
Sin
Tags specific to SIN rule set
Ssn
Tags specific to social security number rule set
Ssncc
Tags specific to SSNCC rule set
System
Tags specific to system metadata captured by IS1200
SystemUserTags
Task
TIFF
UserDefined
Tags populated from CD_tagging services, see below for more details.
The Metadata Schema and Tag Management
13
The Metadata Schema and Tag Management
Table 3
Standard or Default Name Spaces
Namespace
Description
UserExtracted
Tags created by extraction rules, and tags carried over during upgrades from
pre-existing extraction rule tags that have no specific namespace defined., see below.
for more details.
Version
Visa
Tags specific to Visa rule set
Additionally, new tags may be created by Actionable Services and
Extraction rules and they will use the following standard
namespaces:
Table 4
User Defined Namespaces
Namespace
Description
Userdefined
Tags populated from CD_tagging services
Userextracted
Tags created by extraction rules, and tags carried over during upgrades from pre-existing
extraction rule tags that have no specific namespace defined.
The IS1200 uses the name spaces UserDefined and UserExtracted
for special purposes aimed at preserving custom tags. All tags
created in GUI screens (such as Actionable Services Tagging services)
are automatically added to the UserDefined name space. An alias is
defined for the tag entered for the field.
The UserExtracted name space is populated during upgrades from
releases earlier then 3.X or 4.X. All custom tags found during an
upgrade are automatically added to the UserExtracted name space to
preserve them beyond the upgrade process.
14
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Using ‘set schema’ to Add Tags to the Metadata Schema
The CLI set schema command is used as follows:
set schema <fieldName> namespace <nameSpaceName> type
<typeCode>
attributes <spaceSeparatedListOfAttributes in quotes>
search-attributes <commaSeparatedListOfAttributes in
quotes>
display-attributes <commaSeparatedListOfAttributes in
quotes>
Where:
<fieldName> is the name of the new metadata tag
<nameSpaceName> is the name of the namespace to add the new
tag to
<typeCode> is either text, string, integer, decimal, boolean, or
date and the attributes, search-attributes, and display-attributes
lists are defined in “Types of Attributes for ‘set_schema’ CLI
Command” on page 8.
For example:
set schema capacity namespace geex type string
attributes 'indexed keep' search-attributes ‘saved,
stemmed' display-attributes ‘hidden'
Defines a new metadata field (tag) with the fully-qualified name
geexcapacity, for the tag capacity in the namespace geex, of type
string, with attributes indexed and keep, with search-attributes of
saved and stemmed, and the display-attributes of hidden.
All extraction rules populating the field geexcapacity will add
(cumulatively, not replace) the data of geexcapacity both in the search
indexer and the database.
Note: Date information stored in metadata values (the value side of a
metadata tag-value pair) must match certain formats to be indexed properly
and be searchable. See “Date Format Requirements” on page 10 for more
information.
The Metadata Schema and Tag Management
15
The Metadata Schema and Tag Management
Deleting a Field from the Metadata Schema
Enter the following command at a command line prompt and press
Enter:
remove schema fieldName
where fieldName is the name of the metadata field to remove.
Do not delete file system metadata from the Metadata Schema
because these fields are required to search file systems successfully.
The IS1200 does not allow deleting IS1200 internal metadata fields.
However, you can delete the IS1200-configured metadata or any
custom metadata added to the Metadata Schema.
Other CLI Commands for Tag Maintenance
Besides adding and deleting tags in the Metadata Schema, the CLI
can be used to perform other kinds of tag maintenance, and to inspect
the Metadata Schema. To show the basic Metadata Schema, see
“Viewing the Metadata Schema” on page 7.
Creating New Namespaces
When the set schema command specifies a namespace that does not
exist, it automatically creates a new namespace with the name
specified.
For example, if the following command is issued:
set schema newTagName namespace foo type string
attributes 'keep indexed'
And if the namespace foo does NOT exist, the IS1200 creates a new
foo namespace and adds the new tag newTagName to it.
Editing Tags
Generally, tags may be edited by using the set schema command
without a type designator. When type is omitted, and new attributes
specified are added to current attributes, rather than replacing the
current ones.
For example, to modify search-attributes previously set with:
16
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
set schema categories type string namespace geex
attributes 'keep indexed' search-attributes 'saved,
stemmed'
To change the tag geexcategories to be aggressively tokenized by
the search indexer, use the command:.
set schema categories namespace geex search-attributes
'saved, stemmed, kaztext'
Note the type parameter is not used.
Un-hiding Tags
Assuming a the ‘hidden’ display-attribute was previously applied, to
un-hide the tag geexemailaddress, use the command:
set schema emailaddress namespace geex
display-attributes ''
This makes the geexemailaddress tag visible in GUI screens, the tag
name will now appear in all drop-down menus where metadata field
names can be chosen.
Listing Name Spaces
The CLI command: show schema namespaces
Displays the name spaces currently under tag management.
Listing Tags in a Specific Name Space
The CLI command: show schema document namespace
Displays the fully qualified tags under the name space document.
Listing the Indexed Tags in a Name Space
The CLI command: show schema document namespace select
indexed
Displays the indexed tags in the name space document. Similarly, one
can select keep, multivalued, and other attributes and restrict the
display to those specific fields.
The Metadata Schema and Tag Management
17
The Metadata Schema and Tag Management
Listing the Details of a Specific Fully Qualified Tag
The CLI command: show schema geexcategories
Displays all details of the fully qualified tag geexcategories.
Synchronizing Tag Management with Database
The CLI command: synchronize schema
Synchronizes tag management with the database. Use this command
after editing a field to ensure subsequent classifications are affected.
Use this command only when other services are not running. Tag
management changes should only be done when no services are
running.
Best Practices
The following Best Practices should be followed.
Always synchronize the tag management with database after editing
any tag.
Always use fully qualified tags in extraction and assignment rules
Always define different name spaces for extraction rule set tags and
assignment rule set tags.
Tags designated for extraction rule sets should not be cumulative
(should not have an attribute of ‘usertag’), this ensures that each
successive classification puts new metadata in the tag, and that
previous values do not persist.
When required, edit a tag's properties to un-hide a tag or change
search attributes.
Tags that are not defined anywhere in the tag management are
temporary tags and are not persisted
18
IS1200 Metadata Reference Guide— v4.8.0
The Metadata Schema and Tag Management
Customizing the Metadata Schema for Web-Search Preview
To enable Preview mode in Web-Search, issue this
Command Line Interface command.
For v4.2 or greater:
set previewer status enable
For v4.1.2 or earlier:
set schema System.FullText search-attributes
kaztext,content,indexed,index=primary,saved
This allows Preview mode to display the fullText of search results
files in the Web-Search Previewer pane. Once set, a new deep
classification must be run to extract and record the Preview
information
Customizing the Search Schema for DICOM Data
If you intend to classify DICOM files, the IS1200 is designed to
recognize and classify the standard DICOM file attributes. However,
because there are hundreds of these, they are not a part of the default
search schema.
Be aware that the IS1200 only classifies DICOM file headers, the
images themselves are not opened or parsed.
To add any of the DICOM file attributes to the default search schema,
see “Using DICOM Tags” on page 123.
The Metadata Schema and Tag Management
19
The Metadata Schema and Tag Management
20
IS1200 Metadata Reference Guide— v4.8.0
2
The Default Schema and Metadata Tags
All IS1200 installations come with a standard default set of metadata
tags which are organized by namespaces. See “Metadata Tag
Namespaces” on page 11 for more details on namespaces.
Additionally, other standard metadata tags are automatically
available when specific rulesets or optional modules are employed.
This chapter lists all the standard IS1200 metadata Schema
namespaces and the standard metadata tags associated with those
namespaces.
Topics include:
◆
◆
◆
◆
◆
◆
How Metadata Tags and Namespaces are Organized..................
Namespace Tables Legend................................................................
System Default Namespaces ............................................................
Namespaces Populated by Rulesets................................................
Namespaces Populated by Optional Module Connectors...........
Alphabetic Listing of Indexed Default Metadata Tags:................
The Default Schema and Metadata Tags
22
23
24
64
79
92
21
The Default Schema and Metadata Tags
How Metadata Tags and Namespaces are Organized
All metadata tags are discussed as part of their namespace. However,
namespaces are grouped according to the following categories:
22
◆
“System Default Namespaces” on page 24:
The metadata tags in these namespaces are always populated
whenever a deep classification using the fullText ruleset is run.
◆
“Namespaces Populated by Rulesets” on page 64:
The metadata tags in these namespaces are only populated when
a deep classification is done and a specific ruleset (extracted or
assignment) is applied to the classification. See the Policies:
Extraction and Assignment Rules chapter of the IS1200
Web-Admin User and Configuration Guide for more details on
rulesets.
◆
“Namespaces Populated by Optional Module Connectors” on
page 79: The metadata tags in these namespaces are only
populated when a specific Optional Module Connector license is
installed and a deep classification is run on a data repository
accessed by that optional module. See the Optional Modules
section of the Introduction chapter of the IS1200 Web-Admin
User and Configuration Guide for more details on optional
modules.
◆
“Alphabetic Listing of Indexed Default Metadata Tags:” on
page 92: Not all the default tags are automatically indexed, only
those with the key word “indexed” in the Schema, see for details.
This table lists all default tags that have the “indexed” keyword.
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Namespace Tables Legend
The Namespace tables are organized as follows:
◆
Description: Describes properties common to all the tags in this
namespace.
◆
Service populating namespace: List the types of jobs, services,
reports, etc. that can be used to populate the namespace’s
metadata fields.
◆
Extraction ruleset: The extraction ruleset that must be used by the
job or service above to populate the metadata fields.
◆
Assignment ruleset: The assignment ruleset that must be used by
the job or service above to populate the metadata fields.
The Namespace Tags tables below:
◆
Tags: List the metadata tags contained by that namespace.
◆
Saved in DB (reportable): Is checked if the metadata information
is saved by default to the database, and therefore usable in
reports.
◆
Indexed (searchable): Is checked if the metadata information is
saved by default to the search index, and therefore searchable by
that tag.
◆
Comments: Unique information about the metadata tag.
The Default Schema and Metadata Tags
23
The Default Schema and Metadata Tags
System Default Namespaces
The metadata tags in the following namespaces are always populated
whenever a deep classification using the fullText ruleset is run.
Calendar Namespace
Table 5
Calendar Namespace
Description
Outlook Calendar (meeting or appointment) information tags
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing, collection-target-indexing,
copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 6
Calendar Namespace Tags
Tags
Saved in DB
(reportable)
CalendarEndTime
Y
Comments
Outlook Calendar Appointment/ Meeting end time
CalendarFrom
Alias for MailFrom
CalendarLocation
Alias for MailLocation
CalendarOptionalAttendees
Alias for MailCc
CalendarRecurrence
Y
Outlook Calendar Appointment/ Meeting Recurrence
CalendarRequiredAttendees
Alias for MailTo
CalendarSentDate
Alias for MailDate
CalendarStartTime
Y
CalendarSubject
24
Indexed
(Searchable)
IS1200 Metadata Reference Guide— v4.8.0
Outlook Calendar Appointment/ Meeting start time
Alias for MailSubject
The Default Schema and Metadata Tags
Contact Namespace
Table 7
Contact Namespace
Description
Outlook contact information tags
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing, collection-target-indexing,
copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 8
Contact Namespace Tags
Tags
Saved in DB (reportable)
ContactAnniversaryDate
Y
ContactAssistantName
Y
ContactAssistantPhone
Y
ContactBirthDate
Y
ContactBusiness2Phone
Y
ContactBusinessAddress
Y
ContactBusinessFax
Y
ContactBusinessPhone
Y
ContactCallbackPhone
Y
ContactCarPhone
Y
ContactCompany
Y
ContactCompanyPhone
Y
ContactDepartment
Y
ContactDisplayAs
Y
ContactEMail2Address
Y
ContactEMail3Address
Y
ContactEMailAddress
Y
ContactFamily Name
Y
Indexed (Searchable)
Comments
The Default Schema and Metadata Tags
25
The Default Schema and Metadata Tags
Table 8
26
Contact Namespace Tags
Tags
Saved in DB (reportable)
ContactFileAs
Y
ContactFirstName
Y
ContactFullName
Y
ContactGender
Y
ContactHome2Phone
Y
ContactHomeAddress
Y
ContactHomeFax
Y
ContactHomePhone
Y
ContactIMAddress
Y
ContactISDNPhone
Y
ContactJobTitle
Y
ContactManagerName
Y
ContactMiddleName
Y
ContactMobilePhone
Y
ContactNameSuffix
Y
ContactNickname
Y
ContactOffice
Y
ContactOtherAddress
Y
ContactOtherFax
Y
ContactOtherPhone
Y
ContactPager
Y
ContactPrimaryPhone
Y
ContactProfession
Y
ContactRadioPhone
Y
ContactSpouseName
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed (Searchable)
Y
Y
Comments
The Default Schema and Metadata Tags
Contact Namespace Tags
Table 8
Tags
Saved in DB (reportable)
ContactTelexPhone
Y
ContactTitle
Y
ContactTTYPhone
Y
ContactWebPage
Y
Indexed (Searchable)
Comments
CTA Namespace
CTA Namespace Tags
Table 9
Tags
Saved in DB (reportable)
Indexed (searchable)
Comments
CTAFileSizeOnDisk
Y
Y
Additional attribute to report
actual disk space used, in
case of NFS/CIFS storages
connected to EMC CTA
appliance).
The Default Schema and Metadata Tags
27
The Default Schema and Metadata Tags
Detected Namespace
Table 10
Detected Namespace
Description
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing, collection-target-indexing,
copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 11
Detected Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable)
detectedalternates
Y
Y
detectedlanguage
Y
Y
Comments
DICOM Namespace
Table 12
DICOM Namespace
Description
DICOM file information tags
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing, collection-target-indexing,
copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
See “Using DICOM Tags” on page 123 for the list of DICOM
metadata tags available and details about added them to the schema.
28
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Digest Namespace
Table 13
Digest Namespace
Description
Tags to store object's digest information (used for De-duplication)
Service populating namespace
Any classification service
(basic/deep/collection-with-source/target-indexing/copy-with-target-indexing/in-plac
e-processing)
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 14
Digest Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable)
Comments
DigestObject
Y
Digest of the object
DigestProfileId
Y
ID of profile used for digest calculation
Directory Namespace
Table 15
Directory Namespace
Description
Namespace that populates up to first five directory names of a file path
Service populating namespace
Basic, Deep, Reclassify-Action
Extraction Ruleset
n/a
Assignment Ruleset
DirectoryReporting
Table 16
Directory Namespace Tags
Tags
Saved in DB (reportable) Indexed (Searchable)
Comments
directorydir1
Y
First directory in the file path (if it exists)
directorydir2
Y
Second directory in the file path (if it exists)
directorydir3
Y
Third directory in the file path (if it exists)
directorydir4
Y
Fourth directory in the file path (if it exists)
directorydir5
Y
Fifth directory in the file path (if it exists)
The Default Schema and Metadata Tags
29
The Default Schema and Metadata Tags
Document Namespace
Table 17
Document Namespace
Description
Information about metadata of objects like Microsoft office documents, PDF
documents, etc.
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 18
Document Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable) Comments
Y
Y
Y
Y
DocumentAccount
DocumentAddress
DocumentAttachments
DocumentAuthor
DocumentAuthorization
DocumentBackupDate
DocumentBaseFileLocation
DocumentBillTo
DocumentBlindCopy
DocumentCarbonCopy
DocumentCategory
DocumentCharacterCount
DocumentCheckedBy
DocumentClient
DocumentComments
Y
DocumentCompany
Y
DocumentCompletedDate
30
IS1200 Metadata Reference Guide— v4.8.0
Y
The Default Schema and Metadata Tags
Table 18
Document Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable) Comments
Y
Y
Y
Y
Y
Y
DocumentContributor
DocumentCountBytes
DocumentCountCharsWithSpaces
DocumentCountLines
DocumentCountMMClips
DocumentCountNotes
DocumentCountParas
DocumentCountSlides
DocumentCountSlidesHidden
DocumentCreationDate
DocumentDepartment
DocumentDescription
DocumentDestination
DocumentDisposition
DocumentDivision
DocumentDocNumber
DocumentDocType
DocumentEditor
DocumentForwardTo
DocumentHeadingPairs
DocumentKeyWords
Y
DocumentLanguage
Y
Y
Y
Y
DocumentLastPrintDate
DocumentLastSaveDate
DocumentLastSavedBy
The Default Schema and Metadata Tags
31
The Default Schema and Metadata Tags
Table 18
Document Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable) Comments
Y
Y
Y
Y
DocumentLinksDirty
DocumentMailStop
DocumentManager
DocumentMatter
DocumentMinutesEdited
DocumentOffice
DocumentOperator
DocumentOwner
DocumentPageCount
Objects with
PageCount as an
attribute, are extracted
by the parser and
exported by the legal
export.For example,
Microsoft word files.
Note: PageCount is
only supported for
Microsoft word 2010
documents.
PageCount is always
displayed as 1 for the
Microsoft word
documents saved in
the versions prior to
Microsoft word 2010.
DocumentPresentationFormat
DocumentProject
Y
Y
DocumentPublisher
Y
Y
DocumentPurpose
DocumentReceivedFrom
DocumentRecordedBy
32
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 18
Document Namespace Tags
Tags
Saved in DB (reportable)
Indexed (Searchable) Comments
Y
Y
DocumentSubject
Y
Y
DocumentTitle
Y
Y
DocumentRecordedDate
DocumentRelation
DocumentRevisionDate
DocumentRevisionNotes
DocumentRevisionNumber
DocumentScaleCrop
DocumentSection
DocumentSecurity
DocumentSource
DocumentStatus
DocumentTitleOfParts
DocumentTypist
DocumentVersionDate
DocumentVersionNotes
DocumentVersionNumber
DocumentWordCount
DocumentWorkGroup
EnvCommon Namespace
Table 19
Description
EnvCommon Namespace
Environment specific metadata
The Default Schema and Metadata Tags
33
The Default Schema and Metadata Tags
Table 19
Service populating namespace
Any classification service
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 20
34
EnvCommon Namespace
EnvCommon Namespace Tags
Tags
Saved in DB (reportable) Indexed (Searchable)
Comments
EnvcommonMtime
Y
Y
Environment specific
mtime;Mailreceive time for
mails, otherwise mtime of
objects.
EnvcommonRepositorytype
Y
Y
Repository type (e.g., domino,
documentum, etc.)
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Hierarchy Namespace
Table 21
Hierarchy Namespace
Description
Information of objects and sub-objects in composite objects
Service populating namespace Any classification service (note that anot all properties/tags are populated in all
classification services)
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 22
Hierarchy Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
HierarchyActionCasid
Y
Casid of parent object on which action can be
taken.
HierarchyChildrenCompleted
Y
Internal use
HierarchyDepth
Y
Level of the sub-document in the composite file.
HierarchyDisplayObjectType
Y
Y
Type of subdocument like 'embedded email' ,
'embedded file' etc.
HierarchyErrorMessage
Y
Y
Any Errors while parsing the object.
HierarchyFileName
Y
Y
Filename if physical object. Sub-document name if
object is a subdocument. Mail Subject if object is a
mail.
HierarchyFolderName
Y
Y
Name of any folder where the subdocument
resides in the composite file.
HierarchyHasChild
Y
Set to 't' if object has any children.
HierarchyLineage
Y
Internal use
HierarchyOffsets
Y
Internal use
Comments
The Default Schema and Metadata Tags
35
The Default Schema and Metadata Tags
InheritedMail Namespace
Table 23
InheritedMail Namespace
Description
Information parent mail properties inherited by sub-documents (email attachments)
Service populating namespace Deep, reclassify-action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
Table 24
36
fulltext (rule)
InheritedMail Namespace Tags
Assignment Ruleset
n/a
InheritedMailBccLevel1
Y
Y
If immediate parent is email, mailbcc list of that
parent
InheritedMailBccLevel2
Y
Y
If immediate grandparent is email, mailbcc list of
that grandparent
InheritedMailBccLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of mailbcc list of all those parents
InheritedMailCcLevel1
Y
Y
If immediate parent is email, mailcc list of that
parent
InheritedMailCcLevel2
Y
Y
If immediate grandparent is email, mailcc list of
that grandparent
InheritedMailCcLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of mailcc list of all those parents
InheritedMailDateLevel1
Y
Y
If immediate parent is email, maildate of that
parent
InheritedMailDateLevel2
Y
Y
If immediate parent is email, maildate of that
parent
InheritedMailDateLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of maildates of all those parents
InheritedMailFromLevel1
Y
Y
If immediate parent is email, mailfrom list of that
parent
InheritedMailFromLevel2
Y
Y
If immediate grandparent is email, mailfrom of that
grandparent
InheritedMailFromLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of mailfrom list of all those parents
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 24
InheritedMail Namespace Tags
Assignment Ruleset
n/a
InheritedMailReceivedLevel1 Y
Y
If immediate parent is email, mailreceived date of
that parent
InheritedMailReceivedLevel2 Y
Y
If immediate grandparent is email, mailreceived
date of that grandparent
InheritedMailReceivedLevel3 Y
Y
If object's lineage above 2 levels has email(s),
union of mailreceived dates of all those parents
InheritedMailSubjectLevel1
Y
Y
If immediate parent is email, mailsubject of that
parent
InheritedMailSubjectLevel2
Y
Y
If immediate grandparent is email, mailsubject of
that grandparent
InheritedMailSubjectLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of mailsubject of all those parents
InheritedMailToLevel1
Y
Y
If immediate parent is email, mailto list of that
parent
InheritedMailToLevel2
Y
Y
If immediate grandparent is email, mailto list of that
grandparent
InheritedMailToLevel3
Y
Y
If object's lineage above 2 levels has email(s),
union of mailto list of all those parents
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
The Default Schema and Metadata Tags
37
The Default Schema and Metadata Tags
IPTC Namespace
Table 25
Description
IPTC Namespace
Tags for IPTC properties of images
Service populating namespace Deep, reclassify-action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 26
38
IPTC Namespace Tags
Tags
Saved in DB (reportable)
IPTCByline
Y
IPTCBylineTitle
Y
IPTCCaption
Y
IPTCCaptionWriter
Y
IPTCCategory
Y
IPTCCity
Y
IPTCCopyrightNotice
Y
IPTCCountry
Y
IPTCCredits
Y
IPTCDateCreated
Y
IPTCHeadline
Y
IPTCKeywords
Y
IPTCObjectName
Y
IPTCOriginalTransmissionReference
Y
IPTCSource
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed (Searchable)
Comments
The Default Schema and Metadata Tags
Table 26
IPTC Namespace Tags
Tags
Saved in DB (reportable)
IPTCSpecialInstructions
Y
IPTCState
Y
IPTCSupplementalCategories
Y
Indexed (Searchable)
Comments
Kazcollection Namespace
Table 27
Kazcollection Namespace
Description
Collection name namespace
Service populating namespace
Collection-with-source-fulltext-indexing, collection-target-indexing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 28
Kazcollection Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
KazcollectionRuleName
Y
Y
Comments
List of collection name(s) associated with the object
(both source and targets)
The Default Schema and Metadata Tags
39
The Default Schema and Metadata Tags
Kazeon Namespace
Table 29
Kazeon Namespace
Description
Internal namespace (maintained for backward compatibility).
Service populating namespace
Extraction Ruleset
Assignment Ruleset
Table 30
40
Kazeon Namespace Tags
Tags
Saved in DB
(reportable)
KazeonAccountNo
Y
KazeonAutoMFG
Y
KazeonConfidential
Y
KazeonContracts
Y
KazeonExtDate
Y
KazeonHipaa
Y
KazeonHR
Y
Kazeoninternaluseonly
Y
KazeonIPRelated
Y
KazeonLegalDiscovery
Y
KazeonMFG
Y
KazeonName
Y
KazeonNpiType
Y
KazeonNpiValue
Y
KazeonPostalCode
Y
Y
KazeonPrice
Y
Y
KazeonRecordType
Y
KazeonRoutingNo
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed
(Searchable)
Y
Y
Y
Comments
The Default Schema and Metadata Tags
Table 30
Kazeon Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
KazeonSalesMarketing
Y
KazeonSECData
Y
KazeonSocialSecurityNumber
Y
Y
KazeonStockSymbol
Y
Y
KazeonTelephoneNumber
Y
Y
KazeonUKNpiType
Y
KazeonUKNpiValue
Y
KazeonUniversity
Y
KazeonUri
Y
Comments
Y
Legalapp Namespace
Note: The Legalapp namespace tags are only available with IS1200
installations such as ECS that include the eDiscovery Case Manager.
Table 31
Legalapp Namespace
Description
Legal application's per case object data.
Service populating namespace
Tagging during legal review
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 32
Legalapp Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
legalappCase
Y
Y
The casenum
legalappLastReviewedBy
Y
Y
The last user the document was reviewed by (per
case).
legalappLastReviewedDate
Y
Y
Stores the last review date (per case).
The Default Schema and Metadata Tags
41
The Default Schema and Metadata Tags
Table 32
Legalapp Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
legalappNotes
Y
Y
Notes the user entered while setting the legal hold.
legalappReviewedAs
Y
Y
The document's review status. Out of the box
ships with the following values
[None|Completed|pending|Deferred].
legalappReviewer
Y
Y
The reviewer assigned (per case).
legalappTag
Y
Y
The review tag assigned to the document (per
case). Out of the box ships with the following
values [Not
Relevant|Privileged|Escalate|Responsive].
Legal Namespace
Note: The Legal namespace tags are only available with IS1200 installations
such as ECS that include the eDiscovery Case Manager.
Table 33
Legal Namespace
Description
Legal hold related metadata attributes.
Service populating namespace
Extraction Ruleset
Assignment Ruleset
42
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 34
Legal Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
legalhold
Y
Y
legalhold_orig_acls
Y
Legalholdenforcefs
Y
LegalholdEnforcementReleaseDate
Y
legalholdexecutor
Y
Comments
[yes|no]. This field is multi valued, hence the
format of values per case are
legalapp.[casnum].[yes|no]. When set to yes
the objects is set on legal hold for the case
specified in the value.
The original ACL byte stream of the object
prior to legal hold.
Y
Stores 'true' if legal hold was enforced on the
repository for the object, else 'false. Values
[true|false].
The time legalhold was set on the physical
file.
Y
Stores the user (per case) that set legal hold
on the object.
legalholdmonth
legalholdnotes
Y
Stores the notes user entered while setting
the legal hold.
legalholdowner
Y
Stores the original owner of the object set on
legal hold. This is important because legal
hold can change the owner of the object.
LegalholdReleaseDate
Y
Stores the (per case) date when legal hold
was released.
LegalholdSetDate
Y
Stores the (per case) date when legal hold
was set.
legalholdtime
Y
Y
The time legal hold was set on the object.
legalholdtype
Y
Y
Stores the type of hold on the physical
object. Values [legal|security].
legalholdyear
The Default Schema and Metadata Tags
43
The Default Schema and Metadata Tags
LegalExport Namespace
Note: The LegalExport namespace tags are only available with IS1200
installations such as ECS that include the eDiscovery Case Manager.
Table 35
Legal Namespace
Description
Tags created after a successive Legal Export for files that were exported using
the Rolling Exports options.
Service populating namespace
Legal Export
Extraction Ruleset
Assignment Ruleset
Table 36
44
Legal Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
LegalExportCaseIds
Y
Y
[yes|no]. Field is multi valued, hence the format of values per
case are legalapp.[casnum].[yes|no]. When set to yes the
objects have been exported for the case specified in the
value.
LegalExportJobIds
Y
Y
Field is multi valued, hence the format of values per case are
legalapp.[casnum].[jobID].
LegalExportTimes
Y
Y
The time object was Legally Exported.
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
MailDerived Namespace
Table 37
MailDerived Namespace
Description
Tags derived from mail properties
Service populating namespace
Deep, Reclassify-Action, Collection-with-source-fulltext-indexing,
Collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext(rule)
Assignment Ruleset
n/a
Table 38
MailDerived Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
MailDerivedDirection
Y
Y
MailDerivedDirectionInheritedLevel1
Y
Y
MailDerivedDirectionInheritedLevel2
Y
Y
MailDerivedDirectionInheritedLevel3
Y
Y
MailDerivedFromDomain
Y
Y
Senders mail domain
MailDerivedFromDomainInheritedLevel1
Y
Y
If immediate parent is email,
senders mail domain of parent
MailDerivedFromDomainInheritedLevel2
Y
Y
If immediate grandparent is email,
senders mail domain of parent
MailDerivedFromDomainInheritedLevel3
Y
Y
If any object in lineage is email,
union of senders mail domain of
all those objects
MailDerivedRecipientDomain
Y
Y
Receiver's mail domain
MailDerivedRecipientDomainInheritedLevel1
Y
Y
If immediate parent is email,
receiver's mail domain of parent
MailDerivedRecipientDomainInheritedLevel2
Y
Y
If immediate grandparent is email,
receiver's mail domain of parent
MailDerivedRecipientDomainInheritedLevel3
Y
Y
If any object in lineage is email,
union of receiver's mail domain of
all those objects
Comments
Direction of mail like 'outgoing',
'incoming' , 'internal' etc.
The Default Schema and Metadata Tags
45
The Default Schema and Metadata Tags
Mail Namespace
Table 39
Mail Namespace
Description
Extracted properties of mails are stored in this namespace.
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Y
Y
Comments
MailAlternateRecipientAllowed
MailAttachment
MailAttachmentsCount
File/files sent along with email
attachment.
Number of attachments with the email.
MailAttrHidden
MailAttrReadonly
MailAttrSystem
Y
MailAutoForwarded
Y
MailBcc
Y
MailCategories
Y
MailCc
Y
Y
Members marked in the Bcc list.
Y
Members who are in the copy list of an
email.
MailCcme
MailClientSubmitTime
Sender name in the copy list.
Y
Date and timestamp of the email sent by
the sender.
MailCompany
MailConversationIndex
MailConversationTopic
46
IS1200 Metadata Reference Guide— v4.8.0
Y
Y
Subject of the mail.
The Default Schema and Metadata Tags
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
MailCreationTime
Comments
Time when the mail was composed.
MailCreatorEntryid
MailCreatorName
Y
MailDate
Y
MailDeferredDeliveryTime
Y
Y
Date and timestamp of the mail
received.
MailDeleteAfterSubmit
MailEmail
Y
Y
MailEntryId
Internal ID of the mail.
MailExpires
Y
MailExpiryTime
Y
MailFlagsts
Y
MailFrom
Y
MailFullname
Y
MailHomephone
MailImportance
Status as to whether the mail has been
expired or not.
Y
Sender of the mail.
Full name of the sender.
Phone number of the sender.
Y
Importance state of the email.
MailInetMailOverrideFormat
MailInReplyTo
Y
MailInternetArticleNumber
MailInternetCpid
MailInternetMessageId
Y
Unique internet message ID associated
with the email.
MailJobtitle
The Default Schema and Metadata Tags
47
The Default Schema and Metadata Tags
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
MailLastModified
Y
Indexed
(Searchable)
Comments
Deep Crawl
Exchange Server: Time when the object
was crawled
Standalone .msg files: Time when the
object was saved
Collection
Exchange Server: Time when the object
was collected
Standalone .msg files: Time when the
object was saved
Modification Time
Deep Crawl
Exchange Server: Actual mail
received/sent
Standalone .msg files: Time when the
object was saved
Collection
Exchange Server: Actual mail
received/sent
Standalone .msg files: Time when the
object was saved
MailLastModifierEntryid
MailLastModifierName
Y
Person who modified the mail last.
MailLatestDeliveryTime
Y
MailLocation
Y
MailMessageClass
Y
Y
MailMessageId
Y
Y
MailMessageLocaleId
Y
MailMessageCodepage
MailMessageSubmissionId
48
IS1200 Metadata Reference Guide— v4.8.0
Type of message such as:
'IPM.Note' (for emails),
'IPM.Appointment' (for calender items)
‘IPM.Note.ExShortcut” for email stubs
The Default Schema and Metadata Tags
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
MailMsgEditorFormat
MailMsgflag
Y
Flag associated with the email.
MailNewsgroups
MailNormalizedSubject
MailNtSecurityDescriptor
Y
MailOriginatorDeliveryReportRequested
MailPriority
Y
Priority set for the email.
MailProfileConnectFlags
MailRcvdByFlags
MailRcvdRepresentingAddrtype
MailRcvdRepresentingEmailAddress
MailRcvdRepresentingEntryid
MailRcvdRepresentingFlags
MailRcvdRepresentingName
MailRcvdRepresentingSearchKey
MailReadReceiptRequested
MailReceived
Y
Y
Receiver of the email.
MailReceivedByAddrtype
MailReceivedByEmailAddress
MailReceivedByEntryid
MailReceivedByName
Y
MailReceivedBySearchKey
MailRecipientReassignmentProhibited
MailReferences
Y
MailReplyRequested
The Default Schema and Metadata Tags
49
The Default Schema and Metadata Tags
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
MailReplyTime
Y
Indexed
(Searchable)
Comments
Size of the email.
MailReportTag
MailResponseRequested
MailRtfbody
MailRtfInSync
MailRtfSyncBodyCount
MailRtfSyncBodyCrc
MailRtfSyncBodyTag
MailRtfSyncPrefixCount
MailRtfSyncTrailingCount
MailSearchKey
MailSenderAddrtype
MailSenderEmailAddress
Y
MailSenderEntryid
MailSenderFlags
MailSenderName
Y
MailSenderSearchKey
MailSensitivity
Y
MailSentRepresentingAddrtype
MailSentRepresentingEMailAddress
MailSentRepresentingEntryid
MailSentRepresentingFlags
MailSentRepresentingSearchKey
50
MailSize
Y
Y
MailSubject
Y
Y
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 40
Mail Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
Y
Receiver of the email.
MailSubmittime
MailThreadIndex
Y
MailTo
Y
MailTransportMessageHeaders
MailTrustSender
MailUnknown
MailWebpage
MailWorkphone
The Default Schema and Metadata Tags
51
The Default Schema and Metadata Tags
MDB Namespace
Table 41
MDB Namespace
Description
Extracted properties of Access database ( saved in MDB files) are stored in this
namespace.
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 42
MDB Namespace Tags
Tags
Saved in DB
(reportable)
MDBColumn
Y
MDBTable
Y
Indexed
(Searchable)
Comments
Objcommon Namespace
Table 43
Description
Metadata specific to the object.
Services that populate it:
Any classification service
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 44
52
Objcommon Namespace
Objcommon Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
ObjcommonMtime
Y
Y
mailreceive time for mail type objects
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
OriginalSource Namespace
Table 45
OriginalSource Namespace
Description
Metadata from the original source of the object's instance.
Services that populate it:
Any classification service (not all tags are populated with all services)
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 46
OriginalSource Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
OriginalsourceAdHocTag
Y
Y
OriginalsourceContainerLocation
Y
OriginalsourceCustodian
Y
Y
OriginalsourceDocId
Y
Y
OriginalsourceFinalized
Y
OriginalsourceLocation
Y
Comments
Y
Persistentnamespace Namespace
Table 47
Processingservice Namespace
Description
Services that populate it:
Collection and/or processing legalapp
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 48
Tags
Processingservice Namespace Tags
Saved in DB
(reportable)
PersistentNameSpaceLegalCaseNum Y
Indexed
(Searchable)
Comments
Y
Case numbers this object belongs to.
The Default Schema and Metadata Tags
53
The Default Schema and Metadata Tags
Processingservice Namespace
Table 49
Processingservice Namespace
Description
Tags that don't get erased with re-crawls, and are not assigned or userdefined.
Services that populate it:
In-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 50
Processingservice Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Processingservicename
Y
Y
Comments
Retention Namespace
Table 51
Retention Namespace
Description
Describes the object retention.
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing,
Retention.
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 52
Retention Namespace Tags
Tags
Saved in DB
(reportable)
Retentionclass
Y
Indexed
(Searchable) Comments
Retentionclassname
54
Retentiondate
Y
Set to the date for which the object will be set under
Retention.
Retentionlock
Y
Set to 'yes' if the object is set under Retention and is
locked.
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 52
Tags
Retention Namespace Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
Retentionreportdate
Retentionsetdate
Y
Retentionsetuser
Y
Set to the date when the object was set under Retention.
Y
Set to the name of the user who put the object under
Retention.
The Default Schema and Metadata Tags
55
The Default Schema and Metadata Tags
System Namespace
Table 53
System Namespace
Description
System specific information of files are stored in this namespace.
Services that populate it:
Any classification service (not all tags are populated at all times)
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 54
System Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
SystemAtime
Y
Y
Atime of the file
SystemCasid
Y
Y
SystemCasidV
Y
Y
SystemCreationTime
Y
Y
SystemCTime
Y
Y
SystemEndVersionTime
Y
Y
SystemFileExtension
Y
Y
SystemFilepath
Y
Y
SystemFilerName
Y
Y
SystemFileSize
Y
Y
SystemFileType
Y
Y
SystemFullText
Y
Y
Y
Y
Y
Y
Actual file extension in the file system.
SystemFileName
SystemfulltextLimit
SystemGroupName
Systemlargeparsetimeout
SystemLastSeenTime
SystemmatchLimit
56
IS1200 Metadata Reference Guide— v4.8.0
Type of the file
The Default Schema and Metadata Tags
Table 54
System Namespace Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
Y
Y
Mtime of the file
SystemObjectName
Y
Y
SystemObjectType
Y
Y
SystemOwnerName
Y
Y
SystemPolicyGroups
Y
Y
SystemRetentionTime
Y
Y
SystemSnaplocked
Y
Y
SystemSnaplockTime
Y
Y
SystemSsHasBeenDeleted
Y
Y
SystemStartVersionTime
Y
Y
Y
Y
Y
Y
Tags
SystemMimeType
SystemMtime
SystemnoFragmentCD
Systemparsetimeout
SystemtextLimit
SystemTypeExtension
Extenstion depending on type of the file.
SystemUncPath
SystemUniqueDocumentID
The Default Schema and Metadata Tags
57
The Default Schema and Metadata Tags
Systemusertags Namespace
Table 55
Systemusertags Namespace
Description
Services that populate it:
NISHESH
Extraction Ruleset
Assignment Ruleset
Table 56
Systemusertags Namespace Tags
Tags
Saved in DB
(reportable)
SystemUsertagsDigestComposite
Y
Indexed
(Searchable)
Comments
Task Namespace
Table 57
Task Namespace
Description
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 58
58
Task Namespace Tags
Tags
Saved in DB
(reportable)
TaskActualWork
Y
TaskBillingInformation
Y
TaskCompanies
Y
TaskDateCompleted
Y
TaskDueDate
Y
TaskMileage
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed
(Searchable)
Comments
The Default Schema and Metadata Tags
Table 58
Task Namespace Tags
Tags
Saved in DB
(reportable)
TaskOwner
Y
TaskPercentComplete
Y
TaskPriority
Y
TaskStartDate
Y
TaskStatus
Y
TaskSubject
Y
TaskTotalWork
Y
TaskUpdateList
Y
Indexed
(Searchable)
Comments
Y
The Default Schema and Metadata Tags
59
The Default Schema and Metadata Tags
Tiff Namespace
Table 59
Description
Tiff file information is stored in this namespace.
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 60
60
Tiff Namespace
Tiff Namespace Tags
Tags
Saved in DB
(reportable)
TIFF_254
Y
TIFF_258
Y
TIFF_259
Y
TIFF_262
Y
TIFF_266
Y
TIFF_269
Y
TIFF_270
Y
TIFF_271
Y
TIFF_272
Y
TIFF_273
Y
TIFF_274
Y
TIFF_277
Y
TIFF_278
Y
TIFF_279
Y
TIFF_282
Y
TIFF_283
Y
TIFF_284
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed
(Searchable)
Comments
The Default Schema and Metadata Tags
Table 60
Tiff Namespace Tags
Tags
Saved in DB
(reportable)
TIFF_296
Y
TIFF_301
Y
TIFF_305
Y
TIFF_306
Y
TIFF_315
Y
TIFF_317
Y
TIFF_318
Y
TIFF_319
Y
TIFF_33432
Y
TIFF_529
Y
TIFF_532
Y
TIFFArtist
Y
TIFFCopyright
Y
TIFFDateTime
Y
TIFFImageDescription
Y
TIFFImageLength
Y
TIFFImageWidth
Y
TIFFMake
Y
TIFFModel
Y
TIFFSoftware
Y
Indexed
(Searchable)
Comments
Y
Y
The Default Schema and Metadata Tags
61
The Default Schema and Metadata Tags
Userdefined Namespace
Note: The Userdefined namespace tags are only available with IS1200
installations such as ECS that include the eDiscovery Case Manager.
Table 61
Userdefined Namespace
Description
Tags populated by users with a tagging service. These are retained across crawls.
Services that populate it:
Tagging
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 62
Tags
Userdefined Namespace Tags
Saved in DB
(reportable)
userdefineddefault
62
IS1200 Metadata Reference Guide— v4.8.0
Indexed
(Searchable)
Comments
Internal use.
The Default Schema and Metadata Tags
Userextracted Namespace
Table 63
Userextracted Namespace
Description
Tags created using extraction rules that are created in GUI by the users.
Services that populate it:
Extraction Ruleset
Assignment Ruleset
Table 64
Userextracted Namespace Tags
Saved in DB
(reportable)
Tags
Indexed
(Searchable)
userextracteddefault
Comments
Internal use.
Version Namespace
Table 65
Version Namespace
Description
Version details of the object
Services that populate it:
Deep, reclassify-action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 66
Version Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
versionBaseObject
KazURL of base object
versionDate
Date when this version was created
versionId
Version Id of the object
versionName
Version Name
versionParentCasnum
Casnum of base object
The Default Schema and Metadata Tags
63
The Default Schema and Metadata Tags
Namespaces Populated by Rulesets
The metadata tags in the following namespaces are only
populatedwhen a deep classification is run and a specific ruleset
(either extracted or assignment) is specified with the classification.
See the Policies: Extraction and Assignment Rules chapter of the
IS1200 Web-Admin User and Configuration Guide for more details on
rulesets. The required ruleset(s) are noted in the Namespace tables.
64
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Amex Namespace
Table 67
Amex Namespace
Description
Identifies American Express Cards and reports the risk level in their document.
Service populating namespace
Deep, Reclassify-Action
Extraction Ruleset
AmEx
Assignment Ruleset
AmEx
Table 68
Amex Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Amexrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences of American Express Card numbers and a combination
of literals 'Confidential' or 'Sensitive' or 'Restricted' in the document.
Centera Namespace
Table 69
Centera Namespace
Description
Metadata of objects in Centera repository.
Service populating namespace
Reclassify, collection-target-indexing, copy-with-target-indexing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 70
Centera Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
centeraClipCreationTime
Y
Y
Creation time of clip in Centera.
centeraClipId
Y
Y
Clip Id assigned by Centera
centeraName
Y
Y
Clip name.
The Default Schema and Metadata Tags
65
The Default Schema and Metadata Tags
DinersClub Namespace
DinersClub Namespace
Table 71
Description
Identifies Diners Club Cards and reports their risk levels in the documents.
Service populating namespace
Deep, Reclassify-Action
Extraction Ruleset
DinersClub
Assignment Ruleset
DinersClub
Table 72
DinersClub Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Dinersclubrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on
number of occurrences Diners Club Card numbers and a
combination of literals 'Confidential' or 'Sensitive' or 'Restricted'
in the document.
Discover Namespace
Discover Namespace
Table 73
66
Description
Identifies Discover Cards and reports the risk levels in the documents.
Service populating namespace
Deep, Reclassify-Action
Extraction Ruleset
Discover
Assignment Ruleset
Discover
Table 74
Discover Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
discoverrisk
Y
Y
IS1200 Metadata Reference Guide— v4.8.0
Comments
Reported as High, Low, Medium, Unknown depending on number
of occurrences Discover Card numbers and a combination of
literals 'Confidential' or 'Sensitive' or 'Restricted' in the document.
The Default Schema and Metadata Tags
EIN Namespace
EIN Namespace
Table 75
Description
Identifies EINs ( USA Employer Identification Number) and reports the risk levels
in the documents.
Service populating namespace
Deep, Reclassify-Action
Extraction Ruleset
EIN
Assignment Ruleset
EIN
EIN Namespace Tags
Table 76
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
einrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences EIN numbers and a combination of literals 'Confidential'
or 'Sensitive' or 'Restricted' or 'Employer Identification Number/No.' or
'EIN#' in the document.
Entity Namespace
Entity Namespace
Table 77
Description
Extracted Concepts/Entities.
Service populating namespace
Deep, reclassify-action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
conceptfinder
Entity Namespace Tags
Table 78
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Entityaddress
Y
US street addresses roughly in street number, street, city,
state, zip format, or Internet addresses in abc@xyz.com form.
entitycity
Y
Name of a city, including abbreviations for major cities.
entitycompany
Y
Name of a company, organization, or corporation.
Comments
The Default Schema and Metadata Tags
67
The Default Schema and Metadata Tags
Entity Namespace Tags
Table 78
68
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
entityconcepts
Y
entitycountry
Y
Names of countries and abbreviations.
entitycurrency
Y
Quantities of world currency, and ranges of amounts of
currency.
entitydate
Y
Dates minimally composed of a number and month.
entityfacility
Y
Man-made structures, extracted as one of the following
sub-types:
BUILDGROUNDS- Names of architectural and civil
engineering structures, and outdoor spaces that are mainly
man-made or man-maintained. There is no distinction with
respect to their function, they could be civil or military facilities,
they could be used for work, entertainment, or monuments.
entityholiday
Y
Holidays and special days.
entitylanguage
Y
entitymeasure
Y
Any measurement, such as weight, volume, or length, in
English or metric units, including standard abbreviations of
measurement units.
entityorganization
Y
Government, legal, and service agencies, including non-profit
organizations, fine arts groups, and other associations and
institutions.
entitypeoples
Y
Names referring to identifiable groups of people based on
country, ethnicity, race, or religion.
entitypercent
Y
A percentage.
entityperson
Y
An individual specified by name.
entityphone
Y
Phone numbers in US ### ### #### format. International
phone numbers based on French, German, and Spanish
formats.
Y
Comments
Not populated.
entityplace_region Y
A geographical area larger than a city that captures a
significant land mass, such as a continent or a group of
countries
entityproduct
A product name, including software and service-oriented
products.
Y
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Entity Namespace Tags
Table 78
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
entityprop_misc
Y
A proper name that does not fall into any of the entity types
specified by the other entities.
entityssn
Y
Social security number, including Canadian Social Insurance
Numbers and French INSEE Numbers.
entitystate
Y
One of the fifty states of the United States, including standard
abbreviations and two-letter postal codes.
entityticker
Y
Company stock ticker symbols used on the stock exchange.
entitytime
Y
Designations of hours, minutes, and seconds.
entityvehicle
Y
Methods of transportation, extracted as one of the following
sub-types such as AIR, LAND, WATER.
Comments
The Default Schema and Metadata Tags
69
The Default Schema and Metadata Tags
GLBA Namespace
Table 79
Description
Identification of Consumers' Personal Financial Information
Service populating namespace
Deep, Reclassify-Action
Extraction Ruleset
GLBA
Assignment Ruleset
GLBA
Table 80
70
GLBA Namespace
GLBA Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
glbacountamex
Y
glbacountdinersclub
Y
glbacountein
Y
glbacountgenericcc
Y
glbacountitin
Y
glbacountjcb
Y
glbacountmastercard
Y
glbacountnin
Y
glbacountpan
Y
glbacountsin
Y
glbacountssn
Y
glbacountvisa
Y
Glbarisk
Y
glbatypeamex
Y
Updated to 'yes' if a document contains American Express
Card Number.
glbatypeconfidential
Y
Updated to 'yes' if a document contains literal 'Confidential'.
glbatypedinersclub
Y
Updated to 'yes' if a document contains Diners Club Card
Number.
Comments
Y
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 80
GLBA Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
glbatypeein
Y
Updated to 'number' or 'literal' or 'both' if a document
contains a literal 'EIN#' or 'Employer Identification Number'
or an EIN number or combination of both.
glbatypegenericcc
Y
Updated to 'yes' if a generic credit card number is found in
the document.
glbatypeinternaluse
Y
Updated to 'yes' if a document contains literal 'Internaluse'.
glbatypeitin
Y
Updated to 'number' or 'literal' or 'both' if a document
contains a literal 'Tax ID#' or 'Tax Identification Number/No.'
or an Tax ID number or combination of both.
glbatypejcb
Y
Updated to 'yes' if the document contains JCB Card
Number.
glbatypemastercard
Y
Updated to 'yes' if the document contains Master Card
Number.
glbatypenin
Y
Updated to 'yes' if the document contains UK National
Insurance Number.
glbatypepan
Y
Updated to 'yes' if the document contains India Permanent
Account Number.
glbatypeproprietary
Y
Updated to 'yes' if a document contains literal 'Properietary'.
glbatyperestricted
Y
Updated to 'yes' if a documented contains literal 'Restricted'.
glbatypesin
Y
Updated to 'yes' if the document contains Canada Social
Insurance Number.
glbatypessn
Y
Updated to 'yes' if the document contains USA Social
Security Number.
glbatypevisa
Y
glbaweight
Y
Comments
The Default Schema and Metadata Tags
71
The Default Schema and Metadata Tags
ITIN Namespace
Table 81
ITIN Namespace
Description
Identifies Tax Payer Identification Numbers and reports the risk levels in the
documents.
Service populating namespace
Deep, reclassify-action
Extraction Ruleset
ITIN
Assignment Ruleset
ITIN
Table 82
ITINNamespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
itinrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences ITIN numbers and a combination of literals 'Confidential' or
'Sensitive' or 'Restricted' or 'ITIN' or 'Tax Payer Identification Number/No'
in the document.
JCB Namespace
Table 83
72
JCB Namespace
Description
Identifies JCB Cards and reports the risk levels in the documents.
Service populating namespace
Deep, reclassify-action
Extraction Ruleset
JCB
Assignment Ruleset
JCB
Table 84
JCB Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
jcbrisk
Y
Y
IS1200 Metadata Reference Guide— v4.8.0
Reported as High, Low, Medium, Unknown depending on number of
occurrences of JCB Card numbers and a combination of literals
'Confidential' or 'Sensitive' or 'Restricted' in the document.
The Default Schema and Metadata Tags
Master Namespace
Table 85
Master Namespace
Description
Identifies Master Cards and reports the risk levels in the documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
Master
Assignment Ruleset
Master
Table 86
Master Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
masterrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences of Master Card numbers and a combination of literals
'Confidential' or 'Sensitive' or 'Restricted' in the document.
NIN Namespace
Table 87
NIN Namespace
Description
Identifies UK National Insurance Numbers and reports the risk levels in the
documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
NIN
Assignment Ruleset
NIN
Table 88
NIN Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
ninrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurances of UK National Insurance numbers and a combination of
literals 'Confidential' or 'Sensitive' or 'Restricted' in the document.
The Default Schema and Metadata Tags
73
The Default Schema and Metadata Tags
NPI Namespace
Table 89
Description
Identifies Non Public Information in a given document and reports the value.
Note - This namespace is not populated. The namespaces.config does not contain this
namespace.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
NPI
Assignment Ruleset
NPI
Table 90
74
NPI Namespace
NPI Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
npitype
Y
The document to be identified as 'NonPublicInfo' or 'PublicInfo'
depending on the npivalue of the document. The npivalue is
described below.
npivalue
Y
The document has this value populated if it contains one of the
following:
American Express Card #
Discover Card #
Master Card #
Visa Card #
Diners Club Card #
Enroute Card #
JCB Card #
Social Security #
Literals 'Confidential' or 'Sensitive' or 'Proprietary'
Date of Birth
California Drivers License
Customer Account Number
IS1200 Metadata Reference Guide— v4.8.0
Comments
The Default Schema and Metadata Tags
PCI Namespace
Table 91
PCI Namespace
Description
This namespace is not populated. The namespaces.config and ldap do not contain this
namespace.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
PCI
Assignment Ruleset
PCI
Table 92
PCI Namespace Tags
Tags
Saved in DB
(reportable)
PciCountAmEx
Y
PciCountDinersClub
Y
PciCountEIN
Y
PciCountGenericCC
Y
PciCountITIN
Y
PciCountJCB
Y
PciCountMasterCard
Y
PciCountNIN
Y
PciCountPAN
Y
PciCountSIN
Y
PciCountSSN
Y
PciCountVisa
Y
PciRisk
Y
PciTypeAmEx
Y
PciTypeConfidential
Y
PciTypeDinersClub
Y
PciTypeEIN
Y
Indexed
(Searchable)
Comments
The Default Schema and Metadata Tags
75
The Default Schema and Metadata Tags
Table 92
76
PCI Namespace Tags
Tags
Saved in DB
(reportable)
PciTypeGenericCC
Y
PciTypeInternalUse
Y
PciTypeITIN
Y
PciTypeJCB
Y
PciTypeMasterCard
Y
PciTypeNIN
Y
PciTypePAN
Y
PciTypeProprietary
Y
PciTypeRestricted
Y
PciTypeSIN
Y
PciTypeSSN
Y
PciTypeVisa
Y
PciWeight
Y
IS1200 Metadata Reference Guide— v4.8.0
Indexed
(Searchable)
Comments
The Default Schema and Metadata Tags
SIN Namespace
Table 93
SIN Namespace
Description
Identifies Canada Social Insurance Numbers and reports the risk levels in the
documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
SIN
Assignment Ruleset
SIN
Table 94
SIN Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
sinrisk
Y
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences of Canada Social Insurance numbers and a combination of
literals 'Confidential' or 'Sensitive' or 'Restricted' in the document.
SSNCC Namespace
Table 95
SSNCC Namespace
Description
Identifies Social Security Numbers and Credit Card Numbers and reports the risk
levels in the documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
SSNCC
Assignment Ruleset
SSNCC
Table 96
SSNCC Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
ssnccrisk
Y
Y
Reported as High, Low, Medium, Unknown depending on number of
occurrences of Social Security Number and Credit Card Numbers
and a combination of literals 'Confidential' or 'Sensitive' or 'Restricted'
or 'SSN#/SS#' or 'Social Security Number/No.' in the document.
The Default Schema and Metadata Tags
77
The Default Schema and Metadata Tags
SSN Namespace
Table 97
SSN Namespace
Description
Identies Social Security Numbers and reports the risk levels in the documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
SSN
Assignment Ruleset
SSN
Table 98
SSN Namespace Tags
Saved in DB
(reportable)
Tags
ssnrisk
Indexed
(Searchable)
Y
Comments
Reported as High, Low, Medium, Unknown depending on number of
occurrences of Social Security Number and a combination of literals
'Confidential' or 'Sensitive' or 'Restricted' or 'SSN#/SS#' or 'Social
Security Number/No.' in the document.
Visa Namespace
Table 99
Description
Identifies Visa Cards and reports the risk levels in the documents.
Services that populate it:
Deep, Reclassify-Action
Extraction Ruleset
Visa
Assignment Ruleset
Visa
Table 100
78
Visa Namespace
Visa Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
visarisk
Y
Y
IS1200 Metadata Reference Guide— v4.8.0
Reported as High, Low, Medium, Unknown depending on number of
occurrences of Visa Card numbers and a combination of literals
'Confidential' or 'Sensitive' or 'Restricted' in the document.
The Default Schema and Metadata Tags
Namespaces Populated by Optional Module Connectors
The metadata tags in the following namespaces are only populated
when a specific Optional Module Connector license is installed and a
deep classification is run on a data repository accessed by that
optional module. See the Optional Modules section of the Introduction
chapter of the IS1200 Web-Admin User and Configuration Guide for
more details on optional modules. The name of the Namespace table
indicates the Optional Module Connector required to extract these
namespaces and tags.
The Default Schema and Metadata Tags
79
The Default Schema and Metadata Tags
Atmoscloud Namespace
Table 101
Atmoscloud Namespace
Description
Atmos Cloud information tags
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 102
Atmoscloud Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
AtmosCloudGroup
Y
Y
Alias for Group id (GID) of the object
AtmosCloudObjectId
Y
Y
Alias for the Object id in Atmos
AtmosCloudObjectName
Y
Y
Alias for the name of the object (filename or
directory) in
Atmos
AtmosCloudOwner
Y
N
Alias for the User id (UID) of the owner of the object
AtmosCloudPolicyName
Y
Y
Alias for the name of the policy under which the
system stores an object.
AtmosCloudType
Y
Y
Alias for the type of object defined as follows:
• Regular: applicable for files or objects
• Directory: applicable for directories
Centera Namespace
Table 103
80
Centera Namespace
Description
Metadata of objects in Centera repository.
Service populating namespace
Reclassify, collection-target-indexing, copy-with-target-indexing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 104
Centera Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
centeraClipCreationTime
Y
Y
Creation time of clip in Centera.
centeraClipId
Y
Y
Clip Id assigned by Centera
centeraName
Y
Y
Clip name.
CSVC Documentum Namespace
Table 105
CSVC Documentum Namespace
Description
Metadata captured for objects in Documentum repository.
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 106
CSVC Documentum Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
CSVC_EMC_DocumentumIsUnderRetention
Y
N
Check if a particular document is under
retention or no.
CSVC_EMC_DocumentumIsOnHold
Y
N
Check if the document is on hold or no.
CSVC_EMC_DocumentumHold
Y
N
The hold value.
CSVC_EMC_DocumentumTitle
Y
Y
Title of the object. This is user-defined.
CSVC_EMC_DocumentumSubject
Y
Y
Subject of the object. This is user-defined.
CSVC_EMC_DocumentumAuthors
Y
Y
List of the authors for the object
CSVC_EMC_DocumentumKeywords
Y
Y
List of user-defined keywords for the object
CSVC_EMC_DocumentumApplicationType
Y
N
CSVC_EMC_DocumentumStatus
Y
N
Comments
The Default Schema and Metadata Tags
81
The Default Schema and Metadata Tags
Table 106
82
CSVC Documentum Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
Comments
CSVC_EMC_DocumentumModifier
Y
Y
Name of the user who made last modification
CSVC_EMC_DocumentumIsHidden
Y
N
Check whether an object is hidden or no.
CSVC_EMC_DocumentumIsDeleted
Y
N
If the object is the root version of a version
tree, deleting the object sets this attribute to
TRUE.
CSVC_EMC_DocumentumArchive
Y
N
CSVC_EMC_DocumentumCompoundArchitectu
re
Y
N
CSVC_EMC_DocumentumLinkResolved
Y
N
CSVC_EMC_DocumentumReferenceCount
Y
N
CSVC_EMC_DocumentumHasFolder
Y
N
CSVC_EMC_DocumentumFolderId
Y
N
CSVC_EMC_DocumentumLinkCount
Y
N
CSVC_EMC_DocumentumLinkHighCount
Y
N
CSVC_EMC_DocumentumAssembledFromId
Y
N
CSVC_EMC_DocumentumFrznAssemblyCount
Y
N
CSVC_EMC_DocumentumHasFrznAssembly
Y
N
CSVC_EMC_DocumentumResolutionLabel
Y
N
CSVC_EMC_DocumentumIsvirtualDoc
Y
N
CSVC_EMC_DocumentumContentsId
Y
N
CSVC_EMC_DocumentumContentType
Y
N
File format in which to display the contents of
the object attached to the task. (Note that
this may be different than the content type
specified for the object.)
CSVC_EMC_DocumentumPageCount
Y
N
Number of content files associated with the
object.
IS1200 Metadata Reference Guide— v4.8.0
Folder ID
SysObject is a virtual document. If the value
is 1, the object is a virtual document. If the
value is 0, the object is not a virtual
document unless the r_link_cnt is greater
than 0.
The Default Schema and Metadata Tags
Table 106
CSVC Documentum Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
CSVC_EMC_DocumentumFullText
Y
N
CSVC_EMC_DocumentumStorageType
Y
N
CSVC_EMC_DocumentumCabinetId
Y
N
ID of the cabinet
CSVC_EMC_DocumentumOwner
Y
Y
Name of the object’s owner. This can be a
user or group name.
CSVC_EMC_DocumentumOwnerPermit
Y
N
Object-level permission assigned to the
owner for this object.
CSVC_EMC_DocumentumGroup
Y
N
Group to which this object belongs.
CSVC_EMC_DocumentumGroupPermit
Y
N
CSVC_EMC_DocumentumWorldPermit
Y
N
CSVC_EMC_DocumentumAntecedentId
Y
N
CSVC_EMC_DocumentumChronicleId
Y
N
CSVC_EMC_DocumentumLatestFlag
Y
N
CSVC_EMC_DocumentumLockOwner
Y
N
CSVC_EMC_DocumentumLockDate
Y
N
CSVC_EMC_DocumentumLockMachine
Y
N
CSVC_EMC_DocumentumLogEntry
Y
N
CSVC_EMC_DocumentumBranchCount
Y
N
CSVC_EMC_DocumentumDirectDsc
Y
N
CSVC_EMC_DocumentumImmutableFlag
Y
N
CSVC_EMC_DocumentumFrozenFlag
Y
N
CSVC_EMC_DocumentumHasEvents
Y
N
CSVC_EMC_DocumentumAclDomain
Y
N
Identifies the domain of the ACL associated
with the object.
CSVC_EMC_DocumentumAclName
Y
N
The object name of the associated ACL.
CSVC_EMC_DocumentumSpecialApp
Y
N
Comments
Comments specified by the user.
The Default Schema and Metadata Tags
83
The Default Schema and Metadata Tags
Table 106
84
CSVC Documentum Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
CSVC_EMC_DocumentumIsreference
Y
N
CSVC_EMC_DocumentumCreatorName
Y
Y
CSVC_EMC_DocumentumIspublic
Y
N
CSVC_EMC_DocumentumPolicyId
Y
N
CSVC_EMC_DocumentumResumeState
Y
N
CSVC_EMC_DocumentumCurrentState
Y
N
CSVC_EMC_DocumentumAliasSetId
Y
N
CSVC_EMC_DocumentumEffectiveDate
Y
N
CSVC_EMC_DocumentumExpirationDate
Y
N
CSVC_EMC_DocumentumPublishFormats
Y
N
CSVC_EMC_DocumentumEffectiveLabel
Y
N
CSVC_EMC_DocumentumEffectiveFlag
Y
N
CSVC_EMC_DocumentumCategory
Y
N
CSVC_EMC_DocumentumLanguageCode
Y
N
CSVC_EMC_DocumentumIstemplate
Y
N
CSVC_EMC_DocumentumControllingApp
Y
N
CSVC_EMC_DocumentumFullContentSize
Y
N
CSVC_EMC_DocumentumExtendedProperties
Y
N
CSVC_EMC_DocumentumIsSigned
Y
N
CSVC_EMC_DocumentumLastReviewDate
Y
N
CSVC_EMC_DocumentumRetainUntil
Y
Y
CSVC_EMC_DocumentumAspectName
Y
N
IS1200 Metadata Reference Guide— v4.8.0
Comments
Attributes are either:
• The value of the relation_name attribute
in a dm_relation object or subtype of a
dm_relation.
• Room ID
Most restrictive retention date among all
content associated with this object.
The Default Schema and Metadata Tags
Table 106
CSVC Documentum Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
CSVC_EMC_DocumentumPartition
Y
N
CSVC_EMC_DocumentumIsreplica
Y
N
CSVC_EMC_DocumentumVStamp
Y
N
CSVC_EMC_DocumentumVersion
Y
Y
Version number (first version of version
label).
CSVC_EMC_DocumentumObjectID
Y
Y
Document ID assigned by DCTM
Comments
CSVC SharePoint Namespace
Table 107
Table 108
SharePoint Namespace
Description
Metadata captured for objects in SharePoint repository
Services that populate it:
Deep, Reclassify-Action,
collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing,
in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
CSVC SharePoint Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
CSVC_EMC_SharePointIsLeafObject
Y
N
An object that is not a directory or a
folder.
CSVC_EMC_SharePointSize
Y
N
Content size
CSVC_EMC_SharePointIsUnderRetention
Y
N
Whether an object is under retention or
no.
CSVC_EMC_SharePointRetentionClass
Y
N
The name of the retention class.
Comments
The Default Schema and Metadata Tags
85
The Default Schema and Metadata Tags
Table 108
86
CSVC SharePoint Namespace Tags
Tag
Saved in DB
(Reportable)
Indexed
(Searchable)
Comments
CSVC_EMC_SharePointIsOnHold
Y
N
Whether an object is on hold or no.
CSVC_EMC_SharePointHold
Y
N
Provides the Hold value.
CSVC_EMC_SharePointAuthor
Y
Y
Name of the user who created the
document in SharePoint
CSVC_EMC_SharePointOwners
Y
N
List ofSharePoint owners.
CSVC_EMC_SharePointOwner
Y
Y
Name of the object’s owner. This can be
a user or group name.
CSVC_EMC_SharePointCreator
Y
Y
User ID of the form domain\userid
CSVC_EMC_SharePointModifier
Y
Y
Name of the user who made last
modification
CSVC_EMC_SharePointEditor
Y
Y
CSVC_EMC_SharePointAllowAnonymousAccess
Y
N
CSVC_EMC_SharePointDescription
Y
N
Description of the SharePoint object
CSVC_EMC_SharePointLocale
Y
N
Locale of the object.
CSVC_EMC_SharePointTitle
Y
N
Title of the object. This is user-defined.
CSVC_EMC_SharePointDisplayName
Y
N
Name displayed on the SharePoint
server.
CSVC_EMC_SharePointVersion
Y
Y
Version number of the document
CSVC_EMC_SharePointObjectID
Y
Y
Document ID assigned by SharePoint
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Directory Namespace
Directory Namespace
Table 109
Description
Namespace that populates up to first five directory names of a file path
Service populating namespace
Basic, Deep, Reclassify-Action
Extraction Ruleset
n/a
Assignment Ruleset
DirectoryReporting
Directory Namespace Tags
Table 110
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
directorydir1
Y
First directory in the file path (if it exists)
directorydir2
Y
Second directory in the file path (if it exists)
directorydir3
Y
Third directory in the file path (if it exists)
directorydir4
Y
Fourth directory in the file path (if it exists)
directorydir5
Y
Fifth directory in the file path (if it exists)
Comments
Domino Namespace
Domino Namespace
Table 111
Description
Metadata attributes related to Domino.
Service populating namespace
Deep, reclassify, collection-with-source-fulltext-indexing, in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Domino Namespace Tags
Table 112
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
DominoMailbox
Y
Y
The mail box of the domino user.
The Default Schema and Metadata Tags
87
The Default Schema and Metadata Tags
Dumpster Namespace
Table 113
Dumpster Namespace
Description
The dumpster information of an email object.
Service populating namespace
Deep, Reclassify-Action, collection-with-source-fulltext-indexing,
in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 114
Dumpster Namespace Tags
Saved in DB
(reportable)
Tags
Indexed
(Searchable)
dumpstermail
Comments
Values [yes|no]. The value is set to 'yes' when the object is
stored in the dumpster and vice versa.
EnterpriseVault Namespace
Table 115
Description
Metadata captured for objects in Enterprise Vault repository.
Service populating namespace
Deep, reclassify-action, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 116
88
EnterpriseVault Namespace
EnterpriseVault Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
enterprisevaultAuthor
Y
Y
Author of item (Sender for e-mail
messages)
enterprisevaultLegalHolds
Y
Y
Legal Hold Id, if legal hold has been set for
the object
enterprisevaultOriginalLocation
Y
IS1200 Metadata Reference Guide— v4.8.0
Original object location or mailbox name
The Default Schema and Metadata Tags
Table 116
EnterpriseVault Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable) Comments
enterprisevaultRetentionCategoryId
Y
enterprisevaultTitle
Y
Y
Title in Enterprise Vault
enterprisevaultVaultname
Y
Y
Archive name in Enterprise Vault
Identifier of Retention Category in
Enterprise Vault
The Default Schema and Metadata Tags
89
The Default Schema and Metadata Tags
Exchange Namespace
Exchange Namespace
Table 117
Description
The metadata attributes related to Exchange.
Service populating namespace
Deep, reclassify-action, collection-with-source-fulltext-indexing,
in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 118
Exchange Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
Comments
ExchangeMailbox
Y
Y
The mail box of the exchange account.
NSF Namespace
Table 119
NSF Namespace
Description
Service populating namespace Deep, reclassify, collection-with-source-fulltext-indexing,
collection-target-indexing, copy-with-target-indexing, in-place-processing
Extraction Ruleset
fulltext (rule)
Assignment Ruleset
n/a
Table 120
90
NSF Namespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
NSFDocument
Y
Y
NSFISLooseDXL
Y
Y
IS1200 Metadata Reference Guide— v4.8.0
Comments
The Default Schema and Metadata Tags
Sourceonearchive Namespace
Table 121
SourceonearchiveNamespace
Description
Metadata captured for objects in SourceOne Archive repository
Services that populate it:
Deep, Reclassify-Action, collection-with-source-fulltext-indexing, in-place-processing
Extraction Ruleset
n/a
Assignment Ruleset
n/a
Table 122
SourceonearchiveNamespace Tags
Tags
Saved in DB
(reportable)
Indexed
(Searchable)
SourceOneArchiveFolderName
Y
Y
SourceOne Business Folder
Name
SourceOneArchiveMetaDataModificationTime
Y
N
Metadata Modification Time (for
Kazeon internal use)
SourceOneArchiveOwner
Y
Y
List of owners for archived
e-mails
SourceOneArchiveFileName
Y
Y
Filename of a SourceOne
archived object
SourceOneArchiveObjectType
Y
Y
Type of objects in SourceOne
archive
SourceOneArchiveOriginalFileLocation
Y
N
Location of file objects from
where it’s been archived in
SourceOne
SourceoneArchiveFileVersion
Y
N
File version of a Sharepoint
object
SourceoneArchiveOriginalOwner
Y
N
The intersection of owner filter
with all owners from SourecOne
archive.If no owner filter is
specified then the value will be
all owners belonging to an object
in SourceOne
Comments
The Default Schema and Metadata Tags
91
The Default Schema and Metadata Tags
Alphabetic Listing of Indexed Default Metadata Tags:
Not all default tags are automatically indexed, only those that contain
the key word “indexed”in the Schema, see “Types of Attributes for
‘set_schema’ CLI Command” on page 8 for details.
The following columns list all default metadata tags that contain the
“indexed” keyword.
Table 123
92
Alphabetic Listing of Indexed Default Metadata Tags
amexrisk
centeraClipCreationTime
centeraClipId
centeraName
ContactFullName
ContactNickname
detectedalternates
detectedlanguage
dinersclubrisk
discoverrisk
DocumentAuthor
DocumentCategory
DocumentCompany
DocumentCreationDate
DocumentDescription
DocumentEditor
DocumentLanguage
DocumentLastSavedBy
DocumentManager
DocumentOwner
DocumentProject
DocumentPublisher
DocumentSource
DocumentSubject
DocumentTitle
documentumAuthors
documentumKeywords
documentumOwnerName
documentumRetainUntil
documentumSubject
documentumTitle
DominoMailbox
dumpstermail
einrisk
enterprisevaultAuthor
enterprisevaultLegalHolds
enterprisevaultTitle
enterprisevaultVaultname
entitylanguage
EnvcommonMtime
EnvcommonRepositorytype
ExchangeMailbox
glbarisk
HierarchyDisplayObjectType
HierarchyErrorMessage
HierarchyFileName
HierarchyFolderName
InheritedMailBccLevel1
InheritedMailBccLevel2
InheritedMailBccLevel3
InheritedMailCcLevel1
InheritedMailCcLevel2
InheritedMailCcLevel3
InheritedMailFromLevel1
InheritedMailFromLevel2
InheritedMailFromLevel3
InheritedMailReceivedLevel
InheritedMailReceivedLevel2
InheritedMailReceivedLevel3
InheritedMailSubjectLevel1
InheritedMailSubjectLevel2
InheritedMailSubjectLevel3
InheritedMailToLevel1
IS1200 Metadata Reference Guide— v4.8.0
The Default Schema and Metadata Tags
Table 123
Alphabetic Listing of Indexed Default Metadata Tags
InheritedMailToLevel2
InheritedMailToLevel3
itinrisk
jcbrisk
KazcollectionRuleName
KazeonConfidential
KazeonExtDate
Kazeoninternaluseonly
KazeonPostalCode
KazeonPrice
KazeonSocialSecurityNumber
KazeonStockSymbol
KazeonTelephoneNumber
KazeonUri
legalappCase
legalappLastReviewedBy
legalappLastReviewedDate
legalappNotes
legalappReviewedAs
legalappReviewer
legalappTag
legalholdlegalholdenforcefs
legalholdexecutor
legalholdtime
legalholdtype
MailAttachment
MailBcc
MailCc
MailConversationTopic
MailDate
MailDerivedDirection
MailDerivedDirectionInheritedLevel1
MailDerivedDirectionInheritedLeve
l2
MailDerivedDirectionInheritedLevel3
MailDerivedFromDomain
MailDerivedFromDomainInheritedLevel2
MailDerivedFromDomainInheritedLevel3
MailDerivedRecipientDomain
MailDerivedRecipientDomainInheritedLevel1
MailDerivedRecipientDomainInheritedLevel2
MailDerivedRecipientDomainInher
itedLevel3
MailEmail
MailFrom
MailMessageClass
MailMessageId
MailReceived
MailSize
MailSubject
MailTo
masterrisk
ninrisk
ObjcommonMtime
OriginalsourceAdHocTag
OriginalsourceCustodian
OriginalsourceDocId
OriginalsourceLocation
PersistentNameSpaceLegalCaseNum
processingservicename
retentionsetusersharepointAuthor
sharepointCreatedBy
sharepointEditorsharepointModifiedBysinrisk
SourceOneArchiveFolderNamess
nccrisk
ssnrisk
SystemAtime
SystemCasid
SysemFullText
SystemGroupName
SystemLastSeenTime
SystemCasidV
SystemCreationTime
SystemCTime
SystemEndVersionTime
SystemFileExtension
SystemFilepath
The Default Schema and Metadata Tags
93
The Default Schema and Metadata Tags
Table 123
Alphabetic Listing of Indexed Default Metadata Tags
SystemFilerName
SystemFileSize
SystemFileType
SystemMtime
SystemObjectName
SystemObjectType
SystemOwnerName
SystemPolicyGroups
SystemRetentionTime
SystemSnaplocked
SystemSnaplockTime
SystemSsHasBeenDeleted
SystemStartVersionTime
SystemTypeExtension
SystemUniqueDocumentID
TaskSubject
TIFFArtist
TIFFImageDescription
visarisk
94
IS1200 Metadata Reference Guide— v4.8.0
3
Metadata Mapping
Data repositories that the IS1200 accesses using and Optional Module
Connector maintain unique server-specific metadata beyond the
standard file metadata maintained by NFS and CIFS servers.
Additionally, some of these servers also allow users and applications
to define and maintain custom metadata. This chapter explains how
to set up mapping files that allow the IS1200 to capture and retrieve
the standard server-specific, as well as the user or application defined
metadata in the standard IS1200 metadata repositories.
Topics include:
◆
◆
◆
◆
◆
◆
◆
Background .........................................................................................
Creating a Mapping File to Capture OMR Custom Attributes ...
Identifying the OMR Custom Attributes........................................
Identifying and Creating the IS1200 Metadata Tags to Map To ..
Obtaining the Mapping Template from the IS1200 .......................
Editing and Saving the Mapping File .............................................
Mapping File Naming and Syntax ..................................................
Metadata Mapping
96
97
97
97
98
98
99
95
Metadata Mapping
Background
All IS1200 installations can register, classify, as well as search and
report on NFS and CIFS data repositories. If the appropriate Optional
Module license is installed, and properly setup and configured, the
IS1200 may also work with a variety of other servers, or data
repositories. Examples include, Microsoft Exchange or SharePoint
servers, EMC Centera or Documentum servers, Lotus Domino, etc. In
this guide, these Optional Module Repositories (servers), are referred
to as OMR.
Some OMR the IS1200 can work with save a pre-defined, but
server-specific, set of metadata for each file they store. While this
metadata is often unique to the server type, it is standard for that server,
and the IS1200 automatically captures this standard server-specific
metadata by default once the appropriate Optional Module license is
installed, and the module setup and configuration is completed. For
these servers, the Optional Module or Connector user guide that
describes them, also contains a section that lists the standard
server-specific metadata automatically classified for that server.
Additionally, some of these OMR allow users or applications to
define additional custom metadata, or attributes, for the files they
store. For example, SharePoint users can use site columns or column
properties to create new SharePoint metadata or custom properties.
See your Microsoft SharePoint Server User Guide for more details.
This server-specific custom metadata may also be automatically
collected during Kazeon classifications by creating a metadata mapping
file. The mapping file allows the custom metadata to be indexed in
the IS1200 search schema, and allows users to search registered OMR
for this custom metadata using the Web-Search application.
A template for the metadata mapping file is provided on all standard
IS1200 installations. Once modified, and saved with the correct
mapping filename, the IS1200 automatically uses the mapping file to
extract the custom metadata during classifications of OMR.
The mapping file lists the names of all OMR custom-defined
metadata you want collected classified and associates each with a
Kazeon metadata field defined in the Metadata Schema. See “The
Metadata Schema and Tag Management” on page 1 for details on
adding custom metadata fields and namespaces to the IS1200
Metadata Schema. Additionally, the mapping file can define how to
96
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
reverse the process when copying or moving files to an OMR
repository (i.e. it can specify what IS1200 metadata to copy into what
OMR custom metadata fields.)
The following sections describe how to create a mapping file. Once a
mapping file is created and copied to the IS1200, OMR custom
metadata is automatically collected and indexed during subsequent
IS1200 classifications.
Creating a Mapping File to Capture OMR Custom Attributes
The following general steps are required to create a mapping file:
1. Identify the OMR metadata to be captured during IS1200
classifications.
2. Identify the IS1200 metadata tags each OMR metadata attribute
will be mapped to.
3. Obtain the Kazeon IS1200 mapping file template from your
IS1200 server.
4. Edit a copy of the template, to add the appropriate mappings, and
then save it on the IS1200.
When saved with an appropriate mapping file name, the IS1200
automatically begins using the mapping file for all subsequent
classifications.
Identifying the OMR Custom Attributes
The list of standard and custom OMR-specific metadata is usually
defined when the OMR Connector is installed, specifically, when the
recommended configuration lists for that OMR are filled out as part
of the OMR installation and configuration process. See the
appropriate OMR connector user guide for it’s configuration list.
If this list was not made, contact your OMR administrator for a list of
custom attributes—and their OMR names—for your OMR server.
Identifying and Creating the IS1200 Metadata Tags to Map To
Each OMR metadata attribute must have a specific IS1200 metadata
tag to map to. If the IS1200 metadata tags do not already exist, they
must be created.
Metadata Mapping
97
Metadata Mapping
Make a list of the OMR metadata attributes you want to extract, and
then identify the IS1200 metadata tag to map it to.
◆
If the IS1200 metadata tags do no already exist, create them using
the procedures described in “The Metadata Schema and Tag
Management” on page 1.
◆
Make sure the field type of each Kazeon metadata tag matches the
OMR metadata type that will be mapped to it.
◆
If necessary, create IS1200 namespaces to contain the new IS1200
metadata tags, see the procedures described in the “The
Metadata Schema and Tag Management” on page 1.
Obtaining the Mapping Template from the IS1200
A template for creating a new mapping file is automatically saved to
your IS1200 server when the IS1200 software is installed. To access
this file:
1. Logon to the IS1200
2. Move a copy of the file:
/opt/openkaz/config/repository_tag_map_template.xm
to someplace you can edit it.
Note: DO NOT edit the original template file. Edit a copy.
Editing and Saving the Mapping File
Open the template file copy obtained in the step above and edit it to
add the mappings. When the edits are finished, save it as:
/opt/openkaz/config/sharepoint_custom_map.xml.
For multi-node clusters, this file must be saved on all cluster nodes.
Edit your template file as shown in the example below, and keeping
in mind the syntax described in the following section.
Example:
To extract Sharepoint custom attributes ‘sharePointProp1’ and
‘sharePointProp2’ during a classification and map them to IS1200
metadata tags ‘tag1’in namespace ‘Kazeon’ and ‘tag2’ in
namespace ‘custom’.
98
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
And, during copies from an IS1200 registered repository to a
SharePoint repository, to copy IS1200 metadata values with the tag
‘KazeonTag1’ and ‘CustomTag4’ to the SharePoint metadata attributes
‘prop3’ and ‘prop4’,
Unless, ‘CustomTag4’ was not applied to a file, then set it to default
value of ‘this-is-default’
<MetadataMap>
<Maps RepositoryType = "sharepoint" Operation = "crawl">
<Map RepositoryTagName = "sharepointProp1"
RepositoryDataType = "Text"
KazeonTagName = "kazeontag1"
multivalued = "no" />
<Map RepositoryTagName = " sharepointProp2"
RepositoryDataType = "Text"
KazeonTagName = "customtag2"
multivalued = "no" />
</Maps>
<!-- Mapping used during copy to repository (operation = "copy") -->
<Maps RepositoryType = "sharepoint" Operation = "copy">
<Map RepositoryTagName = "prop3"
KazeonTagName = "KazeonTag3"
<Map RepositoryTagName = "prop4"
KazeonTagName = "CustomTag4"
RepositoryDataType = "Text"
<Map RepositoryTagName = "prop5"
/>
defaultValue = "this-is-default" />
KazeonTagName = "kazeontag5"
RepositoryDataType = "Integer"
multivalued = "no" />
</Maps>
</MetadataMap>
Mapping File Naming and Syntax
When creating a custom mapping file, the filename should be:
/opt/openkaz/config/<OMR_Name>_custom_map.xml
where <OMR_Name> is the name of the OMR server type.
For examples:
For SharePoint:
/opt/openkaz/config/sharepoint_custom_map.xml
For Documentum:
/opt/openkaz/config/documentum_custom_map.xml
The custom mapping file can be validated using:
Metadata Mapping
99
Metadata Mapping
/opt/openkaz/bin/repomdmap <map-file-name>
For example:
/opt/openkaz/bin/repomdmap/opt/openkaz/config/sharepoint
_custom_map.xml
Use the following syntax, when creating a mapping file.
Description of XML tags and attributes:
================================================
Attributes of 'Maps'
=================================
1. RepositoryType :
(required)
'sharepoint' - for SharePoint
'documentum' - for Documentum
2.
Operation:
(required)
Possible values:
a) crawl
b) copy
Attributes of 'Map'
=================================
1. RepositoryTagName :
(required)
This is the name of attribute in repository
(SharePoint column name, Documentum attribute)
2. KazeonTagName :
(required)
3. DefaultValue: (optional)
This value will be used in copy operation, if KazeonTag is
not available for the object
4. RepositoryDataType:
(optional)
Valid values:
a) Text
b) Integer
c) DateTime
100
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
d) Boolean
e) Decimal
If not specified, default is 'Text'
5. multivalued: (optional)
Indicates that repository tag is multi valued tag
Possible values:
a)
yes
: if this is multivalued attribute
b)
no
: if this is single valued attribute
Default: 'no'
Default Connector Metadata Mappings
The following tables list the default mappings used between the
IS1200 and various connector-accessed repositories (such as Centera,
SharePoint, etc.) when coping data from an IS1200 data repository to
a connector-accessed data repository. The tables show what metadata
fields from the source repository are copied to what metadata fields
on the target repository.
For example, in Table 126, “Centera Default Metadata Mappings,” on
page 104 below, when doing a copy to a Centera repository, if you
read across any row in the table (for example the first row, atime):
◆
The first column specifies the metadata field on the target (in this
case a Centera metadata repository) the source metadata will be
copied to, and
◆
Columns NFS (3rd) - EV (10th) tell you what metadata field from
a particular source (specified by the column heading) will be
copied into that target metadata field.
Metadata Mapping
101
Metadata Mapping
NFS Default Metadata Mappings
Table 124
NFS Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into NFS repository
Source Repository IS1200 Tags or Sources
Target IS1200
Tags
102
NFS
CIFS
Centera
Documentu
m
Exchange
Sharepoint
Domino
EV
atime
atime
atime
atime
atime
atime
atime
atime
atime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
amtime
ctime
Set by
system.
Captured
in
metadata
ctime
N/A
N/A
N/A
N/A
N/A
N/A
N/A
creationtime
Not
Applicable
creationtime
N/A
N/A
N/A
N/A
N/A
N/A
N/A
owner
owner
owner (uid)
owner
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
group
group
group (gid)
group
Not Preserved
Not Preserved
Not Preserved
b
Not Preserved
OriginalSource
Tags
OriginalSource OriginalSource
Tags
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
CIFS Default Metadata Mappings
Table 125
CIFS Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into CIFS repository
Source Repository IS1200 Tags or Sources
Target IS1200
Tags
NFS
CIFS
Centera
Documentum
Exchange
Sharepoint
Domino
EV
atime
atime
atime
atime
atime
atime
atime
atime
atime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
creationtime
Not Preserved
Not Preserved
Not Preserved
creation time
Not Preserved
archive time
ctime
Set by
system.
Captured in
metadata
creationtime
owner
Same as
object
owner
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
group
Same as
object
group
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
cifs owner
UID
cifs owner
Not Preserved
Not Preserved
Mailbox User
Sharepoint
owner
MailBox User
Not Preserved
cifs group
GUI
cifs owner
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
OriginalSource
Tags
OriginalSource OriginalSource
Tags
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
Metadata Mapping
103
Metadata Mapping
Centera Default Metadata Mappings
Centera Default Metadata Mappings
Table 126
IS1200 Metadata Creation on copy/collection into Centera repository
Source Repository IS1200 Tags or Sources
Target IS1200
Tags
CIFS
Centera
Documentum
Exchange
Sharepoint
Domino
EV
atime
Copied
from source
object
atime
atime
atime
atime
atime
atime
atime
atime
mtime
Copied
from source
object
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
ctime
Copied
from source
object
ctime
ctime
ctime
ctime
ctime
ctime
ctime
ctime
creationtime
Copied
from source
object
creationtime
creationtime
creationtime
creationtime
creationtime
creationtime
creationtime
creationtime
owner
root (don't
use)
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
group
root (don't
use)
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
cifs owner
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
cifs group
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
OriginalSource OriginalSource
Tags
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
Assigned by
Centera
Assigned by
Centera
Assigned by
Centera
Assigned by
Centera
Assigned by
Centera
Assigned by
Centera
Assigned by
Centera
OriginalSource
Tags
centeraClipId
104
NFS
Preserved
Assigned by
Centera
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
Table 126
Centera Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Centera repository
atime
Copied
from source
object
atime
atime
atime
atime
atime
atime
atime
atime
mtime
Copied
from source
object
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
ctime
Copied
from source
object
ctime
ctime
ctime
ctime
ctime
ctime
ctime
ctime
Metadata Mapping
105
Metadata Mapping
Enterprise Vault (EV) Default Metadata Mappings
Enterprise Vault (EV) Default Metadata Mappings
Table 127
IS1200 Metadata Creation on copy/collection into EV repository
Source Repository IS1200 Tags or Sources
Target IS1200
Tags
106
NFS
CIFS
Centera
Documentum
Exchange
Sharepoint
Domino
EV
atime
Set to
Not Preserved
archive time
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
mtime
Source
Object's
mtime is
preserved
mtime
mtime
mtime
mtime
mtime
mtime
mtime
mtime
ctime
Archive
Time
assigned by
EV
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
creationtime
Archive
Time
assigned by
EV
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
owner
root (don't
use)
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
group
root (don't
use)
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
OriginalSource
Tags
Preserved
OriginalSource OriginalSource
Tags
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
enterprisevault
RetentionCateg
oryId
Selected by
user
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
Table 127
Enterprise Vault (EV) Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into EV repository
enterprisevaultL
egalHolds
Holde Id (if
applicable)
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
enterprisevaultA Owner of
uthor
Source
Object
owner
owner
owner
owner
owner
owner
owner
owner
enterprisevaultT
itle
File name
File name
File name
File name
File name
File name
File name
File name
enterprisevault
Directory of
OriginalLocation Source File
Directory
name
Directory
name
Directory
name
Directory name
Directory name Directory
name
Directory name Directory
name
enterprisevaultV Selected by
aultname
user
N/A
N/A
N/A
N/A
N/A
N/A
Source File
Name
N/A
N/A
Metadata Mapping
107
Metadata Mapping
SharePoint Default Metadata Mappings
Table 128
SharePoint Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Sharepoint repository
Source Repository IS1200 Tags or Sources
Documentu
m
Exchange
Sharepoint
Domino
EV
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Object
creation
time in
SharePoint
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
owner
User-id
used for
repository
registration
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
group
id used for
repository
registration
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
OriginalSource Tags
Preserved
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
Target IS1200 Tags
108
Action
NFS
CIFS
atime
Object
creation
time in
SharePoint
Not Preserved
Not Preserved
mtime
Object
modifictaion
time in
SharePoint
Not Preserved
ctime
Object
creation
time in
SharePoint
creationtime
IS1200 Metadata Reference Guide— v4.8.0
Centera
Metadata Mapping
Table 128
SharePoint Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Sharepoint repository
sharepointURL
Assigned by
SharePoint
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointVersion
Assigned by
SharePoint
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointCheckoutBy
Empty
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointModifiedBy
id used for
repository
registration
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointCreatedBy
id used for
repository
registration
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointAuthor
id used for
repository
registration
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointEditor
id used for
repository
registration
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
sharepointMimeType
Assigned by
SharePoint
N/A
N/A
N/A
N/A
N/A
Not Preserved
N/A
N/A
Metadata Mapping
109
Metadata Mapping
Documentum Default Metadata Mappings
Table 129
Documentum Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Documentum repository
Source Repository IS1200 Tags or Sources
Target IS1200 Tags
110
Action
NFS
CIFS
Exchange
Sharepoint
Domino
EV
atime (Not preserved)
Object
creation time
in
Documentum
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
mtime (Not preserved)
Object
creation time
in
Documentum
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
ctime (Not preserved)
Object
creation time
in
Documentum
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
creationtime(Not
preserved)
Object
creation time
in
Documentum
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
owner (Not preserved)
User-id used
for repository
registration
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
group (Not preserved)
groupid of
suer used for
repository
registration or
Unknown
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
Not Preserved
cifs owner
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
N/A
N/A
cifs group
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
N/A
N/A
IS1200 Metadata Reference Guide— v4.8.0
Centera Documentum
Metadata Mapping
Table 129
Documentum Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Documentum repository
OriginalSource Tags
Preserved
from source
repository
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
OriginalSource
Tags
documentumAclDomain
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumAclName
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumAuthors
User-id used
for repository
registration
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumContentType
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumCreatorName
User-id used
for repository
registration
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumDeleted
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumGroupName
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumKeywords
Empty
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumLogEntry
Empty
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumMimeType
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumModifiedBy
User-id used
for repository
registration
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
Metadata Mapping
111
Metadata Mapping
Table 129
Documentum Default Metadata Mappings
IS1200 Metadata Creation on copy/collection into Documentum repository
112
documentumObjectId
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumObjectType
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumOwnerName
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumOwnerPermit
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumRetainerID
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumRetainUntil
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumSubject
Empty
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumTitle
Empty
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumVersionLabel
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
documentumVirtualDoc
Assigned by
Documentum
N/A
N/A
N/A
Not Preserved
N/A
N/A
N/A
N/A
IS1200 Metadata Reference Guide— v4.8.0
Metadata Mapping
Metadata Mapping
113
Metadata Mapping
114
IS1200 Metadata Reference Guide— v4.8.0
4
Legal Export Default Tags
This chapter describes the IS1200 metadata fields that are exported,
by default, during an Actionable Services Legal Export operation.
Topics include:
◆
◆
◆
Legal Export Tags Default profile.................................................. 116
Post Legal Export Tagging for Rolling Exports ........................... 118
Rolling Export Metadata Tags........................................................ 118
Legal Export Default Tags
115
Legal Export Default Tags
Legal Export Tags Default profile
The following table lists the IS1200 Schema fields exported using the
default export template for Concordance and Lextranet export. The
table applies only to versions IS1200 software v4.4.0 and greater.
Note: Modification of these output fields can only be performed by EMC
Professional Services.
Table 130
116
Default Kazeon Metadata Fields Exported During Concordance Legal
Export
Concordance
Tag Name
IS1200 Source Metadata Tag
Tag Type Name
Description
ObjMTime
Date*
ObjcommonMtime
Object common Mtime
ResposType
Text
EnvcommonRepositorytype
Repository Type
EnvMTime
Date*
EnvcommonMtime
Environment common Mtime
Fldrname
Text
HierarchyFolderName
Hierarchy Folder Name
Filename
Text
HierarchyFileName
Hierarchy File Name
FileType
Text
FileType
File Type
Filepath
Text
Filepath
File Path
FileSize
Text
FileSize
File Size
Group
Text
GroupName
Group Name
Owner
Text
OwnerName
Owner Name
ObjName
Text
ObjectName
Object Name
ObjType
Text
ObjectType
Object Type
FileExt
Text
FileExtension
File extension, the actual system object file extension
TypeExt
Text
TypeExtension
File true type extension, the programmatically derived file
extension, may differ from the FileExt field
OrigsrcCust
Text
OriginalsourceCustodian
Original source custodian, reflects the original file/object
custodian. May be overridden during processing using
custodian mapping
IS1200 Metadata Reference Guide— v4.8.0
Legal Export Default Tags
Table 130
Default Kazeon Metadata Fields Exported During Concordance Legal
Export
Concordance
Tag Name
IS1200 Source Metadata Tag
Tag Type Name
OrigsrcLoc
Text
OriginalsourceLocation
Original Source location, reflects the first full filepath the object
was found in
OrigsrcCLoc
Text
OriginalsourceContainerlocation
Origianal Source Container Location. For objects found in
containers during processing, reflects the container path
OrigsrcDoId
Text
OriginalsourceDocId
Original source document ID
ATime
Date*
Atime
Access time, reflects the last file access time
MTime
Date*
Mtime
Modified Time, reflects the last file modification time
CTime
Date*
Ctime
Metadata change time
CrtTime
Date*
Creation Time
File creation time
legalRvwer
Text
legalappReviewer
Legal reviewer assigned to this file
legalTag
Text
legalappTag
Legal case review tag
RvwdAs
Text
legalappReviewedAs
legalNotes
Text
legalappNotes
Standard legal review notes of this file
LastRvwdBy
Text
legalappLastReviewedBy
File last reviewed by
LastRvwdDate
Text
legalappLastReviewedDate
File last reviewed date
Description
*Default date format is "yyyy/mm/dd".
Legal Export Default Tags
117
Legal Export Default Tags
Post Legal Export Tagging for Rolling Exports
After any successful Legal Export operation, where an active-case
was selected and the Rolling Export options were used, a tagging job is
run after the legal export completes to update the exported file’s
source metadata with the export jobID, legal case number, and export
time.
See the Legal Exports: Rolling Exports heading of the Actionable Services
sections of the IS1200 Web-Search User Guide or
IS1200 eDiscovery Case Manager Administrators and Supervisors Guide
for more background on Rolling Exports. See “LegalExport
Namespace” on page 44 for descriptions of the metadata tags that are
updated.
Rolling Export Metadata Tags
The Rolling Export option Mark Exported Objects adds special case tags
to the exported file’s metadata in their source index. The tags record
what case and job the files were exported under. Once applied, these
tags appear in review filters.
The two new metadata tag names are:
◆
LegalExportCaseIds - which identifies whether the object has
been exported
◆
LegalExportJobIds - which identifies the export job ids
For example, the following metadata tag/values might be searched
for:
LegalExportCaseIds=case.1.yes, case.2.yes
LegalExportJobIds=case.1.job.20, case.2.job.21
118
IS1200 Metadata Reference Guide— v4.8.0
A
NSF Metadata Tags
This appendix describes how Lotus NSF Metadata is Mapped to
IS1200 Tags.
Topics include:
◆
How NSF Metadata is Mapped to IS1200 Tags ........................... 120
NSF Metadata Tags
119
NSF Metadata Tags
How NSF Metadata is Mapped to IS1200 Tags
The following table shows how Lotus metadata is mapped to IS1200
metadata.
Table 131
Lotus Email Metadata Mappings
Lotus notes fields
EML Fields per RFC822
IS1200 eMail Search
Template Fields
Created: 02/25/2010 03:30:56 PM PST
ctime
Modified: 02/25/2010 03:31:37 PM PST
mtime
Accessed: 02/25/2010 03:31:37 PM PST
atime
DeliveredDate:
PostedDate: 02/25/2010 03:31:37 PM PST
Date (Inbox other folder items
)
Email received
Date ( For Sent Items )
$Revisions:
RouteTimes:
Principal: CN=akiko4/O=kazeon
$langPrincipal:
$altPrincipal:
InetSendTo: .
InetCopyTo: .
InetBlindCopyTo: .
$StorageTo: 1
$StorageCc: 1
$StorageBcc: 1
$Mailer: Lotus Notes Release 6.5 September 26, 2003
$MessageID: <OF6A9B442F.1FF44B1FON882576D5.00812CF4-882576D5.00813D1D@LocalDomain>
Message-ID
Message-ID
From
From
Recipients: CN=akiko4/O=kazeon@kazeon
MailOptions: 0
SaveOptions: 1
Form: Memo
From: CN=Administrator/O=kazeon
120
IS1200 Metadata Reference Guide— v4.8.0
NSF Metadata Tags
Table 131
Lotus Email Metadata Mappings
Lotus notes fields
AltFrom: CN=Administrator/O=Kazeon
EML Fields per RFC822
IS1200 eMail Search
Template Fields
Sender
Logo: stdNotesLtr0
useApplet: True
DefaultMailSaveOptions: 1
Query_String:
ExpandPersonalGroups: 1
tmpImp:
Sign:
Encrypt:
Encrypted
SendTo: CN=akiko4/O=kazeon@kazeon
To
To
CopyTo: CN=akiko5/O=kazeon@kazeon
CC
Cc
BlindCopyTo: CN=akssach5/O=kazeon@kazeon
BCC
Bcc
Subject
Subject
Subject: test cc, bcc
Body:
Everything after 1st null is
body
wBody:
EnterSendTo: akiko4
EnterCopyTo: akiko5
EnterBlindCopyTo: akiko6
$UpdatedBy: CN=Administrator/O=kazeon
NSF Metadata Tags
121
NSF Metadata Tags
122
IS1200 Metadata Reference Guide— v4.8.0
B
Using DICOM Tags
This appendix provides reference information useful when extracting
DICOM properties as extended attributes.
Topics include:
◆
◆
◆
◆
◆
Using DICOM File Attributes as Metadata.................................. 124
Enabling DICOM File Recognition................................................ 124
Selecting the DICOM Properties to Extract.................................. 125
Adding DICOM File Attributes to the Search Schema............... 125
DICOM Properties That Can Be Extracted As Extended Attributes
126
123
Using DICOM File Attributes as Metadata
DICOM files contain hundreds of standard file attributes that are
useful in searches.
Starting with version 3.1 Hotfix 3, recognition of DICOM files is OFF
by default and must be specifically enabled when desired. Even when
recognition is enabled, only the file-type DICOM is extracted unless
specific DICOM properties are selected for extraction using extraction
rules and certain configuration file settings.
The IS1200 only searches DICOM file headers for their attributes, it
does not open or parse the actual image data.
Enabling DICOM File Recognition
DICOM file processing (recognition) is optional, and OFF by default.
As such, DICOM files are considered unknown-type files.
To enable recognition and handling of DICOM files,
remove the # from the beginning of the "#handleDicomFiles=1" line
in
/opt/openkaz/config/parser.config:
# If set (to 1 or true), then DICOM file
handling is enabled. If
# not set, files will not be checked for DICOM
filetype, nor will
# DICOM metadata be extracted (they will be
treated as UNKNOWN).
#handleDicomFiles=1
Do this on each node of the cluster. Normally, selecting what
attributes are processed is done using extraction rules, but since
initial file recognition is done during basic classification (when
extraction rules do not apply), DICOM file recognition must be
controlled through configuration files.
124
IS1200 Metadata Reference Guide— v4.8.0
Selecting the DICOM Properties to Extract
Once recognition is enabled, do the following to select the DICOM
properities to extract.
Then, do the following to setup DICOM file attribute classification:
Edit the file /opt/openkaz/config/parser.config to set
skipProperities=FALSE.
Edit the file /opt/openkaz/config/parser.config on
each node to uncomment and set:
DicomProperties=<list_of_DicomProperties_to_cl
assify>
The list should be separated by commas, for example:
DicomProperties=DICOMPatientsName,DICOMPatient
ID,DICOMStudyDate
to collect only the three Dicom properties in the list above.
Edit the Search Index to add the desired Dicom attributes, see the
following section for details.
Adding DICOM File Attributes to the Search Schema
The IS1200 can extract most of the standard DICOM file attributes as
metadata. To make these attributes searchable, each attribute you
want to search by must be added to the Search Index, and then a deep
classification must be run. For example, to be able to search by the
DICOM tag DICOMTableTopEccentricAxisDistance, enter
the following in the CLI:
set schema DICOMTable.TopEccentricAxisDistance
attributes string,indexed
and then run a deep classification.
After the classification finishes, the tag
DICOMTableTopEccentricAxisDistance will be available
in the standard search index.
125
DICOM Properties That Can Be Extracted As Extended Attributes
The following is a list of all DICOM tags that can be added to the search index:
126
DICOMAccessionNumber
DICOMAffectedSOPClassUID
DICOMAcquisitionContextDescription
DICOMAffectedSOPInstanceUID
DICOMAcquisitionContextSequence
DICOMAirKermaRateReferenceDate
DICOMAcquisitionDate
DICOMAirKermaRateReferenceTime
DICOMAcquisitionDeviceProcessingCode
DICOMAnatomicRegionModifierSequence
DICOMAcquisitionDeviceProcessingDescription
DICOMAnatomicRegionSequence
DICOMAcquisitionGroupLength
DICOMAnatomicStructureSpaceorRegionSequence
DICOMAcquisitionMatrix
DICOMAngioFlag
DICOMAcquisitionNumber
DICOMAngularPosition
DICOMAcquisitionsinStudy
DICOMAngularStep
DICOMAcquisitionStartCondition
DICOMAngularViewVector
DICOMAcquisitionStartConditionData
DICOMAnnotationContentSequence
DICOMAcquisitionTerminationCondition
DICOMAnnotationDisplayFormatID
DICOMAcquisitionTerminationConditionData
DICOMAnnotationFlag
DICOMAcquisitionTime
DICOMAnnotationPosition
DICOMActionTypeID
DICOMAnodeTargetMaterial
DICOMActiveSourceDiameter
DICOMApplicableFrameRange
DICOMActiveSourceLength
DICOMApplicationSetupManufacturer
DICOMActualFrameDuration
DICOMApplicationSetupName
DICOMActualHumanPerformersSequence
DICOMApplicationSetupNumber
DICOMAdditionalDrugSequence
DICOMApplicationSetupSequence
DICOMAdditionalPatientHistory
DICOMApplicationSetupType
DICOMAdministrationRouteCodeSequence
DICOMApplicatorDescription
DICOMAdmissionID
DICOMApplicatorID
DICOMAdmittingDate
DICOMApplicatorSequence
DICOMAdmittingDiagnosesDescription
DICOMApplicatorType
DICOMAdmittingDiagnosisCodeSequence
DICOMApprovalStatus
DICOMAdmittingTime
DICOMAttenuationCorrectionMethod
IS1200 Metadata Reference Guide— v4.8.0
DICOMAttributeIdentifierList
DICOMBitsStored
DICOMAudioComments
DICOMBlockData
DICOMAudioSampleData
DICOMBlockDivergence
DICOMAudioSampleFormat
DICOMBlockName
DICOMAudioType
DICOMBlockNumber
DICOMAveragePulseWidth
DICOMBlockNumberofPoints
DICOMAxialAcceptance
DICOMBlockSequence
DICOMAxialMash
DICOMBlockThickness
DICOMAxisLabels
DICOMBlockTransmission
DICOMAxisUnits
DICOMBlockTrayID
DICOMBasicColorImageSequence
DICOMBlockType
DICOMBasicGrayscaleImageSequence
DICOMBluePaletteColorLookupTableData
DICOMBeamDescription
DICOMBluePaletteColorLookupTableDescriptor
DICOMBeamDose
DICOMBodyPartExamined
DICOMBeamDoseSpecificationPoint
DICOMBodyPartThickness
DICOMBeamLimitingDeviceAngle
DICOMBorderDensity
DICOMBeamLimitingDeviceAngleTolerance
DICOMBrachyAccessoryDeviceID
DICOMBeamLimitingDevicePositionSequence
DICOMBrachyAccessoryDeviceName
DICOMBeamLimitingDevicePositionTolerance
DICOMBrachyAccessoryDeviceNominalThickness
DICOMBeamLimitingDeviceRotationDirection
DICOMBrachyAccessoryDeviceNominalTransmissi
on
DICOMBeamLimitingDeviceSequence
DICOMBeamLimitingDeviceToleranceSequence
DICOMBeamMeterset
DICOMBeamName
DICOMBeamNumber
DICOMBeamSequence
DICOMBeamType
DICOMBeatRejectionFlag
DICOMBillingItemSequence
DICOMBillingProcedureStepSequence
DICOMBillingSuppliesandDevicesSequence
DICOMBiPlaneAcquisitionSequence
DICOMBrachyAccessoryDeviceNumber
DICOMBrachyAccessoryDeviceSequence
DICOMBrachyAccessoryDeviceType
DICOMBrachyApplicationSetupDose
DICOMBrachyApplicationSetupDoseSpecificationP
oint
DICOMBrachyControlPointSequence
DICOMBrachyReferencedDoseReferenceSequence
DICOMBrachyTreatmentTechnique
DICOMBrachyTreatmentType
DICOMBranchofService
DICOMBurnedInAnnotation
DICOMBitsAllocated
127
DICOMCalibrationDataSequence
DICOMColumns
DICOMCalibrationImage
DICOMCommandField
DICOMCardiacNumberofImages
DICOMCommentsonRadiationDose
DICOMCassetteOrientation
DICOMCommentsonthePerformedProcedureSteps
DICOMCassetteSize
DICOMCommentsontheScheduledProcedureStep
DICOMCenterofCircularCollimator
DICOMCompensatorColumns
DICOMCenterofCircularShutter
DICOMCompensatorID
DICOMCenterofRotationOffset
DICOMCompensatorNumber
DICOMChannelLength
DICOMCompensatorPixelSpacing
DICOMChannelNumber
DICOMCompensatorPosition
DICOMChannelSequence
DICOMCompensatorRows
DICOMChannelShieldID
DICOMCompensatorSequence
DICOMChannelShieldName
DICOMCompensatorThicknessData
DICOMChannelShieldNominalThickness
DICOMCompensatorTransmissionData
DICOMChannelShieldNominalTransmission
DICOMCompletionFlag
DICOMChannelShieldNumber
DICOMCompletionFlagDescription
DICOMChannelShieldSequence
DICOMCompressionForce
DICOMChannelTotalTime
DICOMConceptCodeSequence
DICOMCineRate
DICOMConceptNameCodeSequence
DICOMCodeMeaning
DICOMConfidentialityCode
DICOMCodeValue
DICOMConfidentialityConstraintonPatientDataDes
cription
DICOMCodingSchemeDesignator
DICOMCoincidenceWindowWidth
DICOMCollationFlag
DICOMCollimatorgridName
DICOMCollimatorLeftVerticalEdge
DICOMCollimatorLowerHorizontalEdge
DICOMCollimatorRightVerticalEdge
DICOMCollimatorShape
DICOMCollimatorType
DICOMCollimatorUpperHorizontalEdge
DICOMColorImagePrintingFlag
DICOMColumnAngulation
128
IS1200 Metadata Reference Guide— v4.8.0
DICOMConfigurationInformation
DICOMConstraintWeight
DICOMContentSequence
DICOMContentTemplateSequence
DICOMContinuityOfContent
DICOMContourData
DICOMContourGeometricType
DICOMContourImageSequence
DICOMContourOffsetVector
DICOMContourSequence
DICOMContourSlabThickness
DICOMContrastAllergies
DICOMCumulativeMetersetWeight
DICOMContrastBolusAdministrationRouteSequenc
e
DICOMCumulativeTimeWeight
DICOMContrastBolusAgent
DICOMCurrentPatientLocation
DICOMContrastBolusAgentSequence
DICOMCurrentRequestedProcedureEvidenceSeque
nce
DICOMContrastBolusIngredient
DICOMCurveDataDescriptor
DICOMContrastBolusIngredientConcentration
DICOMCurveDataOW
DICOMContrastBolusRoute
DICOMCurveDate
DICOMContrastBolusStartTime
DICOMCurveDescription
DICOMContrastBolusStopTime
DICOMCurveDimensions
DICOMContrastBolusTotalDose
DICOMCurveLabel
DICOMContrastBolusVolume
DICOMCurveNumber
DICOMContrastFlowDurations
DICOMCurveRange
DICOMContrastFlowRates
DICOMCurveTime
DICOMContrastFrameAveraging
DICOMDataCollectionDiameter
DICOMControlPoint3DPosition
DICOMDataInformationSequence
DICOMControlPointIndex
DICOMDataSetTrailingPadding
DICOMControlPointRelativePosition
DICOMDataSetType
DICOMControlPointSequence
DICOMDataValueRepresentation
DICOMConversionType
DICOMDate
DICOMConvolutionKernel
DICOMDateofLastCalibration
DICOMCoordinateStartValue
DICOMDateOfLastDetectorCalibration
DICOMCoordinateStepValue
DICOMDateofSecondaryCapture
DICOMCorrectedImage
DICOMDateTime
DICOMCountRate
DICOMdBdt
DICOMCountryofResidence
DICOMDeadTimeCorrectionFlag
DICOMCountsAccumulated
DICOMDeadTimeFactor
DICOMCountsIncluded
DICOMDecayCorrection
DICOMCountsSource
DICOMDecayFactor
DICOMCranialThermalIndex
DICOMDeliveryMaximumDose
DICOMCreationDate
DICOMDeliveryWarningDose
DICOMCreationTime
DICOMDepthofScanField
DICOMCumulativeDoseReferenceCoefficient
DICOMDerivationDescription
129
130
DICOMDestinationAE
DICOMDischargeDate
DICOMDetectorActivationOffsetFromExposure
DICOMDischargeDiagnosisCodeSequence
DICOMDetectorActiveDimensions
DICOMDischargeDiagnosisDescription
DICOMDetectorActiveOringin
DICOMDischargeTime
DICOMDetectorActiveShape
DICOMDistanceSourcetoDetector
DICOMDetectorActiveTime
DICOMDistanceSourcetoEntrance
DICOMDetectorBinning
DICOMDistanceSourcetoPatient
DICOMDetectorConditionsNominalFlag
DICOMDistanceSourcetoSupport
DICOMDetectorConfiguration
DICOMDistributionAddress
DICOMDetectorDescription
DICOMDistributionName
DICOMDetectorElementPhysicalSize
DICOMDopplerCorrectionAngle
DICOMDetectorElementSize
DICOMDopplerSampleVolumeXPosition
DICOMDetectorElementSpacing
DICOMDopplerSampleVolumeYPosition
DICOMDetectorID
DICOMDoseCalibrationFactor
DICOMDetectorInformationSequence
DICOMDoseComment
DICOMDetectorLinesofResponseUsed
DICOMDoseGridScaling
DICOMDetectorMode
DICOMDoseRateSet
DICOMDetectorPrimaryAngle
DICOMDoseReferenceDescription
DICOMDetectorSecondaryAngle
DICOMDoseReferenceNumber
DICOMDetectorTemperature
DICOMDoseReferencePointCoordinates
DICOMDetectorTimeSinceLastExposure
DICOMDoseReferenceSequence
DICOMDetectorType
DICOMDoseReferenceStructureType
DICOMDetectorVector
DICOMDoseReferenceType
DICOMDeviceDescription
DICOMDoseSummationType
DICOMDeviceDiameter
DICOMDoseType
DICOMDeviceDiameterUnits
DICOMDoseUnits
DICOMDeviceLength
DICOMDoseValue
DICOMDeviceSequence
DICOMDVHData
DICOMDeviceSerialNumber
DICOMDVHDoseScaling
DICOMDeviceVolume
DICOMDVHMaximumDose
DICOMDigitalImageFormatAcquired
DICOMDVHMeanDose
DICOMDirectoryRecordSequence
DICOMDVHMinimumDose
DICOMDirectoryRecordType
DICOMDVHNormalizationDoseValue
IS1200 Metadata Reference Guide— v4.8.0
DICOMDVHNormalizationPoint
DICOMExposureControlModeDescription
DICOMDVHNumberofBins
DICOMExposureinuAs
DICOMDVHReferencedROISequence
DICOMExposureSequence
DICOMDVHROIContributionType
DICOMExposuresOnDetectorSinceLastCalibration
DICOMDVHSequence
DICOMExposuresOnDetectorSinceManufactured
DICOMDVHType
DICOMExposuresonPlate
DICOMDVHVolumeUnits
DICOMExposureStatus
DICOMEchoNumbers
DICOMExposureTime
DICOMEchoTime
DICOMFailedSOPInstanceUIDList
DICOMEchoTrainLength
DICOMFailedSOPSequence
DICOMEffectiveSeriesDuration
DICOMFailureReason
DICOMEmptyImageDensity
DICOMFieldofViewDimensions
DICOMEndCumulativeMetersetWeight
DICOMFieldOfViewHorizontalFlip
DICOMEnergyWindowInformationSequence
DICOMFieldOfViewOrigin
DICOMEnergyWindowLowerLimit
DICOMFieldOfViewRotation
DICOMEnergyWindowName
DICOMFieldofViewShape
DICOMEnergyWindowNumber
DICOMFileInformationGroupLength
DICOMEnergyWindowRangeSequence
DICOMFileMetaInformationVersion
DICOMEnergyWindowUpperLimit
DICOMFilesetConsistencyFlag
DICOMEnergyWindowVector
DICOMFilesetDescriptorFileID
DICOMEntranceDose
DICOMFilesetID
DICOMErrorComment
DICOMFillerOrderNumberORImagingServiceRequ
est
DICOMErrorID
DICOMEstimatedRadiographicMagnificationFactor
DICOMEthnicGroup
DICOMEventElapsedTimes
DICOMEventTimerNames
DICOMEventTypeID
DICOMExecutionStatus
DICOMExecutionStatusInfo
DICOMExposedArea
DICOMExposure
DICOMFillerOrderNumberORImagingServiceRequ
est
DICOMFillerOrderNumberProcedure
DICOMFilmBoxContentSequence
DICOMFilmConsumptionSequence
DICOMFilmDestination
DICOMFilmOrientation
DICOMFilmSessionLabel
DICOMFilmSizeID
DICOMFilterMaterial
DICOMExposureControlMode
131
132
DICOMFilterThicknessMaximum
DICOMGantryDetectorSlew
DICOMFilterThicknessMinimum
DICOMGantryDetectorTilt
DICOMFilterType
DICOMGantryRotationDirection
DICOMFinalCumulativeMetersetWeight
DICOMGatedInformationSequence
DICOMFinalCumulativeTimeWeight
DICOMGeneratorPower
DICOMFixationDeviceDescription
DICOMGraphicData
DICOMFixationDeviceLabel
DICOMGraphicType
DICOMFixationDevicePosition
DICOMGreenPaletteColorLookupTableData
DICOMFixationDeviceSequence
DICOMGreenPaletteColorLookupTableDescriptor
DICOMFixationDeviceType
DICOMGrid
DICOMFlipAngle
DICOMGridAbsorbingMaterial
DICOMFocalDistance
DICOMGridAspectRatio
DICOMFocalSpots
DICOMGridFocalDistance
DICOMFocusDepth
DICOMGridFrameOffsetVector
DICOMFractionGroupNumber
DICOMGridPeriod
DICOMFractionGroupSequence
DICOMGridPitch
DICOMFractionNumber
DICOMGridSpacingMaterial
DICOMFractionPattern
DICOMGridThickness
DICOMFrameDelay
DICOMHalfValueLayer
DICOMFrameIncrementPointer
DICOMHardcopyCreationDeviceID
DICOMFrameNumbersofInterest
DICOMHardcopyDeviceManufacturer
DICOMFrameofReferenceRelationshipSequence
DICOMHardcopyDeviceSoftwareVersion
DICOMFrameofReferenceTransformationComment
DICOMHardcopyDevManufactModelName
DICOMFrameofReferenceTransformationMatrix
DICOMHeartRate
DICOMFrameofReferenceTransformationType
DICOMHighBit
DICOMFrameofReferenceUID
DICOMHighRRValue
DICOMFrameReferenceTime
DICOMHistogramBinWidth
DICOMFramesofInterestDescription
DICOMHistogramData
DICOMFrameTime
DICOMHistogramExplanation
DICOMFrameTimeVector
DICOMHistogramFirstBinValue
DICOMFramingType
DICOMHistogramLastBinValue
DICOMGantryAngle
DICOMHistogramNumberOfBins
DICOMGantryAngleTolerance
DICOMHistogramSequence
IS1200 Metadata Reference Guide— v4.8.0
DICOMHumanPerformerCodeSequence
DICOMImagingFrequency
DICOMHumanPerformersName
DICOMImagingServiceRequestComments
DICOMHumanPerformersOrganization
DICOMImplantPresent
DICOMIconImageSequence
DICOMImplementationClassUID
DICOMIdenticalDocumentsSequence
DICOMImplementationVersionName
DICOMIdentifyingGroupLength
DICOMImpressions
DICOMIllumination
DICOMInstanceCreationDate
DICOMImageAreaDoseProduct
DICOMInstanceCreationTime
DICOMImageBoxContentSequence
DICOMInstanceCreatorUID
DICOMImageBoxPresentationLUTFlag
DICOMInstanceNumber
DICOMImageComments
DICOMInstitutionAddress
DICOMImageContentDate
DICOMInstitutionalDepartmentName
DICOMImageContentTime
DICOMInstitutionCodeSequence
DICOMImageDisplayFormat
DICOMInstitutionName
DICOMImagedNucleus
DICOMIntensifierSize
DICOMImageFrameOrigin
DICOMIntermarkerDistance
DICOMImageID
DICOMInterpretationApprovalDate
DICOMImageIndex
DICOMInterpretationApprovalTime
DICOMImageLaterality
DICOMInterpretationApproverSequence
DICOMImageOrientationPatient
DICOMInterpretationAuthor
DICOMImageOverlayBoxContentSequence
DICOMInterpretationDiagnosisCodeSequence
DICOMImageOverlayFlag
DICOMInterpretationDiagnosisDescription
DICOMImagePlanePixelSpacing
DICOMInterpretationID
DICOMImagePosition
DICOMInterpretationIDIssuer
DICOMImagePositionPatient
DICOMInterpretationRecordedDate
DICOMImagePresentationGroupLength
DICOMInterpretationRecordedTime
DICOMImagerPixelSpacing
DICOMInterpretationRecorder
DICOMImagesinAcquisition
DICOMInterpretationStatusID
DICOMImageTransformationMatrix
DICOMInterpretationText
DICOMImageTranslationVector
DICOMInterpretationTranscriber
DICOMImageType
DICOMInterpretationTranscriptionDate
DICOMImagingDeviceSpecificAcquisitionParamete
rs
DICOMInterpretationTranscriptionTime
DICOMInterpretationTypeID
133
134
DICOMIntervalsAcquired
DICOMManufacturer
DICOMIntervalsRejected
DICOMManufacturersModelName
DICOMInterventionalStatus
DICOMMaskFrameNumbers
DICOMInterventionalTherapySequence
DICOMMaskOperation
DICOMInterventionDrugCodeSequence
DICOMMaskOperationExplanation
DICOMInterventionDrugDose
DICOMMaskPointers
DICOMInterventionDrugInformationSequence
DICOMMaskSubpixelShift
DICOMInterventionDrugName
DICOMMaskSubtractionSequence
DICOMInterventionDrugStartTime
DICOMMaterialID
DICOMInterventionDrugStopTime
DICOMMaxDensity
DICOMInversionTime
DICOMMaximumCoordinateValue
DICOMIsocenterPosition
DICOMMeasuredValueSequence
DICOMIssueDateofImagingServiceRequest
DICOMMeasurementUnitsCodeSequence
DICOMIssuerofAdmissionID
DICOMMeasuringUnitsSequence
DICOMIssuerofPatientID
DICOMMechanicalIndex
DICOMIssueTimeofImagingServiceRequest
DICOMMediaStorageSOPClassUID
DICOMKVP
DICOMMediaStorageSOPInstanceUID
DICOMLargestImagePixelValue
DICOMMedicalAlerts
DICOMLargestImagePixelValueinPlane
DICOMMedicalRecordLocator
DICOMLargestPixelValueinSeries
DICOMMediumType
DICOMLastMenstrualDate
DICOMMemoryAllocation
DICOMLaterality
DICOMMessageID
DICOMLeafJawPositions
DICOMMessageIDBeingRespondedTo
DICOMLeafPositionBoundaries
DICOMMetersetExposure
DICOMLookupTableNumber
DICOMMilitaryRank
DICOMLossyImageCompression
DICOMMinDensity
DICOMLossyImageCompressionRatio
DICOMMinimumCoordinateValue
DICOMLowRRValue
DICOMModalitiesinStudy
DICOMLUTDataUSor
DICOMModality
DICOMLUTDescriptor
DICOMModalityLUTSequence
DICOMLUTExplanation
DICOMModalityLUTType
DICOMMagneticFieldStrength
DICOMMoveDestination
DICOMMagnificationType
DICOMMoveOriginatorApplicationEntityTitle
IS1200 Metadata Reference Guide— v4.8.0
DICOMMoveOriginatorMessageID
DICOMNumberofPatientRelatedSeries
DICOMMRAcquisitionType
DICOMNumberofPatientRelatedStudies
DICOMMRDRDirectoryRecordOffset
DICOMNumberofPhaseEncodingSteps
DICOMNameofPhysiciansReadingStudy
DICOMNumberofPhases
DICOMNamesofIntendedRecipientsofResults
DICOMNumberofPoints
DICOMNominalBeamEnergy
DICOMNumberofPulses
DICOMNominalInterval
DICOMNumberofReferences
DICOMNominalPriorDose
DICOMNumberofRemainingSuboperations
DICOMNormalizationPoint
DICOMNumberofRotations
DICOMNumberofAverages
DICOMNumberofRRIntervals
DICOMNumberofBeams
DICOMNumberofSamples
DICOMNumberofBlocks
DICOMNumberofSeriesRelatedImages
DICOMNumberofBoli
DICOMNumberofSlices
DICOMNumberofBrachyApplicationSetups
DICOMNumberofStages
DICOMNumberofChannels
DICOMNumberofStudyRelatedImages
DICOMNumberofCompensators
DICOMNumberofStudyRelatedSeries
DICOMNumberofCompletedSuboperations
DICOMNumberofTableBreakPoints
DICOMNumberofContourPoints
DICOMNumberofTableEntries
DICOMNumberofControlPoints
DICOMNumberofTemporalPositions
DICOMNumberofCopies
DICOMNumberofTimeSlices
DICOMNumberofDetectors
DICOMNumberofTimeSlots
DICOMNumberofEnergyWindows
DICOMNumberOfTomoSynthesisSourceImages
DICOMNumberofEventTimers
DICOMNumberofTriggersinPhase
DICOMNumberofFailedSuboperations
DICOMNumberofViewsinStage
DICOMNumberofFilms
DICOMNumberofWarningSuboperations
DICOMNumberofFractionsPerDay
DICOMNumberofWedges
DICOMNumberofFractionsPlanned
DICOMNumericValue
DICOMNumberofFrames
DICOMObservationDateTime
DICOMNumberofFramesinOverlay
DICOMObservationNumber
DICOMNumberofFramesinPhase
DICOMOccupation
DICOMNumberofFramesinRotation
DICOMOffendingElement
DICOMNumberofLeafJawPairs
DICOMOffsetofReferencedLowerLevelDirectoryEnt
ity
DICOMNumberofPatientRelatedImages
135
DICOMOffsetoftheFirstDirectoryRecordoftheRootD
irectoryEntity
DICOMOffsetoftheLastDirectoryRecordoftheRootD
irectoryEntity
DICOMOffsetoftheNextDirectoryRecord
DICOMOperatorsName
DICOMOrderCallbackPhoneNumber
DICOMOrderEnteredBy
DICOMOrderEnterersLocation
DICOMOrganatRiskFullvolumeDose
DICOMOrganatRiskLimitDose
DICOMOrganatRiskMaximumDose
DICOMOrganatRiskOverdoseVolumeFraction
DICOMOrganDose
DICOMOrganExposed
DICOMOriginalImageSequence
DICOMOriginator
DICOMOtherPatientIDs
DICOMOtherPatientNames
DICOMOtherStudyNumbers
DICOMOutputPower
DICOMOverlayBitPosition
DICOMOverlayBitsAllocated
DICOMOverlayColumns
DICOMOverlayData
DICOMOverlayDate
DICOMOverlayDescription
DICOMOverlayForegroundDensity
DICOMOverlayLabel
DICOMOverlayMagnificationType
DICOMOverlayMode
DICOMOverlayNumber
DICOMOverlayOrigin
136
IS1200 Metadata Reference Guide— v4.8.0
DICOMOverlayPlaneOrigin
DICOMOverlayPlanes
DICOMOverlayRows
DICOMOverlaySmoothingType
DICOMOverlaySubtype
DICOMOverlayTime
DICOMOverlayType
DICOMOwnerID
DICOMPaletteColorLookupTableUID
DICOMPatientAdditionalPosition
DICOMPatientComments
DICOMPatientGantryRelationshipCodeSequence
DICOMPatientGroupLength
DICOMPatientID
DICOMPatientOrientation
DICOMPatientOrientationCodeSequence
DICOMPatientOrientationModifierCodeSequence
DICOMPatientOtherEvidenceSequence
DICOMPatientPosition
DICOMPatientsAddress
DICOMPatientsAge
DICOMPatientsBirthDate
DICOMPatientsBirthName
DICOMPatientsBirthTime
DICOMPatientSetupNumber
DICOMPatientSetupSequence
DICOMPatientsInstitutionResidence
DICOMPatientsInsurancePlanCodeSequence
DICOMPatientsMothersBirthName
DICOMPatientsName
DICOMPatientsPrimaryLanguageCodeSequence
DICOMPatientsPrimaryLanguageModifierCodeSeq
uence
DICOMPatientsReligiousPreference
DICOMPersonName
DICOMPatientsSex
DICOMPhaseDelay
DICOMPatientsSize
DICOMPhaseEncodingDirection
DICOMPatientState
DICOMPhaseInformationSequence
DICOMPatientsTelephoneNumbers
DICOMPhaseVector
DICOMPatientSupportAngle
DICOMPhosphorType
DICOMPatientSupportAngleTolerance
DICOMPhotometricInterpretation
DICOMPatientSupportRotationDirection
DICOMPhototimerSetting
DICOMPatientsWeight
DICOMPhysicalDeltaX
DICOMPatientTransportArrangements
DICOMPhysicalDeltaY
DICOMPauseBetweenFrames
DICOMPhysicalUnitsXDirection
DICOMPercentPhaseFieldofView
DICOMPhysicalUnitsYDirection
DICOMPercentSampling
DICOMPhysicianApprovingInterpretation
DICOMPerformedActionItemSequence
DICOMPhysiciansofRecord
DICOMPerformedLocation
DICOMPixelAspectRatio
DICOMPerformedProcedureStepDescription
DICOMPixelBandwidth
DICOMPerformedProcedureStepEndDate
DICOMPixelComponentDataType
DICOMPerformedProcedureStepEndTime
DICOMPixelComponentMask
DICOMPerformedProcedureStepID
DICOMPixelComponentOrganization
DICOMPerformedProcedureStepStartDate
DICOMPixelComponentPhysicalUnits
DICOMPerformedProcedureStepStartTime
DICOMPixelComponentRangeStart
DICOMPerformedProcedureStepStatus
DICOMPixelComponentRangeStop
DICOMPerformedProcedureTypeDescription
DICOMPixelData
DICOMPerformedProcessingApplicationsCodeSeq
uence
DICOMPixelDataGroupLength
DICOMPerformedSeriesSequence
DICOMPerformedStationAETitle
DICOMPerformedstationGeographicLocationCode
sequence
DICOMPerformedStationName
DICOMPerformedStationNameCodeSequence
DICOMPerformingPhysiciansName
DICOMPerformProcedureCodeSequence
DICOMPixelIntensityRelationship
DICOMPixelIntensityRelationshipSign
DICOMPixelPaddingValue
DICOMPixelRepresentation
DICOMPixelSpacing
DICOMPlacerOrderNumberORImagingServiceReq
uest
DICOMPlacerOrderNumberORImagingServiceReq
uest
137
DICOMPlacerOrderNumberProcedure
DICOMPrinterStatusInfo
DICOMPlanarConfiguration
DICOMPrintJobDescriptionSequence
DICOMPlanes
DICOMPrintJobID
DICOMPlannedVerificationImageSequence
DICOMPrintManagementCapabilitiesSequence
DICOMPlateID
DICOMPrintPriority
DICOMPlateType
DICOMPrintQueueID
DICOMPolarity
DICOMPriority
DICOMPositionerMotion
DICOMPrivateInformation
DICOMPositionerPrimaryAngle
DICOMPrivateInformationCreatorUID
DICOMPositionerPrimaryAngleIncrement
DICOMPrivateRecordUID
DICOMPositionerSecondaryAngle
DICOMPrivateSchemeCreatorUID
DICOMPositionerSecondaryAngleIncrement
DICOMProcedureCodeSequence
DICOMPositionerType
DICOMProcessingFunction
DICOMPositionReferenceIndicator
DICOMProjectionEponymousNameCodeSequence
DICOMPostprocessingFunction
DICOMProposedStudySequence
DICOMPredecessorDocumentsSequence
DICOMProtocolName
DICOMPreferredPlaybackSequencing
DICOMPulseRepetitionFrequency
DICOMPregnancyStatus
DICOMPulseRepetitionInterval
DICOMPreMedication
DICOMPVCRejection
DICOMPrescriptionDescription
DICOMQualityControlImage
DICOMPresentationIntentType
DICOMQuantity
DICOMPresentationLUTContentSequence
DICOMQuantitySequence
DICOMPresentationLUTFlag
DICOMQueryRetrieveLevel
DICOMPresentationLUTSequence
DICOMQueueStatus
DICOMPresentationLUTShape
DICOMRadialPosition
DICOMPrimaryAnatomicStructureModifierSequen
ce
DICOMRadiationMachineName
DICOMPrimaryAnatomicStructureSequence
DICOMPrimaryDosimeterUnit
DICOMPrimaryPromptsCountsAccumulated
DICOMPrinterCharacteristicsSequence
DICOMPrinterName
DICOMPrinterStatus
138
IS1200 Metadata Reference Guide— v4.8.0
DICOMRadiationMachineSAD
DICOMRadiationMachineSSD
DICOMRadiationMode
DICOMRadiationSetting
DICOMRadiationType
DICOMRadionuclideCodeSequence
DICOMRadionuclideHalfLife
DICOMRadionuclidePositronFraction
DICOMRadionuclideTotalDose
DICOMRadiopharmaceutical
DICOMRadiopharmaceuticalCodeSequence
DICOMRadiopharmaceuticalInformationSequence
DICOMRadiopharmaceuticalRoute
DICOMRadiopharmaceuticalSpecificActivity
DICOMRadiopharmaceuticalStartTime
DICOMRadiopharmaceuticalStopTime
DICOMRadiopharmaceuticalVolume
DICOMRadiusofCircularCollimator
DICOMRadiusofCircularShutter
DICOMRandomsCorrectionMethod
DICOMReasonforStudy
DICOMReasonfortheImagingServiceRequest
DICOMReasonfortheRequestedProcedure
DICOMReceivingCoil
DICOMRecognitionCode
DICOMRecommendedDisplayFrameRate
DICOMRecommendedViewingMode
DICOMReconstructionDiameter
DICOMReconstructionMethod
DICOMRecordInuseFlag
DICOMRectificationType
DICOMRedPaletteColorLookupTableData
DICOMRedPaletteColorLookupTableDescriptor
DICOMReferenceAirKermaRate
DICOMReferencedBasicAnnotationBoxSequence
DICOMReferencedBeamNumber
DICOMReferencedBeamSequence
DICOMReferencedBlockNumber
DICOMReferencedBolusSequence
DICOMReferencedBrachyApplicationSetupNumbe
r
DICOMReferencedBrachyApplicationSetupSequenc
e
DICOMReferencedCompensatorNumber
DICOMReferencedContentItemIdentifier
DICOMReferencedControlPointIndex
DICOMReferencedCurveSequence
DICOMReferencedDateTime
DICOMReferencedDoseReferenceNumber
DICOMReferencedDoseReferenceSequence
DICOMReferencedDoseSequence
DICOMReferencedFileID
DICOMReferencedFilmBoxSequence
DICOMReferencedFilmSessionSequence
DICOMReferencedFractionGroupNumber
DICOMReferencedFractionGroupSequence
DICOMReferencedFrameNumber
DICOMReferencedFrameNumbers
DICOMReferencedFrameofReferenceSequence
DICOMReferencedFrameofReferenceUID
DICOMReferencedImageBoxSequence
DICOMReferencedImageOverlayBoxSequence
DICOMReferencedImageSequence
DICOMReferencedInterpretationSequence
DICOMReferencedOverlayGroup
DICOMReferencedOverlayPlaneGroups
DICOMReferencedOverlayPlaneSequence
DICOMReferencedOverlaySequence
DICOMReferencedOverlaySequence
DICOMReferencedPatientAliasSequence
DICOMReferencedPatientSequence
DICOMReferencedPatientSetupNumber
139
DICOMReferencedPresentationLUTSequence
DICOMReferencePixelPhysicalValueY
DICOMReferencedPrintJobSequence
DICOMReferencePixelX0
DICOMReferencedProcedureStepSequence
DICOMReferencePixelY0
DICOMReferencedReferenceImageNumber
DICOMReferencetoRecordedSound
DICOMReferencedReferenceImageSequence
DICOMReferringPhysiciansAddress
DICOMReferencedRequestSequence
DICOMReferringPhysiciansName
DICOMReferencedResultsSequence
DICOMReferringPhysiciansTelephoneNumbers
DICOMReferencedROINumber
DICOMReflectedAmbientLight
DICOMReferencedRTPlanSequence
DICOMRegionDataType
DICOMReferencedSamplePositions
DICOMRegionFlags
DICOMReferencedSeriesSequence
DICOMRegionLocationMaxX1
DICOMReferencedSOPClassUID
DICOMRegionLocationMaxY1
DICOMReferencedSOPClassUIDinFile
DICOMRegionLocationMinX0
DICOMReferencedSOPInstanceUID
DICOMRegionLocationMinY0
DICOMReferencedSOPInstanceUIDinFile
DICOMRegionofResidence
DICOMReferencedSOPSequence
DICOMRegionSpatialFormat
DICOMReferencedSourceNumber
DICOMRelatedFrameofReferenceUID
DICOMReferencedStandaloneSOPInstanceSequenc
e
DICOMRelatedRTROIObservationsSequence
DICOMReferencedStoredPrintSequence
DICOMReferencedStructureSetSequence
DICOMReferencedStudyComponentSequence
DICOMReferencedStudySequence
DICOMReferencedTimeOffsets
DICOMReferencedToleranceTableNumber
DICOMReferencedTransferSyntaxUIDinFile
DICOMReferencedVerificationImageSequence
DICOMReferencedVisitSequence
DICOMReferencedVOILUTBoxSequence
DICOMReferencedWaveformChannels
DICOMReferencedWedgeNumber
DICOMReferenceImageNumber
DICOMReferencePixelPhysicalValueX
140
IS1200 Metadata Reference Guide— v4.8.0
DICOMRelationshipGroupLength
DICOMRelationshipType
DICOMRelativeXrayExposure
DICOMRepeatFractionCycleLength
DICOMRepetitionTime
DICOMReportedValuesOrigin
DICOMReportingPriority
DICOMRepresentativeFrameNumber
DICOMReprojectionMethod
DICOMRequestAttributesSequence
DICOMRequestedContrastAgent
DICOMRequestedImageSize
DICOMRequestedProcedureCodeSequence
DICOMRequestedProcedureCodeSequence
DICOMRequestedProcedureComments
DICOMRequestedProcedureDescription
DICOMROIPhysicalPropertiesSequence
DICOMRequestedProcedureDescription
DICOMROIPhysicalProperty
DICOMRequestedProcedureID
DICOMROIPhysicalPropertyValue
DICOMRequestedProcedureLocation
DICOMROIStandardDeviation
DICOMRequestedProcedurePriority
DICOMROIVolume
DICOMRequestedSOPClassUID
DICOMRotationDirection
DICOMRequestedSOPInstanceUID
DICOMRotationInformationSequence
DICOMRequestingPhysician
DICOMRotationVector
DICOMRequestingService
DICOMRouteofAdmissions
DICOMRescaleIntercept
DICOMRows
DICOMRescaleSlope
DICOMRRIntervalVector
DICOMRescaleType
DICOMRTBeamLimitingDeviceType
DICOMResidualSyringeCounts
DICOMRTDoseROISequence
DICOMResultsComments
DICOMRTImageDescription
DICOMResultsDistributionListSequence
DICOMRTImageLabel
DICOMResultsID
DICOMRTImageName
DICOMResultsIDIssuer
DICOMRTImageOrientation
DICOMRetrieveAETitle
DICOMRTImagePlane
DICOMReviewDate
DICOMRTImagePosition
DICOMReviewerName
DICOMRTImageSID
DICOMReviewTime
DICOMRTPlanDate
DICOMROIArea
DICOMRTPlanDescription
DICOMROIContourSequence
DICOMRTPlanGeometry
DICOMROIDescription
DICOMRTPlanLabel
DICOMROIDisplayColor
DICOMRTPlanName
DICOMROIGenerationAlgorithm
DICOMRTPlanRelationship
DICOMROIGenerationDescription
DICOMRTPlanTime
DICOMROIInterpreter
DICOMRTReferencedSeriesSequence
DICOMROIMean
DICOMRTReferencedStudySequence
DICOMROIName
DICOMRTRelatedROISequence
DICOMROINumber
DICOMRTROIIdentificationCodeSequence
DICOMROIObservationDescription
DICOMRTROIInterpretedType
DICOMROIObservationLabel
DICOMRTROIObservationsSequence
141
DICOMRTROIRelationship
DICOMScheduledStudyStartDate
DICOMRWavePointer
DICOMScheduledStudyStartTime
DICOMSampleRate
DICOMScheduledStudyStopDate
DICOMSamplesperPixel
DICOMScheduledStudyStopTime
DICOMSAR
DICOMSecondaryCaptureDeviceID
DICOMScanArc
DICOMSecondaryCaptureDeviceManufacturer
DICOMScanLength
DICOMSecondaryCaptureDeviceManufacturersMo
delName
DICOMScanningSequence
DICOMScanOptions
DICOMScanVelocity
DICOMScatterCorrectionMethod
DICOMScatterFractionFactor
DICOMScheduledActionItemCodeSequence
DICOMScheduledAdmissionDate
DICOMScheduledAdmissionTime
142
DICOMSecondaryCaptureDeviceSoftwareVersions
DICOMSecondaryCountsAccumulated
DICOMSecondaryCountsType
DICOMSegmentedBluePaletteColorLookupTableDa
ta
DICOMSegmentedGreenPaletteColorLookupTable
Data
DICOMScheduledDischargeDate
DICOMSegmentedRedPaletteColorLookupTableDa
ta
DICOMScheduledDischargeTime
DICOMSensitivity
DICOMScheduledPatientInstitutionResidence
DICOMSequenceName
DICOMScheduledPerformingPhysiciansName
DICOMSequenceofUltrasoundRegions
DICOMScheduledProcedureStepDescription
DICOMSequenceVariant
DICOMScheduledProcedureStepEndDate
DICOMSeriesDate
DICOMScheduledProcedureStepEndTime
DICOMSeriesDescription
DICOMScheduledProcedureStepID
DICOMSeriesInstanceUID
DICOMScheduledProcedureStepLocation
DICOMSeriesinStudy
DICOMScheduledProcedureStepSequence
DICOMSeriesNumber
DICOMScheduledProcedureStepStartDate
DICOMSeriesTime
DICOMScheduledProcedureStepStartTime
DICOMSeriesType
DICOMScheduledProcedureStepStatus
DICOMSetupDeviceDescription
DICOMScheduledStationAETitle
DICOMSetupDeviceLabel
DICOMScheduledStationName
DICOMSetupDeviceParameter
DICOMScheduledStepAttributesSequence
DICOMSetupDeviceSequence
DICOMScheduledStudyLocation
DICOMSetupDeviceType
DICOMScheduledStudyLocationAETitles
DICOMSetupReferenceDescription
IS1200 Metadata Reference Guide— v4.8.0
DICOMSetupTechnique
DICOMSourceApplicatorName
DICOMSetupTechniqueDescription
DICOMSourceApplicatorNumber
DICOMShieldingDeviceDescription
DICOMSourceApplicatorStepSize
DICOMShieldingDeviceLabel
DICOMSourceApplicatorType
DICOMShieldingDevicePosition
DICOMSourceApplicatorWallNominalThickness
DICOMShieldingDeviceSequence
DICOMSourceApplicatorWallNominalTransmissio
n
DICOMShieldingDeviceType
DICOMShutterLeftVerticalEdge
DICOMShutterLowerHorizontalEdge
DICOMShutterRightVerticalEdge
DICOMShutterShape
DICOMShutterUpperHorizontalEdge
DICOMSkipBeats
DICOMSlantAngle
DICOMSliceLocation
DICOMSliceSensitivityFactor
DICOMSliceThickness
DICOMSliceVector
DICOMSmallestImagePixelValue
DICOMSmallestImagePixelValueinPlane
DICOMSmallestPixelValueinSeries
DICOMSmokingStatus
DICOMSmoothingType
DICOMSoftTissuefocusThermalIndex
DICOMSoftTissuesurfaceThermalIndex
DICOMSoftTissueThermalIndex
DICOMSoftwareVersion
DICOMSOPClassUID
DICOMSOPInstanceUID
DICOMSourceApplicationEntityTitle
DICOMSourceApplicatorID
DICOMSourceApplicatorLength
DICOMSourceApplicatorManufacturer
DICOMSourceAxisDistance
DICOMSourceEncapsulationNominalThickness
DICOMSourceEncapsulationNominalTransmission
DICOMSourceImageSequence
DICOMSourceIsotopeHalfLife
DICOMSourceIsotopeName
DICOMSourceManufacturer
DICOMSourceMovementType
DICOMSourceNumber
DICOMSourceSequence
DICOMSourcetoBeamLimitingDeviceDistance
DICOMSourcetoBlockTrayDistance
DICOMSourcetoCompensatorTrayDistance
DICOMSourcetoReferenceObjectDistance
DICOMSourcetoSurfaceDistance
DICOMSourcetoWedgeTrayDistance
DICOMSourceType
DICOMSpacingBetweenSlices
DICOMSpatialResolution
DICOMSpecialNeeds
DICOMSpecificCharacterSet
DICOMSpecificCharacterSetofFilesetDescriptorFile
DICOMStageName
DICOMStageNumber
DICOMStartAngle
DICOMStartCumulativeMetersetWeight
143
144
DICOMStartTrim
DICOMTableAngle
DICOMStationName
DICOMTableHeight
DICOMStatus
DICOMTableLateralIncrement
DICOMSteeringAngle
DICOMTableLongitudinalIncrement
DICOMStopTrim
DICOMTableMotion
DICOMStorageMediaFilesetID
DICOMTableofParameterValues
DICOMStorageMediaFilesetUID
DICOMTableofPixelValues
DICOMStructureSetDate
DICOMTableofXBreakPoints
DICOMStructureSetDescription
DICOMTableofYBreakPoints
DICOMStructureSetLabel
DICOMTableTopEccentricAngle
DICOMStructureSetName
DICOMTableTopEccentricAngleTolerance
DICOMStructureSetROISequence
DICOMTableTopEccentricAxisDistance
DICOMStructureSetTime
DICOMTableTopEccentricRotationDirection
DICOMStudyArrivalDate
DICOMTableTopLateralPosition
DICOMStudyArrivalTime
DICOMTableTopLateralPositionTolerance
DICOMStudyComments
DICOMTableTopLateralSetupDisplacement
DICOMStudyCompletionDate
DICOMTableTopLongitudinalPosition
DICOMStudyCompletionTime
DICOMTableTopLongitudinalPositionTolerance
DICOMStudyComponentStatusID
DICOMTableTopLongitudinalSetupDisplacement
DICOMStudyDate
DICOMTableTopVerticalPosition
DICOMStudyDescription
DICOMTableTopVerticalPositionTolerance
DICOMStudyID
DICOMTableTopVerticalSetupDisplacement
DICOMStudyIDIssuer
DICOMTableTraverse
DICOMStudyInstanceUID
DICOMTableType
DICOMStudyPriorityID
DICOMTableVerticalIncrement
DICOMStudyReadDate
DICOMTargetMaximumDose
DICOMStudyReadTime
DICOMTargetMinimumDose
DICOMStudyStatusID
DICOMTargetPrescriptionDose
DICOMStudyTime
DICOMTargetUnderdoseVolumeFraction
DICOMStudyVerifiedDate
DICOMTemplateExtensionCreatorUID
DICOMStudyVerifiedTime
DICOMTemplateExtensionFlag
DICOMSurfaceEntryPoint
DICOMTemplateExtensionOrganizationUID
DICOMSyringeCounts
DICOMTemplateIdentifier
IS1200 Metadata Reference Guide— v4.8.0
DICOMTemplateLocalVersion
DICOMTomoTime
DICOMTemplateName
DICOMTomoType
DICOMTemplateNumber
DICOMTopicAuthor
DICOMTemplateType
DICOMTopicKeyWords
DICOMTemplateVersion
DICOMTopicSubject
DICOMTemporalPositionIdentifier
DICOMTopicTitle
DICOMTemporalRangeType
DICOMTotalBlockTrayFactor
DICOMTemporalResolution
DICOMTotalCompensatorTrayFactor
DICOMTextString
DICOMTotalNumberofExposures
DICOMTextValue
DICOMTotalReferenceAirKerma
DICOMTherapyDescription
DICOMTotalTime
DICOMTherapyType
DICOMTotalTimeofFluoroscopy
DICOMThermalIndex
DICOMTransactionUID
DICOMThresholdDensity
DICOMTransducerData
DICOMTIDOffset
DICOMTransducerFrequency
DICOMTime
DICOMTransducerOrientationModifierSequence
DICOMTimeofLastCalibration
DICOMTransducerOrientationSequence
DICOMTimeOfLastDetectorCalibration
DICOMTransducerPositionModifierSequence
DICOMTimeofSecondaryCapture
DICOMTransducerPositionSequence
DICOMTimeSliceVector
DICOMTransducerType
DICOMTimeSlotInformationSequence
DICOMTransferSyntaxUID
DICOMTimeSlotTime
DICOMTransferTubeLength
DICOMTimeSlotVector
DICOMTransferTubeNumber
DICOMTMLinePositionX0
DICOMTransmittingCoil
DICOMTMLinePositionX1
DICOMTransverseMash
DICOMTMLinePositionY0
DICOMTreatmentDeliveryType
DICOMTMLinePositionY1
DICOMTreatmentIntent
DICOMToleranceTableLabel
DICOMTreatmentMachineName
DICOMToleranceTableNumber
DICOMTreatmentMachineSequence
DICOMToleranceTableSequence
DICOMTreatmentProtocols
DICOMTomoAngle
DICOMTreatmentSites
DICOMTomoClass
DICOMTriggerSourceorType
DICOMTomoLayerHeight
DICOMTriggerTime
145
DICOMTriggerVector
DICOMWedgePosition
DICOMTriggerWindow
DICOMWedgePositionSequence
DICOMTrim
DICOMWedgeSequence
DICOMTypeofData
DICOMWedgeType
DICOMTypeofDetectorMotion
DICOMWholeBodyTechnique
DICOMTypeofFilters
DICOMWindowCenter
DICOMUID
DICOMWindowCenterWidthExplanation
DICOMUltrasoundColorDataPresent
DICOMWindowWidth
DICOMUnits
DICOMXFocusCenter
DICOMValueType
DICOMXRayImageReceptorAngle
DICOMVariableFlipAngleFlag
DICOMXRayOutput
DICOMVerificationDateTime
DICOMXrayTubeCurrent
DICOMVerificationFlag
DICOMYFocusCenter
DICOMVerifyingObserverIdentificationCodeSeque
nce
DICOMZoomCenter
DICOMVerifyingObserverName
DICOMVerifyingObserverSequence
DICOMVerifyingOrganization
DICOMVerticesofthePolygonalCollimator
DICOMVerticesofthePolygonalShutter
DICOMVideoImageFormatAcquired
DICOMViewCodeSequence
DICOMViewModifierCodeSequence
DICOMViewNumber
DICOMViewPosition
DICOMVisitComments
DICOMVisitStatusID
DICOMVOILUTSequence
DICOMWedgeAngle
DICOMWedgeFactor
DICOMWedgeID
DICOMWedgeNumber
DICOMWedgeOrientation
146
IS1200 Metadata Reference Guide— v4.8.0
DICOMZoomFactor
Glossary
This glossary contains terms related to disk storage subsystems,
networks, file management, and eDiscovery. Many of these terms are
used in this manual.
A
active case
In eDiscovery situations, a company may have more than one legal
issue (case) in progress at a time. Often it is advantageous to limit job
or search scope to just one case. When the user interface scope is
limited to a particular single case, that case is the active case.
Active Directory (AD)
A technology created by Microsoft that provides a variety of network
services, including: LDAP-like directory services, Kerberos-based
authentication, and DNS-based naming and other network
information.
Actions,
Actionable Services
Access Control List
(ACL)
Services such as copy, move, delete, tagging, and so on, that can be
applied to search and report results and allow the IS1200 to be an
effective file management tool for registered repositories.
A file system level data file that specifies how users or groups may
access resources on a computer or network, like an application, file or
printer, and the rights they have to it, for example read access, write
access, and so forth. For more information on how the IS1200 may use
ACLs, see the Controlling ACL Checking section of the Configuration
Files and Utilities appendix of any IS1200 User Guide for details.
147
Glossary
Advanced Search
Agents
Assignment Rules
A search made from the IS1200 Advanced Search link. Allows
searching for extracted metadata by tag-value pairs, and allows
multiple variable and boolean searches.
See “connectors” on page 151.
An assignment rule is a type of classification rule. It tags files with
metadata and assigns files to policy groups. Assignment rules are
contained in Assignment Rule Sets (ASRs). See the Policies:
Classification, Extraction and Assignment Rules chapter of the any
IS1200 User Guide for more details.
Auditing
A service that allows the IS1200 to record all system events according
to who did what, when, and the event result. This data is especially
useful to Legal Service Providers when providing an audit trail for
responsive data produced during eDiscovery. Complete details are
available in the Auditing and Data Verification chapter of any IS1200
User Guide for details.
Authorization Rule
A policy rule that filters search results to ensure that the assigned files
can only be viewed by authorized users. IS1200 authorization policies
may be used to add additional levels of security to the Access Control
Lists (ACLs) for file objects found in registered data repositories. See
the Policy Groups: Authorization Policies chapter of any IS1200 User
Guide for more details.
Authentication
The process of identifying users based on user name and password to
ensure that only authorized users can access the IS1200.
B
Basic Search
A search made from the Search page using only the Search field.
Searches only the content found in the fullText field populated
during classifications.
C
CAS Device
CASID
148
EMC’s Content Addressed Storage (CAS) devices are cluster-able
archival devices that host archival business file content such as email,
office productivity files (like word processing and spreadsheet files),
images, and other file documents.
A unique IS1200 ID for each classified file that the system generates
during basic classification.
Glossary
Centera Server
The EMC Centera server is a networked storage system specifically
designed to store and provide fast, easy access to fixed content
(information in its final form). It is a CAS device providing long-term
retention and assured integrity designed to store and manage data
that require or have legally mandated retention periods, for example
medical records and files relevant to legal matters.
Celerra Server
An EMC server designed to store and manage archival data. The
Celerra File Level Retention (FLR) server also allows enforcing
enterprise or governmental retention policies.
checkpoints,
checkpointing
Checkpoints and checkpointing allow IS1200 jobs and services to
resume more efficiently if the job or service is paused or stopped
before it completes. Basically, the IS1200 records “bookmarks” about
what file or object was last processed. This allows the IS1200 to skip
to the bookmark—the checkpoint—when the job or service is
resumed, and avoid reprocessing all the files and objects already
processed.
However, checkpoints are not set for every file accessed, instead most
jobs divide file processing into “batches” and the checkpoints
indicate where batches started. Consequently, when a job restarts at a
checkpoint, some objects may be reprocessed again and—in cases
such as a 'Copy' service with 'enable-versioning' option
selected—duplicate versioned files will be created on the target
repository when those objects are reprocessed.
Classification Rule
Classification Service
Rules that the system implements during data classification to extract
metadata, tag files, and assign files to policy groups. The two types of
classification rules are extraction rules and assignment rules.
Sometimes called a “crawl”. An IS1200 service that accesses
job-specified registered repositories and extracts and records their
metadata to later facilitate comprehensive and cross-repository
searches. Classifications extract metadata according to extraction
rules, compute digests for all objects, and assigns files to policy
groups according to assignment rules. See “Assignment Rules” on
page 148, “Extraction Rules” on page 155, “Hash Values” on
page 156, and “Policy Groups” on page 160 for more details.
Classifications may be “full”, every object in the specified repositories
is parsed and its metadata repopulated in the indexes and databases,
or they may be “differential”, see “Differential Classifications” on
page 153 of more details.
149
Glossary
Cluster
CAS
Content Addressable
Storage
CSV
Comma Separated
Values
A set of IS1200 appliance nodes working as a unit. A cluster can
contain a maximum of four nodes. A cluster can be used to control
other clusters, see “Information Center Server” on page 157 for
details.
Rather than address data objects by a file name, at a physical location,
a CAS device uses a content address (hash-code identifiers) based on
file contents to store file objects in a flat file system that maximizes
storage efficiency. This returns a unique identifier (Content Address)
used to store and retrieve data objects.
A file type used to transfer data between applications such as
databases and spreadsheets.
CLI
Command Line
Interface
The CLI is a traditional command line interface that allows direct
communications with the IS1200 “backend” using a the set of
commands defined in the IS1200 Command Line Interface Reference
Guide.
Concepts Search
The standard IS1200 software supports keyword exploration.
However, in the initial stages of the legal discovery process (often
called eDiscovery), keyword search alone may not be as concise or as
time-efficient as required by standard legal timetables.
Concepts augments standard keyword searching by automatically
suggesting filters based on the results of a current search. By default it
looks for concepts based on persons, countries, noun groups,
organizations, company names, and products.
Concepts Search is an optional module that requires an additional
license key for each IS1200 cluster node. See the IS1200 Concepts
Search User and Configuration Guide for complete details.
conceptfinder Ruleset
The conceptfinder ruleset is an assignment ruleset that extracts the
concepts listed in the Review/Analysis Results Grouping Concepts pane,
which is only available when a valid Concepts license is installed on
the IS1200. The conceptfinder ruleset must be used in deep
classifications to get the best results in Review/Analysis from the
Concepts heading of the Results Grouping pane.
The ConceptFinder_DWF assignment ruleset combines both the
conceptfinder ruleset and the DocsWithoutFullText ruleset. See
“DocsWithoutFullText Assignment Ruleset” on page 154 for more
details.
150
Glossary
connectors
Connectors are IS1200 optional modules that allow an IS1200 to work
with repository types beyond the standard CIFS and NFS
repositories. See “optional modules” on page 160 for more details.
Optional module connectors require separate licenses to be purchased
and installed on all nodes of an IS1200 cluster. For a complete list of
optional modules available, see the Introduction chapter of any IS1200
User Guide.
Some connectors, such as the Microsoft Exchange Server Connector,
require agents. Agents are additional server platforms, usually
Windows servers, that provide the additional CPU cycles and
network staging the IS1200 needs to work with the repository types
they connect to.
All connectors have their own user guides which can be accessed from
the Kazeon Documentation link on the IS1200 Manager page
(https://<yourIS1200Name>/manager).
Container file/object
A file (object) that contains other files (sub-objects), such as a ZIP,
TAR, JAR, and PST or NSF files. The container file is often called the
“parent” and the contained objects are called “children”. Container
objects should not be confused with files that have embedded objects,
such as Microsoft Word files that have embedded charts or graphics
(OLE).
Custodian
A legal term used by Legal Service Providers (LSP) and other legal
personnel to describe the owners or responsible parties for electronic
documents pertinent (responsive) to a legal matter.
D
Data
Datamap
A file of any type and size such as a short email, a word processor
document, or a large spreadsheet.
A report that lists the electronic storage locations of all possible
sources of relevant ESI. This can include standard file servers,
groupware servers, email servers—and their backup and archive
systems—as well as custodian’s desktop and laptop computers.
Data-Mount
The NFS file system that is accessed by the IS1200 to parse data and
extract metadata.
Data Server
The file server that exports an NFS or CIFS file system so that the
IS1200 can classify data on the file system to create metadata.
151
Glossary
Data-Share
The CIFS file system to be accessed by the IS1200 to extract metadata.
Data Repository
A networked file system registered with the IS1200 so it can be
classified, searched, and reported on. Data repositories created on the
IS1200 itself (sometimes called localdatafs) are strongly
discouraged!
Data Verification
Builds on Auditing and is only available when system auditing is
enabled. For job services like Actionable Services Copy or Move,
Legal Hold Copy, and Single Step Collections, Data Verification
generates an audit trail proving that files were not altered during
these actions. This is especially valuable in eDiscovery situations.
Complete details are available in the Auditing and Data Verification
chapter of any IS1200 User Guide
Deduplication
A process that identifies file or email object and sub-object duplicates
based on their digest values (See “Digest Values” on page 153 for
details).
In the 4.7.0 and prior versions of the IS1200 software, deduplication
was only available for export actions (Actionable Services such as
Download, Legal Export, and Copy). This allowed exporting only the
unique files and email objects from a set of search results. With IS1200
version 4.8.0, deduplication's functionality is expanded and is
automatically applied during case collections and processing to allow
displaying deduplicated search results. Note that when deduplication is
applied to display of search results, duplicates are only suppressed
from display, however duplicates are physically removed from
exported file sets.
Deduplication is available only in the ECS version of IS1200 and is
applicable only in case context.
DeDuplication view is configurable as deduplication and
non-deduplication view. This allows to view whether any object has
got duplicates in search results and the duplicate of the Original (in
the search results).
Besides the automatic deduplication of collections and processing,
deduplication may also be started manually from the IS1200's case
dashboard.
152
Glossary
Deduplication reports describing how a particular job or service
applied deduplication are available. The reports can be accessed from
the IS1200 case dashboard as well as from web search. Reports can list
all results, only unique (deduplicated) results, or percentages of
unique and duplicates.
Reduplication is a process that allows the duplicates of unique files to
be identified so tagging processes can apply metadata tags to the
unique files as well as all its copies. Legal Tags reduplication can be
done after documents are added to the case.
Differential
Classifications
Differential classifications do not re-classify all file objects in the
selected repositories. Instead, they examine the metadata from
previous crawls, and if there is no previous metadata (indicating the
object is new since the last classification) or the metadata has changed
(based on atime, or mtime changes), then the object is parsed and its
metadata re-populated in the database.
Note: System classification configuration settings default to using mtime to
determine if files have changed for differential classifications. If atime is
desired instead, see the Using atimes for Differential Crawls section of the
Configuration Files and Utilities appendix of any IS1200 User Guide for details
on resetting the default to atime.
Additionally, atime may be applied only to selected classifications by
initiating them from the Command Line Interface, see the add service
deep-classification command and the crawl-atime-check-enabled
option in the IS1200 Command Line Interface Reference Guide for details.
Digest Values
Digests are numerical values calculated based on file and email
content and are unique for all unique objects. Digest values allow file
objects to be compared very quickly. Digests are calculated during
basic and deep classifications or during collections or processing
when indexing is enabled.
Digests are calculated differently for standard files, emails, and
container objects. For standard files, a physical digest is computed for
the entire file much like a hash value.
For email objects, just the subject, the message content (including
attachments), and certain specific addresses are combined and an
email digest value is calculated from the combination. Container
objects, like ZIP or PST files, and their sub-objects have digests
calculated both as complete objects and as individual sub-objects.
153
Glossary
Note: Calculating email digests requires access to the email object's fullText
and only classifications that include the fullText rule can produce email
digests. Emails classified without the fullText rule receive the same physical
digest that other files do. Consequently, identical emails on different
repositories, one classified with and one without the fullText rule, will not be
identified as duplicates.
Domino Sever (Lotus)
Domino XML
Language (DXL)
DocsWithoutFullText
Assignment Ruleset
A Lotus server providing groupware solutions and storage.
A Lotus version of eXtensible Markup Language (XML) used to
import and export Lotus email files.
Some file objects, such as graphics files (examples are.jpeg, .gif, or
.bmp files) contain no text, and hence will have no fullText
extracted by the FullTextRuleset, see “fullText” on page 156 for more
details. In legal cases, these files may still contain responsive
information, but not textual information that can be located by text
searches. The DocsWithoutFulltext assignment rules identifies these
files and adds the metadata tag and value
“DocWithoutFulltext=true” to all files that contain no searchable
text. This allows these files to be easily searched for later, and
inspected for legal responsiveness by non-search methods.
The ConceptFinder_DWF assignment ruleset combines both the
DocsWithoutFullText ruleset with the conceptfinder ruleset. See
“conceptfinder Ruleset” on page 150 for more details.
Note: Parent file objects that don’t contain text (such as .zip, .tar, and .pst files)
are not tagged with the DocWithoutFulltext tag.
Documentum Sever
(EMC)
The EMC Documentum server manages business content including
documents, photos, video, medical images, e-mail, Web pages, fixed
content, XML-tagged documents, and so on. The Documentum core
is a repository that stores content securely under compliance rules
and appears as a unified environment, even though content may
reside on multiple servers and physical storage devices within a
distributed environment.
E
eDiscovery
154
The process of reviewing electronic files to determine their relevances
and responsiveness to a legal matter or case.
Glossary
eDiscovery Case
Manager
An IS1200 tab that facilitates eDiscovery for Legal Service Providers.
Electronic Discovery
Reference Model
(EDRM)
The EDRM was a Project created to provide standards and guidelines
for the electronic discovery market. The model defines a common,
flexible and extensible framework for the development, selection,
evaluation and use of electronic discovery products and services.
Enterprise Vault
eth1, eth2
Extended Attributes
Extraction Rules
Exchange Server
(Microsoft)
A Symantec networked repository for archived email.
Most IS1200 platforms require two ethernet connections for proper
deployment. These connections are called eth1 and eth2, must each
have unique IP addresses, and must be GigaBit, or 1GB/sec or faster,
connections. Additionally, all network segments between eth1 and all
registered metadata and data repositories must be gigabit
eth1 is used to communicate between the IS1200 and its registered
repositories. The IS1200 hostname should be DNS mapped to the eth1
IP address.
eth2 must be connected to a private network between the IS1200
nodes and is used to coordinate and balance system wide operations.
eth2 IP address should not be DNS mapped.
User-defined keywords that are extracted during data classification.
Extraction rules are a type of classification rule. They extract
user-defined keywords (custom metadata) to add to the metadata file.
Extraction rules are grouped into Extraction Rule Sets (ERSs). See the
Policies: Classification, Extraction and Assignment Rules chapter of any
IS1200 User Guide for more details.
A Microsoft server designed to store and manage email.
F
Federation
Federation Server
Filer
A defined group of member-clusters on a Federation server that can
be managed, searched, and reported on as a group. Member-clusters
are referred to as Federated clusters.
A single-node IS1200 server, with a Federation license, that allows
consolidated searching and reporting of up to eight Federated
member-clusters of its defined Federation.
A file server that exports its file systems using NFS or CIFS protocol.
155
Glossary
fullText
fullText is the “content” portion of a file, for example this is the textual
content of word processing files and the message body of emails.
fulltext is an extraction rule that is used to save file textual content as
metadata to the Search Index during classifications. It saves up to 10
megabytes of content by default. This default may be changed, but it
is not recommended. Fulltext extraction is required by
Review/Analysis for the Previewer pane to work and to generate
Concepts in the Results Grouping pane.
fulltext, is extracted differently for container objects and sub-objects, and
for files with embedded objects.
Container objects (such as ZIP or PST files) and their sub-objects
are classified individually and the fulltext of the parent container
file, and for each child sub-object, is extracted and added to the
relevant metadata repository separately.
Files with embedded objects (such as a Microsoft Word file with and
embedded spreadsheet), are classified together. The fulltext of the
embedded object is included in the fulltext of its parent object and
not collected separately.
For more details on fullText, see Chapter 1 of the IS1200 Metadata
Reference Guide.
G
Groupware
Collaborative software designed to help people involved in common
tasks achieve their goals. Incorporates services such as email,
calendaring, text chat, wiki, web-sharing, document control, and
advanced search.
H
Hash Values
Hash values are used to compare one file with another for duplicates.
An extremely simplified description of hashing is that the numeric
values of all bytes in a file are added into a grand total. The chances of
two different files yielding the same result (hash value) are remotely
small, so hash values can be used to identify duplicate files, or
compare files with the same name to decide if they have been
modified.
156
Glossary
Computing hash on an entire file is called a full-hash, and computing
hash on a portion of the file is called a partial-hash. A “partial hash”
may also be used to increase classification speed and “hashing” can
be turned on, or off to increase classification speed.
I
identity
A single entry in the Identity Vault database. The identity contains a
single username and password that the IS1200 can retrieve when it
needs to access a registered data or metadata repository or other
server like and authentication service.
Identity Vault
An encrypted database of usernames and passwords the IS1200 uses
to store the credentials used to access registered data repositories,
send email notifications, and work with authentication services.
Information Center
Server
The standard IS1200 server offers clustering as a scalable solution for
classifying, searching, and reporting on registered network
repositories. While clustering is ideal for scaling to large numbers of
files on a LAN, it is not a viable solution for WANs. Enterprises with
multiple IS1200 clusters deployed, or IS1200 clusters deployed in
remote offices need the ability to setup and manage unified reports
and searches across all their clusters. The IS1200 Information Center
server provides this solution.
Each Federation server supports one federation. A Federation may
have up to eight clusters (with four nodes each) included in it. Once a
federation is established, it becomes a central management point
allowing classifications, search, and reports to be setup or managed
on all the federations members from the Information Center server.
See the IS1200 Information Center User and Configuration Guide for
complete details.
Intelligent Platform
Management
Interface (IPMI)
IS1200 clusters may contain more than one node. Normally each node
communicates with the others to share information and workload.
The IS1200 appliance includes an Intelligent Platform Management
Interface (IPMI) to shut down nodes when individual nodes or
software errors would degrade the overall cluster performance. The
IPMI is an autonomous micro-controller—installed in all cluster
nodes—used by the cluster’s “leader” node to power down nodes
with errors or performance problems. The IPMI requires its own
unique IP address, but communicates over the eth1 port, see “eth1,
eth2” on page 155 for more details.
157
Glossary
K
Kazeon EVAgent
An IS1200 service, installed on the Enterprise Vault server, that allows
the IS1200 to directly open and access Enterprise Vault email for
classification services.
Kaz-mount
The NFS file system that is the IS1200 metadata repository. on which
the IS1200 stores metadata.
Kazeon Query
Language (KQL)
A programming language used in classification and assignment rules
to identify files that should receive specified metadata tags.
KQL Reserved Words
The KQL language reserves the following words. Consequently, they
are not allowed to be searched for, or used as tags or aliases.
"ADD", "ALL", "ALTER", "AND", "ANY", "AS", "ASC", "AVG",
"BETWEEN", "BY", "CASCADE", "CHECK", "COLUMN", "COUNT",
"DESC", "DISTINCT", "ESCAPE", "EXISTS", "FROM", "FULL",
"GRANT", "GROUP", "HAVING", "IN", "INTO", "IS", "JOIN", "KEY",
"LEFT", "LIKE", "MAX", "MIN", "NOT", "NULL", "ON", "OR",
"ORDER", "OUTER", "REVOKE", "RIGHT", "SELECT", "SET", "SUM",
"UNION", "UNIQUE", "UPDATE", "VALUES", "VIEW", "WHERE"
Kaz-server
The file server where the metadata repository is located.
Kaz-share
The CIFS file system on which the IS1200 stores metadata.
Kaz Schema
Defines the set of metadata fields used to build a Search Index for
registered data repositories (file systems).
L
Legal Hold
Files placed on legal hold are either copied to a secure secondary
location where they can preserved for later use, or are locked in their
original locations against further change until a legal matter is
resolved.
Legal Service Provider
(LSP)
A lawyer or trained legal professional that provides legal services for
a fee.
Local
localdatafs
158
Refers to the local resources (usually the metadata repository) of the
Federation server.
A data repository created on the IS1200 itself. This practice is not
recommended.
Glossary
localkazfs
Logging rule
A metadata repository created on the IS1200 itself. This practice is not
recommended.
Logging rules audit user actions on files such as file access, creation,
modification, and deletion.
M
Manifest Reports
Manifests are reports that summarize the results of an IS1200 job or
service. Manifests are produced for Collections (from either
Administration or the Case Mgmt) and for some Actionable Services.
Collection Manifests summarize what files were, or were not
collected during a collection. Actionable Service Manifests reconcile
Actionable Services object-counts with the search result object-counts
they are performed on because processes such as deduplication can
result in the two counts not matching. The reports details the count of
differences and the reasons for the differences. For more information,
see Manifests in the IS1200 Web-Search User Guide.
Note: Collection manifests are available ONLY for collections done from
v4.6.0 or later, earlier versions did not generate collection manifests.
Member-cluster
Metadata
Metadata Repository
Any of the clusters registered to a particular Federation.
Data about data. Metadata is used to search for information and to
create reports. Metadata can be file system or custom metadata that
the IS1200 extracts from files during classification. File system
metadata includes file type, and file path extracted during basic
classification. Custom metadata is generated during deep
classification.
A registered repository the IS1200 uses exclusively to record the
metadata extracted during classification services on the registered
data repository the metadata repository is mapped to.
The primary metadata repository is the host of the repository
registration database, the report results database, Environment
Discovery job results, Auditing and Data Verification databases, and
miscellaneous databases the cluster requires for routine operation.
Collectively these are called the Cluster Data Base.
Metadata repositories created on the IS1200 itself (sometimes called
localkazfs) are strongly discouraged!
159
Glossary
N
Namespaces
IS1200 software, versions 4.0 and higher, organize metadata fields
into hierarchy defined by namespaces. Namespaces group similar sets
of tags, for example all the file level tags such as FileType, FileSize,
aTime, and cTime are grouped together in the System namespace. See
the IS1200 Metadata Reference Guide for complete details.
Network File System
(NFS)
A protocol used primarily by Unix based computers for accessing
computer systems and filers over the internet.
Network Information
System (NIS)
A network naming, administration, and authentication system for
smaller networks that was developed by Sun Microsystems and is
used primarily by Unix systems.
Node
Notes Storage File
(NSF)
A single IS1200 appliance.
A standardized storage file format used by Lotus to store email,
attachments, notes, calendars, and so on.
O
optional modules
The standard IS1200 license provides a default set of features that
allows the IS1200 to register, classify, and search and report on CIFS
and NFS data repositories. Optional modules are additional software
licenses that can add further capabilities, such as being able to work
with repository types other than CIFS and NFS, or providing
Concepts Search capabilities, or applying legal hold. Some optional
modules require connectors, see “connectors” on page 151 for more
details. For a complete list of available optional modules, see the
Introduction chapter of any IS1200 User Guide.
P
160
PEA Files
A Pool Entry Authorization (PEA) file is generated by the Centera
server administrator. A PEA file defines what applications and users
can perform read, write, delete, query, copy, or hold operations for
Centera objects.
Policy Groups
Associates one or more authorization rule and logging rule with one
or more files to protect information and audit user actions on files.
Glossary
PST Files
Personal STorage files are generally used by email programs like
Microsoft Outlook to store user email locally. PST files are also called
“composite” files, because they are packages meant to efficiently
store a number of smaller related files. Another example of a
composite file is a ZIP storage file
R
Retention
Roles
The process of enforcing corporate or legal standards for how long
certain kinds of files must be preserved for access. Examples of
retained files include files responsive to legal matters and medical
records.
All IS1200 users have a role, either admin, auditor, or end-user. If a
legal license is installed, there may also be legaladmin,
legalsupervisor, legalreviewer, or a custodian. Roles
determines what parts of the IS1200 interface may be seen, and how
much of search and report results are displayed.
S
Search Analytics
Pre-Processing
Search Analytics Pre-processing was introduced in release 4.5.0 to
minimize search results display time and improve the overall
efficiency of eDiscovery culling. Analytics Pre-processing is an
integral, automatic, post-processing job performed after any job that
modifies the Search Index. Analytics Pre-processing trades an
increased post-job indexing period for significantly reduced search
results display times after the affected jobs complete.
A variety of jobs requires Search Index changes and therefore require
Analytics Pre-processing. These include Collections, Classifications,
Delete, and Tagging jobs. The time required by Analytics
Pre-processing is determined primarily by the number of objects in
the affected data repository, the number of distinct analytic (result
filter grouping) attributes (such as custodians, mail senders, mail
recipients, sender domains, recipient domains and so on.), and the
read/write performance of the metadata repository associated with
the data repository.
Additionally, once any Analytics Pre-processing job is launched, all
subsequent Analytics Pre-processing jobs (that might be required by
other concurrent jobs-in-progress) wait for the current Analytic
Pre-processing job to finish. However, before beginning any Analytics
Pre-processing job for a particular data repository, the IS1200 checks
161
Glossary
all other jobs-in-progress for that repository to see if they might also
require Analytics Pre-processing. If other jobs are found, the IS1200
waits for all these jobs to finish in order to launch a single Analytics
Pre-processing job for all the jobs that affected the Search Index for
that data repository.
Therefore, there are two best practices suggested for scheduling jobs
that affect the Search Index:
•
Schedule large classifications or collections such that both they,
and the Analytics Pre-Processing they require, can both fully
complete before starting any other job. This allows the IS1200 to
most efficiently schedule the required processing resources. Large
jobs are those that affect data repositories with tens of thousands
of objects or terabytes of data.
•
Schedule small jobs (such as incremental collections, or
post-search tagging operations) to run concurrently so the IS1200
can identify their common Analytics Pre-processing requirements
and group them into a single job.
Note: IS1200’s that are upgraded to v4.5.0 may need some additional
configuration to make the most efficient use of Analytics Pre-Processing. See
the Configuring the IS1200 To Use Proactive Indexing section of the Configuration
Files and Utilities appendix of any IS1200 User Guide for complete details.
Search Index
SharePoint Sever
(Microsoft)
snippets
An IS1200 database that stores and indexes the file content metadata
(including extended attributes, and fullText) for standard and custom
user-defined metadata produced by extraction rules during
classifications.
A Microsoft server in the groupware category.
A snippet is a sub-set of a document’s actual content. Snippets are
only displayed if they are enabled in Review/Analysis Preferences, and
only in Paragraph View immediately under the first line of the result
listing.
After a keyword search completes, result snippets are created as
small standard size chunks of data taken from the text surrounding a
search query hit. For example, if a search is made for “medicine”, the
snippet will contain about 300 bytes of the text surrounding the
paragraph where the word “medicine” was found. If multiple search
162
Glossary
hits are found, the most relevant hit is used to create the snippet. For
searches made without keywords, snippets are simply the first 300
bytes of file text.
Snippet size is configurable, see the Configuration Files and Utilities
appendix of any IS1200 User Guide for details on setting snippet size.
In all cases, snippets are taken from the result file’s fullText.
SourceOne Archive
Server (EMC)
The EMC SourceOne server is a comprehensive, policy-based system
that automatically collects, organizes, indexes and retains messages
and associated attachments and stores them in designated archives
connected to shared storage. EMC SourceOne provides indexed
searching that works with both EMC storage and other brands such
as IBM or NetApp.
Special Characters
The IS1200 supports alphanumeric ASCII and UTF-8 characters.
Non-alphanumeric ASCII characters are defined as Special Characters
and include the following:
‘“-_\/!@#$%^&*+={}[]()<>|:;,.?~`
Special characters are not universally supported in the IS1200
interfaces. The following limitations must be noted:
Search Queries and Special Character, Special characters pose a
searching challenge. Because the IS1200 tokenization removes special
characters from indexed text as it is classified, special characters are
never entered into the IS1200 metadata indexes. Consequently,
special characters may not be directly searched for. For more details
see Tokenisation and Stemming in the IS1200 Web-Search User Guide.
While special characters may not be directly searched for, the text
they are included in can be searched. For example, the string
"-ACME-" is tokenized on the hyphens and recorded in the metadata
only as "ACME". Consequently, searching for the string with the
hyphens (-) will NOT work. However, you can search for “?ACME?”
(using the question mark wildcard) which gives the result as
“!ACME!”, “@ACME.”, and so on. See the IS1200 Web-Search User
Guide for more details on wildcards.
Note: The question mark character ( ? ) may not be searched for in filepaths,
even when escaped. This exception is limited to filepath searches only.
163
Glossary
AD login names and NIS login names support only alphanumeric
ASCII and UTF-8 characters, they do NOT support following special
characters:
'"-_\/!@#$%^&*+={}[]()<>|:;,.?~`
However, in Active Directory (AD), registered users may have both an
AD login name and a display name. For example, John Smith may have
the AD login name “jsmith” and the display name “John Smith”.
When new legal supervisors or reviewers are created in the Case
Mgmt using the AD lookup button, they take the display name, not the
login name, and the display name may contain special characters as
described below.
Legal Supervisor Names and Legal Reviewer Names only,
support:
'-_!@#$%^&*+={}[]()|:;,.?~`
do NOT support: " \ / < >
Custodian Names only,
support:
'"-_!@# %^&*+={}[]()|:;,.?~`
do NOT support:
\/<>$
Case Names, Legal Export Profile Names, Repository Names,
Rule Names, and Policy Names only,
support:
_ (underscore)
do NOT support: " - \ / ! @ # $ % ^ & * + = { } [ ] ( ) < > | : ; , . ? ~ `
Email IDs which are used in Legal Hold notification and
Acknowledgements, Search filters, Collection filters and so on,
support:
'-_!#$%^&{}:;,.?~`
do NOT support: " \ / @ * + = [ ] ( ) | < >
Mail Domain Names DO NOT support any special characters.
File names/ Directory names in source and destination file names
only,
support:
'-_!@^+={}[]()<>;,.~
do NOT support: " \ / # $ % & * | : ? `
Tag Names only,
support:
_ (underscore)
do NOT support: ' " - \ / ! @ # $ % ^ & * + = { } [ ] ( ) < > | : ; , . ? ~ `
164
Glossary
Tag Values only,
support:
'-_\/!@#$%^&*+={}[]()<>|:;,.?~`
do NOT support: " (double quote)
Rules definitions, special characters must be “escaped” before they
may be used in rule definitions. To escape a character use a \ before
the character.
Search technology uses reserved words, stop words, special
characters, tokenizers, and so on. These are common to almost ALL
search technologies and not just a Kazeon search engine
manifestation. One major reason for such implementation is that if all
the characters and the words are indexed regardless of any semantics,
your search index has a potential to increase beyond any manageable
size since the occurrence of said categories of tokens is so common.
Besides, there is not much value in indexing stop words (as, the, or,
and so on.) and tokenizers (@ , . - and so on.). The omission of such
characters from your search query is just a part of the optimization.
For example, when you “search” for (1+1):2, the characters “(”, “+”,
“)”, and “:” have special meaning in search - the parentheses are used
to specify grouping, the plus is used to specify inclusive terms in a
query, and the colon is using to separate tag from value as in
"filepath:*". In order to use these in your query, you need to escape
them with a backslash, as follows:
\(1\+1\)\:2
However, the escaping does not mean the characters are now a part of
your query. It only means that those characters are not interpreted by
search with special semantics. This query is preprocessed to drop
those characters from the final query which appears as follows when
it is actually executed by the search engine:
fulltext:"1 1 2"
This means that we are searching for a 1 followed by a 1 which is
again followed by a 2 such that there are no other valid indexable
search tokens between the three numbers. The results may match
1-1+2
1:1:2
1-1-2 and so on.
However, they will not match
165
Glossary
1:3:1:2
1-43+1:2 and so on.
Hence to search for (1+1):2, use the following query:
\(1\+1\)\:2
stop words
Stop words consist of the most commonly used words in sentences,
such as “a”, “an”, “the”, and ”and”. If indexed individually, they
would consume excessive amount of metadata storage space, and
consequently are not individually indexed.
If stop words are used in a search query, they are ignored unless they
are parts of quoted phrases. The table below lists all stop words:
Table 132Stop Words
166
a
an
and
are
as
at
be
but
by
for
if
in
into
is
it
not
of
on
or
such
that
the
their
then
there
these
they
this
to
was
will
with
Stemming
Stemming is a search technique designed to increase search efficiency
and broaden relevant search hits. When stemming is used, fullText
indexing first attempts to identify each word’s “stem”, and then
indexes words by their stems. For example, the words “connected”,
“connecting”, and “connectable” all share the same stem and are
indexed under “connect”. Search query criteria are automatically
stemmed, and so querying “connected” returns all instances where
“connect”, “connected”, and “connecting” are used. Nouns like
“connector” are not stemmed. Stemming is ON by default but may be
disabled.
stubs
Stubs are created by many file archiving applications, most notably
email archiving systems. When stubbing is used, and a file object is
moved to archival storage, a “stub” is left behind on the original file
system that points to the archived file’s new location. Thereafter, if a
user attempts to open the archived file from the original filer, the stub
allows that filer to retrieve the archived file and return it to the user
transparently (as if it were still on the original filer).
Glossary
Stubs may be searched for using the metadata field
“mailMessageClass”. For example, use the search query
“mailMessageClass:IPM.Note.ExShortcut” to find email message
stubs.
sub-objects
A file found inside a “container object”, see “Container file/object”
on page 151 for more details. A container file is often called the
“parent” and the contained sub-objects are called “children”.
Sub-objects should not be confused with embedded files such as OLE
objects, for example spreadsheets or graphics embedded in a
Microsoft Word file. Note however, an email message may be a
container object, if it “contains” attachments, or a simple
non-container object but still have a graphic embedded in its body.
T
Tags
The names of metadata fields. Tags are always associated with a
value. For example, the metadata tag “filename” for any given file is
always followed by a value (a text string) containing the actual
filename.
Tokenization
Tokenization is an IS1200 classification procedure that breaks word
strings into “tokens” for better search results. During classifications,
Numbers, AlphaNums, HostNames and EmailAddresses (in
fullText) are tokenized similar to alpha only strings. With
tokenization, the stings “www.kazeon.com”, “fred@kazeon.com”,
and “11,22,333,44” are tokenized into separate words yielding;
“www”,”kazeon”, and “com”; “fred”, “kazeon”, and “com”; and
“11”, “22”, “333”, and “444”. This allows searching for “kazeon” and
getting all email addresses that contain the domain name, while.
U
UTF-8
Unicode Transformation Format - 8, is an 8-bit coding scheme for
digitally representing both the standard western alphabet (Aa-Zz)
and its punctuation characters, and non-western word characters
such as the glyhps found in the Chinese, Japanese, and Korean
languages. UTF-8 encodes all its characters as 8-bit bytes (or octets).
The first 128 UTF-8 characters are identical to the first 128 ASCII
characters and require only one byte each. Non-western languages
are coded using one to four octets each. UTF-8 can encode all of the
1,112,064 code points in the Unicode character set that covers the
majority of languages in use around the world.
167
Glossary
W
Web-Admin
An IS1200 web application used by IT personnel to administer the
server itself, and when the IS1200 is used to help administer other IT
resources. Administration is the preferred interface for administering
the server.
Web-Reports
An IS1200 web application that provides advanced reporting
capabilities based on IS1200 metadata.
Web-Search
An IS1200 web application that provides basic, advanced, and
specialized email searches against IS1200 metadata.
X
XML
eXtensible Markup
Language
168
A file type that uses the XML language to define and describe data
that can be transferred between applications like databases and
spreadsheets.