Subject Analysis: Indexing with the INIS Thesaurus

advertisement
International Atomic Energy Agency
INIS Training Seminar
Subject Analysis:
Indexing with the INIS Thesaurus
Bekele Negeri
INIS Unit
Nuclear Information Specialist
07 – 11 October 2013
Vienna, Austria
Subject Analysis
• Steps of Subject Analysis
• subject classification
• abstracting
• subject indexing
•
Subject indexing means analysing the information
content of a piece of literature and expressing the
meaningfull information content in the language of the
database using the controlled vocabulary of the
Thesaurus
INIS Training Seminar
October 2013
2
International Atomic Energy Agency
Subject Indexing
• Subject Analysis should be carried out whenever possible
by subject specialists with a good knowledge of the
subject matter and a familiarity with the subject analysis
tools of the respective database (subject categories,
thesaurus, subject analysis rules)
•
•
Understanding of the subject content  subject specialist
•
Familiarity with Thesaurus and indexing rules
Select a set of descriptors that describes the
subject content of the piece of literature
INIS Training Seminar
October 2013
3
International Atomic Energy Agency
Thesaurus
The basic tools for subject indexing are the controlled vocabulary
maintained in the Thesaurus and the rules for its application
„A thesaurus is a terminological control device used in translating
from the natural language of documents, indexers or users into
a more constrained system language. It is a controlled and
dynamic vocabulary of semantically and generically related
terms which covers a specific domain of knowledge“
This definition has been adopted by UNESCO
„Guidelines for the establishment and development of monolingual thesauri“, UNESCO, SC/W/255, Paris, September 1973
• INIS/ETDE Thesaurus: contains the controlled
terminology for indexing all information within the subject scopes
of the International Nuclear Information System (INIS) and the
Energy Technology Data Exchange (ETDE). The terminology is
intended for use in subject descriptions for input or retrieval of
information in these systems.
INIS Training Seminar
October 2013
4
International Atomic Energy Agency
Where do we find the INIS Thesaurus?
• English version integrated with FIBRE and CAI
•
for indexing
Multilingual Thesaurus (Arabic, Chinese,
English , French,
German, Japanese, Russian, Spanish)
• Integrated with INIS Collection Search
http://inis.iaea.org/search/
• Multilingual Thesaurus (browsable) as a general
reference tool http://nkp.iaea.org/INISMLThesaurus/
• English version integrated with the IRPS
Production (INIS Records Processing System)
INIS Training Seminar
October 2013
5
International Atomic Energy Agency
The INIS/ETDE Thesaurus
• The descriptor is placed in its correct semantic context by its
wordblock - forbidden, broader, narrower and related terms
• For a few descriptors where there could still exist a
possibility of ambiguity - scope note
• Descriptors 30 726 descriptors with scope notes
• 22 051 valid terms and
• 8 675 forbidden terms
• Hidden terms
INIS Training Seminar
October 2013
6
International Atomic Energy Agency
The Thesaurus and its Structure
Relationship
Sy
Cross reference
hierarchical
hierarchical
BT
NT
broader term (level 1, 2,...)
narrower term (level 1, 2,...)
affinitive
RT
related term
preferential
preferential
UF
UF+
preferential
SF
used for (reciprocally USE ...)
used for multiple
(reciprocally USE ... AND ...)
seen for
(reciprocally SEE ... OR ...)
INIS Training Seminar
October 2013
7
International Atomic Energy Agency
Examples
• BT /NT / RT Relationships
COMBINED THERAPY
INIS: 1993-08-04; ETDE: 1986-01-16
The use of both radiotherapy and chemotherapy to achieve a synergistic effect.
*BT1 therapy
RT antineoplastic drugs
RT chemotherapy
RT neoplasms
RT radiotherapy
RT side effects
• UF/USE Relationships
INIS Training Seminar
October 2013
8
International Atomic Energy Agency
Examples
• UF+ / USE ..AND
• SF/ SEE ..OR
INIS Training Seminar
October 2013
9
International Atomic Energy Agency
Examples
TRANSPORT
Limited to the movement of goods and persons. For other types of
transport, see descriptors such as ENVIRONMENTAL
TRANSPORT, RADIATION TRANSPORT, RADIONUCLIDE
MIGRATION, and RADIONUCLIDE KINETICS
UF shipment
UF space transport
SF public transport
SF travel
NT1 air transport
NT1 hydraulic transport
NT1 land transport
NT1 maritime transport
NT1 pneumatic transport
RT arctic gas pipelines
INIS Training Seminar
October 2013
10
International Atomic Energy Agency
Examples
PRODUCTION
Limited to industrial production; see also PARTICLE PRODUCTION
UF output
RT availability
RT capacity
RT computer-aided manufacturing
RT fabrication
RT gross domestic product
RT gross national product
RT isotope production
RT manufacturing
RT planning
RT productivity
INIS Training Seminar
October 2013
11
International Atomic Energy Agency
INDEXING RULES…
• Specificity Rule
Rule: Always use the most specific appropriate descriptor.
Check the worldblock to find the most specific descriptor
Example:
KINETICS
NT1 radionuclide kinetics
NT1 reaction kinetics
NT2 biochemical reaction kinetics
NT3 cpb (competitive protein binding)
NT2 chemical reaction kinetics
NT3 combustion kinetics
NT2 nuclear reaction kinetics
NT1 reactor kinetics
RT collisions…
INIS Training Seminar
October 2013
12
International Atomic Energy Agency
INDEXING RULES…
Rule: Do not assign a descriptor and one of its broader terms to the
same item.
Example:
SLOWPOKE TYPE REACTORS
INIS: 1979-12-20; ETDE: 1980-01-24
UF safe low power critical experiment
*BT1 enriched uranium reactors
*BT1 isotope production reactors
*BT1 pool type reactors
*BT1 research reactors
NT1 slowpoke-ottawa reactor
NT1 slowpoke-toronto reactor
NT1 slowpoke-alberta reactor
NT1 slowpoke-dalhousie reactor
NT1 slowpoke-montreal reactor
NT1 slowpoke-wnre reactor
INIS Training Seminar
October 2013
13
International Atomic Energy Agency
Data Flagging
DATA
(For data flagging always use a more
specific term.)
BT1 information
NT1 numerical data
NT2 compiled data
NT2 evaluated data
NT2 experimental data
NT2 financial data
NT2 statistical data
NT2 theoretical data
INIS Training Seminar
October 2013
14
International Atomic Energy Agency
INDEXING RULES…
•
Find implicit information
"The resonance capture was not examined.“
If it is not examined – do not index
"no resonance capture was observed"
In case it was examined, but not found – do indexing
"The reaction is enhanced by the presence of platinum" CATALYSIS
"The long-half-life carbon isotope" - CARBON 14
"A computer code was developed" – PROGRAMMING
Find implicit information
INIS Training Seminar
October 2013
15
International Atomic Energy Agency
Proposed Terms (Technical Note 175)
If no suitable descriptor exists in the Thesaurus for the
retrieval of a usefull concept, make a proposal for a
new one, containing the following:
• Proposed term
• Proposed word block of the term (in particular
proposed BTs)
• Potential forbidden terms pointing to this proposed
descriptor
• Scope note when appropriate
• Explanation and justification for the proposal
• One or more sample records
INIS Training Seminar
October 2013
16
International Atomic Energy Agency
Cautions required when Proposing terms
Examples of mistaken proposals
•
•
•
•
Abbreviations
•
TLD dosimeters -> THERMOLUMINESCENT DOSEMETERS
Existing forbidden terms proposed
•
Energy deposition -> use ENERGY ABSORPTION
Concept could be represented by existing term
•
Tissue engineering -> GENETIC ENGINEERING
English language related: Spelling/ plural singular
•
•
Developing country -. DEVELOPING COUTRIES
Fungus -> FUNGI
INIS Training Seminar
October 2013
17
International Atomic Energy Agency
INDEXING PROCEDURES Summary
•
•
•
•
•
•
Carefully read the title and abstract and scan the full text
Identify the concept(s) about which the document contains useful
information
"Translate" the concepts into descriptors.
Check each descriptor to make sure that:
- the descriptors represent as precisely as possible the major concept(s);
- the definition matches the use;
- the selected descriptor is the most specific appropriate choice
If part of a document is out of INIS scope, index the latter portion generally
Avoid over indexing.
INIS Training Seminar
October 2013
18
International Atomic Energy Agency
Abstracting and Title Augumentation
• Guidelines for Abstract Preparation
•
•
•
•
Submit an informative abstract whenever possible
Emphasize what is novel about the information in the original
document
Do not repeat the title of the original document in the body of the
abstract
Do not exceed 6000 characters (900-1200 words) in length, including
spaces and symbols
• Title augmentation
should indicate in a concise and abbreviated form the essential topic
discussed in the piece of literature to which no reference is made in the
title
INIS Training Seminar
October 2013
19
International Atomic Energy Agency
• The purpose of subject indexing is
to enable useful retrieval
• Choose such information items for indexing as
you would yourself expect to find in the piece of
literature if you were the user searching for
that information.
INIS Training Seminar
October 2013
20
International Atomic Energy Agency
• The purpose of subject indexing is
to enable useful retrieval
• Choose such information items for indexing as
you would yourself expect to find in the piece of
literature if you were the user searching for
that information.
INIS Training Seminar
October 2013
21
International Atomic Energy Agency
Download