International Atomic Energy Agency INIS Training Seminar Subject Analysis: Indexing with the INIS Thesaurus Bekele Negeri INIS Unit Nuclear Information Specialist 07 – 11 October 2013 Vienna, Austria Subject Analysis • Steps of Subject Analysis • subject classification • abstracting • subject indexing • Subject indexing means analysing the information content of a piece of literature and expressing the meaningfull information content in the language of the database using the controlled vocabulary of the Thesaurus INIS Training Seminar October 2013 2 International Atomic Energy Agency Subject Indexing • Subject Analysis should be carried out whenever possible by subject specialists with a good knowledge of the subject matter and a familiarity with the subject analysis tools of the respective database (subject categories, thesaurus, subject analysis rules) • • Understanding of the subject content subject specialist • Familiarity with Thesaurus and indexing rules Select a set of descriptors that describes the subject content of the piece of literature INIS Training Seminar October 2013 3 International Atomic Energy Agency Thesaurus The basic tools for subject indexing are the controlled vocabulary maintained in the Thesaurus and the rules for its application „A thesaurus is a terminological control device used in translating from the natural language of documents, indexers or users into a more constrained system language. It is a controlled and dynamic vocabulary of semantically and generically related terms which covers a specific domain of knowledge“ This definition has been adopted by UNESCO „Guidelines for the establishment and development of monolingual thesauri“, UNESCO, SC/W/255, Paris, September 1973 • INIS/ETDE Thesaurus: contains the controlled terminology for indexing all information within the subject scopes of the International Nuclear Information System (INIS) and the Energy Technology Data Exchange (ETDE). The terminology is intended for use in subject descriptions for input or retrieval of information in these systems. INIS Training Seminar October 2013 4 International Atomic Energy Agency Where do we find the INIS Thesaurus? • English version integrated with FIBRE and CAI • for indexing Multilingual Thesaurus (Arabic, Chinese, English , French, German, Japanese, Russian, Spanish) • Integrated with INIS Collection Search http://inis.iaea.org/search/ • Multilingual Thesaurus (browsable) as a general reference tool http://nkp.iaea.org/INISMLThesaurus/ • English version integrated with the IRPS Production (INIS Records Processing System) INIS Training Seminar October 2013 5 International Atomic Energy Agency The INIS/ETDE Thesaurus • The descriptor is placed in its correct semantic context by its wordblock - forbidden, broader, narrower and related terms • For a few descriptors where there could still exist a possibility of ambiguity - scope note • Descriptors 30 726 descriptors with scope notes • 22 051 valid terms and • 8 675 forbidden terms • Hidden terms INIS Training Seminar October 2013 6 International Atomic Energy Agency The Thesaurus and its Structure Relationship Sy Cross reference hierarchical hierarchical BT NT broader term (level 1, 2,...) narrower term (level 1, 2,...) affinitive RT related term preferential preferential UF UF+ preferential SF used for (reciprocally USE ...) used for multiple (reciprocally USE ... AND ...) seen for (reciprocally SEE ... OR ...) INIS Training Seminar October 2013 7 International Atomic Energy Agency Examples • BT /NT / RT Relationships COMBINED THERAPY INIS: 1993-08-04; ETDE: 1986-01-16 The use of both radiotherapy and chemotherapy to achieve a synergistic effect. *BT1 therapy RT antineoplastic drugs RT chemotherapy RT neoplasms RT radiotherapy RT side effects • UF/USE Relationships INIS Training Seminar October 2013 8 International Atomic Energy Agency Examples • UF+ / USE ..AND • SF/ SEE ..OR INIS Training Seminar October 2013 9 International Atomic Energy Agency Examples TRANSPORT Limited to the movement of goods and persons. For other types of transport, see descriptors such as ENVIRONMENTAL TRANSPORT, RADIATION TRANSPORT, RADIONUCLIDE MIGRATION, and RADIONUCLIDE KINETICS UF shipment UF space transport SF public transport SF travel NT1 air transport NT1 hydraulic transport NT1 land transport NT1 maritime transport NT1 pneumatic transport RT arctic gas pipelines INIS Training Seminar October 2013 10 International Atomic Energy Agency Examples PRODUCTION Limited to industrial production; see also PARTICLE PRODUCTION UF output RT availability RT capacity RT computer-aided manufacturing RT fabrication RT gross domestic product RT gross national product RT isotope production RT manufacturing RT planning RT productivity INIS Training Seminar October 2013 11 International Atomic Energy Agency INDEXING RULES… • Specificity Rule Rule: Always use the most specific appropriate descriptor. Check the worldblock to find the most specific descriptor Example: KINETICS NT1 radionuclide kinetics NT1 reaction kinetics NT2 biochemical reaction kinetics NT3 cpb (competitive protein binding) NT2 chemical reaction kinetics NT3 combustion kinetics NT2 nuclear reaction kinetics NT1 reactor kinetics RT collisions… INIS Training Seminar October 2013 12 International Atomic Energy Agency INDEXING RULES… Rule: Do not assign a descriptor and one of its broader terms to the same item. Example: SLOWPOKE TYPE REACTORS INIS: 1979-12-20; ETDE: 1980-01-24 UF safe low power critical experiment *BT1 enriched uranium reactors *BT1 isotope production reactors *BT1 pool type reactors *BT1 research reactors NT1 slowpoke-ottawa reactor NT1 slowpoke-toronto reactor NT1 slowpoke-alberta reactor NT1 slowpoke-dalhousie reactor NT1 slowpoke-montreal reactor NT1 slowpoke-wnre reactor INIS Training Seminar October 2013 13 International Atomic Energy Agency Data Flagging DATA (For data flagging always use a more specific term.) BT1 information NT1 numerical data NT2 compiled data NT2 evaluated data NT2 experimental data NT2 financial data NT2 statistical data NT2 theoretical data INIS Training Seminar October 2013 14 International Atomic Energy Agency INDEXING RULES… • Find implicit information "The resonance capture was not examined.“ If it is not examined – do not index "no resonance capture was observed" In case it was examined, but not found – do indexing "The reaction is enhanced by the presence of platinum" CATALYSIS "The long-half-life carbon isotope" - CARBON 14 "A computer code was developed" – PROGRAMMING Find implicit information INIS Training Seminar October 2013 15 International Atomic Energy Agency Proposed Terms (Technical Note 175) If no suitable descriptor exists in the Thesaurus for the retrieval of a usefull concept, make a proposal for a new one, containing the following: • Proposed term • Proposed word block of the term (in particular proposed BTs) • Potential forbidden terms pointing to this proposed descriptor • Scope note when appropriate • Explanation and justification for the proposal • One or more sample records INIS Training Seminar October 2013 16 International Atomic Energy Agency Cautions required when Proposing terms Examples of mistaken proposals • • • • Abbreviations • TLD dosimeters -> THERMOLUMINESCENT DOSEMETERS Existing forbidden terms proposed • Energy deposition -> use ENERGY ABSORPTION Concept could be represented by existing term • Tissue engineering -> GENETIC ENGINEERING English language related: Spelling/ plural singular • • Developing country -. DEVELOPING COUTRIES Fungus -> FUNGI INIS Training Seminar October 2013 17 International Atomic Energy Agency INDEXING PROCEDURES Summary • • • • • • Carefully read the title and abstract and scan the full text Identify the concept(s) about which the document contains useful information "Translate" the concepts into descriptors. Check each descriptor to make sure that: - the descriptors represent as precisely as possible the major concept(s); - the definition matches the use; - the selected descriptor is the most specific appropriate choice If part of a document is out of INIS scope, index the latter portion generally Avoid over indexing. INIS Training Seminar October 2013 18 International Atomic Energy Agency Abstracting and Title Augumentation • Guidelines for Abstract Preparation • • • • Submit an informative abstract whenever possible Emphasize what is novel about the information in the original document Do not repeat the title of the original document in the body of the abstract Do not exceed 6000 characters (900-1200 words) in length, including spaces and symbols • Title augmentation should indicate in a concise and abbreviated form the essential topic discussed in the piece of literature to which no reference is made in the title INIS Training Seminar October 2013 19 International Atomic Energy Agency • The purpose of subject indexing is to enable useful retrieval • Choose such information items for indexing as you would yourself expect to find in the piece of literature if you were the user searching for that information. INIS Training Seminar October 2013 20 International Atomic Energy Agency • The purpose of subject indexing is to enable useful retrieval • Choose such information items for indexing as you would yourself expect to find in the piece of literature if you were the user searching for that information. INIS Training Seminar October 2013 21 International Atomic Energy Agency