Download presentation
Presentation is loading. Please wait.
Published byBrenda Hicks Modified over 9 years ago
1
International Atomic Energy Agency October 2013INIS Training Seminar1 Subject Analysis: Computer Assisted Indexing 07 – 11 October 2013 Vienna, Austria Bekele Negeri INIS Unit Nuclear Information Specialist (Adapted from A. Nevyjel’s presentation)
2
International Atomic Energy Agency Subject Indexing Tools There are two main INIS products used for indexing: WinFibre and CAI WinFibre – for input preparation both bibliographic and subject indexing CAI (Computer Assisted Indexing) – for subject classification and indexing INIS/ETDE Thesaurus and INIS Subject Category Codes are incorporated in both. October 2013INIS Training Seminar2
3
International Atomic Energy Agency Indexing with FIBRE October 2013INIS Training Seminar3
4
International Atomic Energy Agency October 2013INIS Training Seminar4 Computer-assisted Indexing - CAI Kick-off MeetingJan 2004 Implementation and Customisation Jun 2004 Production Indexing from Jun 2004 ongoing CAI version 1.0 final acceptance Aug 2004 Tuning of the system from Aug 2004 ongoing CAI batch processing for Member StatesDec 2004 CAI online from remote for MSNov 2007
5
International Atomic Energy Agency October 2013INIS Training Seminar5
6
International Atomic Energy Agency October 2013INIS Training Seminar6 CAI Thesaurus Extension Thesaurus Valid Descriptors22,051 Forbidden Terms 8,675 Total30,726 CAI Hidden Terms~35.000 Terminological Knowledge Base
7
International Atomic Energy Agency October 2013INIS Training Seminar7 CAI Thesaurus extension “ Hidden terms” are character patterns representing the different appearances of a concept in the free text, which is indexed by one or more descriptors. handled similar to “forbidden terms” with one or more USE relations CAI internal only not exported to INIS production system not exported to FIBRE not printed in any appearance of the thesaurus support identification of descriptors in the free text
8
International Atomic Energy Agency October 2013INIS Training Seminar8 Hidden Terms: Compounds and Isotopes Descriptorhidden termfree text MAGNESIUM BORIDESMgB_2MgB 2 ACETIC ACIDC_2H_4O_2C 2 H 4 O 2 CESIUM 137Cesium 137, Cesium-137 "1"3"7cs 137 Cs 137 caesium137 Caesium, 137-Caesium caesium 137Caesium 137, Caesium-137 137 cesium137 Cesium, 137-Cesium 137 cs137 Cs, 137-Cs s 137Cs 137, Cs-137 cs"1"3"7Cs 137 cs137Cs137
9
International Atomic Energy Agency October 2013INIS Training Seminar9 Hidden Terms: Elementary Particles and countries Descriptorhidden termfree text ELECTRON NEUTRINOS#nu#_e ν e MUON NEUTRINOS#nu#_#mu# ν μ TAU NEUTRINOS#nu#_#tau# ν τ RHO-770 MESONS#rho#-770 ρ-770 OMEGA-782 MESONS#omega#-782 ω-782 Country Names: CAMBODIAkampuchea COTE D'IVOIREivory coast GREECEhellas MYANMARburma THAILANDsiam
10
International Atomic Energy Agency October 2013INIS Training Seminar10 Hidden Terms: UK/US Spellings Descriptorhidden term A CENTERSa centres ACTIVITY METERSactivity metres ANALOG COMPUTERSanalogue computers ANESTHESIAanaesthesia ARCHAEOLOGYarcheology AUSTRIAN ORGANIZATIONSaustrian organisations BALLISTIC MISSILE DEFENSEballistic missile defence BAYARD-ALPERT GAGESbayard-alpert gauges BEAM ANALYZERSbeam analysers BEHAVIORbehaviour CATALOGScatalogues
11
International Atomic Energy Agency October 2013INIS Training Seminar11 Hidden Terms: Other Spellings Descriptorhidden term Singular/Plural FUNGIfungus FUNGIfunguses G MATRIXg matrices G MATRIXg matrixes Reverse Sequence ATOM-MOLECULE COLLISIONSatom-molecule scattering ATOM-MOLECULE COLLISIONSmolecule-atom scattering ATOM-MOLECULE COLLISIONSatom-molecule reactions ATOM-MOLECULE COLLISIONSmolecule-atom reactions ATOM-MOLECULE COLLISIONSatom-molecule interactions ATOM-MOLECULE COLLISIONSmolecule-atom interactions
12
International Atomic Energy Agency October 2013INIS Training Seminar12 Further Improvements necessary “+” and “-“ signs K + KAONS PLUS, KAONS MINUS, POTASSIUM IONS Case sensitivity TiN TIN (instead of TITANIUM NITRIDES) gas GALLIUM SULFIDES “…who is the …” WHO (World Health Organization) Verbs versus Nouns “… this leads us to …” LEAD “… this leaves it ….” LEAVES Homographic terms Solutions SOLUTIONS or MATHEMATICAL SOLUTIONS Nuclear Reactions, e.g. 14 N(γ,α) 10 B Targets Beams Reactions
13
International Atomic Energy Agency INIS Training Seminar INDEXING PROBLEMS General terms (energy, physics, materials, uses etc. Misleading CAI suggestions: Thesaurus terms: PRODUCTIONPARTICLE PRODUCTION PRODUCTION and PARTICLE PRODUCTION SOLUTIONMATHEMATICAL SOLUTION SOLUTION and MATHEMATICAL SOLUTION IGNITIONTHERMONUCLEAR IGNITION IGNITION and THERMONUCLEAR IGNITION WALLS THERMONUCLEAR REACTOR WALLS WALLS and THERMONUCLEAR REACTOR WALLS PLANTSNUCLEAR POWER PLANTS PLANTS and NUCLEAR POWER PLANTS MEMBRANESmembrane MEMBRANES (classic) and membrane (in brane theory) COLORCOLOR MODEL COLOR and COLOR MODEL (elementary particle characteristics) TRANSPORT, etc. October 201313
14
International Atomic Energy Agency INIS Training Seminar INDEXING PROBLEMS chemical compounds/ case sensitivity/homonyms: INDIUM IONS for “in ions” ASTATINE 200 for at 200 o C VISIBLE RADIATION for light (weight) HELIUM 6 for “consisting of 6 He 3 tubes” VISIBLE RADIATION for “light weight” temperature, pressure, etc. range abbreviations: TNA for Thermal Neutron Analysis and TRINONYLAMINE MPA forMaximum Permissible Activity MPa (Mega Pascal) October 201314
15
International Atomic Energy Agency October 2013INIS Training Seminar15 CAI online for Member States introduced in July 2007 CAI Batch used by China Czech Republic (seldom) Georgia (only in 2012) Germany Iran Uzbekistan Vietnam CAI Online in use by Austria Bulgaria Cuba Israel (registering) Japan Mexico Netherlands (seldom) Uruguay CAI online and CAI batch are now regular services for Member States
16
International Atomic Energy Agency October 2013INIS Training Seminar16 CAI Batch and Online Processing Input:MemSt-CC-yymmdd-xxxxxxxxxxx MemSt is a standard prefix (meaning “member state”) CC is the country code yymmdd is the date when the file was generated xxxxxxxxxxx is any additional identification Examples MemSt-AR-041203-thisismytestfile MemSt-FR-041212-fileidentification
17
International Atomic Energy Agency October 2013INIS Training Seminar17 CAI Batch Processing Output:_MemSt-CC-yymmdd-xxxxxxxxxxx These files will carry the CAI suggested descriptors in tag 800, preceded by the string ##CAI suggestions##; Example: 800^##CAI suggestions##; DESCRIPTOR1; DESCRIPTOR2; DESCRIPTOR3; ……. sent back to the member state for reviewing
18
International Atomic Energy Agency October 2013INIS Training Seminar18
19
International Atomic Energy Agency October 2013INIS Training Seminar19 CAI Batch and Online Processing Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges,... nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs clean up BT/NTs from manual additions Clean up suggestions from homographic terms
20
International Atomic Energy Agency October 2013INIS Training Seminar20 CAI Batch and Online Processing Finalisation Process CAI batch When reviewing of the record completed: Delete “##CAI suggestions## “ When reviewing of all records completed: Submit file to “INIS Input Box” CAI online When reaching the last record: press “export and exit” button File goes directly to INIS production system, or if required, sent back to Member State for reviewing
21
International Atomic Energy Agency Thank you! October 2013INIS Training Seminar21
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.