1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.

Slides:



Advertisements
Similar presentations
Taxonomy as Content Outline, Site Map and Search Aid SLA NWR Vancouver October 6, 2006 Marjorie M.K. Hlava President
Advertisements

Database Searching: How to Find Journal Articles? START.
Subject Analysis: An Introduction Based on BASIC SUBJECT CATALOGING USING LCSH edited by Lori Robare.
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
Controlling values The equivalence relationship. The vocabulary problem What is this?
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS/ETDE THESAURUS MAINTENANCE & USE OF COMPUTER-ASSISTED INDEXING.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Applying ISO25964 to thesaurus mapping and other forms of linkage Stella Dextre Clarke Convenor, ISO TC46/SC9 WG8 1.
Text Operations: Preprocessing. Introduction Document preprocessing –to improve the precision of documents retrieved –lexical analysis, stopwords elimination,
SLIDE 1IS 257 – Fall 2007 Thesaurus Construction and Use University of California, Berkeley School of Information IS 245: Organization of.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Module 6a: Intro to Controlled Vocabularies, Taxonomies and Classification IMT530: Organization of Information Resources Winter 2007 Michael Crandall.
Thesaurus Design and Development
1 Vocabulary & languages in indexing & searching Connection: indexing searching
Federal Controlled Vocabularies Data Architecture Sub-Committee (DAS) April 8, 2010 Brand K. Niemann.
A Registry for controlled vocabularies at the Library of Congress
Searching and Researching the World Wide: Emphasis on Christian Websites Developed from the book: Searching and Researching on the Internet and World Wide.
Sunday May 4 – 5 PM Bradford, Hlava, McNaughton
Vocabulary & languages in searching
EuroVoc, Eurlex, EU Bookshop Danica Maleková, Publications Office STS Bratislava, 22 October 2010.
INLS 520 Erik Mitchell INLS 520 Information Organization.
Information Organization
Taxonomies: Hidden but Critical Tools Marjorie M.K. Hlava President Access Innovations, Inc.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Languages are bridges … not barriers Chiara Carlucci – CEDEFOP Library ReferNet Technical Meeting September 2009.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Improving Access to Audio- Visual Materials by Using Genre/Form Terms OLAC Conference 1-3 October 2004 Montreal, Quebec.
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
Vocabularies in the VO Alasdair J G Gray Norman Gray Iadh Ounis.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
AAT Art & Architecture Thesaurus. Diffuse list of museum standards
Conceptual Maps and Thesauri : A Comparison of Two Models of Representation Arising from Different Disciplinary Traditions Lalthoum Saàdani and Suzanne.
Current Events and Issues Using Index Databases for Finding Answers.
Overview of ISO NP Stella G Dextre Clarke Convenor, IDT/2/2 Working Group of BSI and Project Leader for ISO NP
The UNESCO Thesaurus Meeting for Managers of UNESCO Documentation Networks Meron Ewketu UNESCO Library June
ISO 25964: a standard in support of interoperability Stella G Dextre Clarke Project Leader, ISO NP
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
1 Controlled Vocabularies Paul Miller Interoperability Focus UKOLN U KOLN is funded by Resource: the Council.
INFO Week 8 Subject Indexing & Knowledge Representation Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
Thesauri usage in information retrieval systems: example of LISTA and ERIC database thesaurus Kristina Feldvari Departmant of Information Sciences, Faculty.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
New Tools for astronomy librarians D Donna Thompson SLA PAM Roundtable June 9, 2014.
Controlled Vocabulary & Thesaurus Design Hierarchies.
Indexes and Abstracts: Dissecting the Resource By M. Leedy.
Subject Analysis and Vocabulary Control Spring 2006, 6 March Bharat Mehra IS 520 (Organization and Representation of Information) School of Information.
Information Retrieval
Subject Headings for Reference Everything You Need to Know About Subject Headings in One Easy Lesson By Dr. Nancy J. Becker Presented by Dr. Kevin Rioux.
June 2003INIS Training Seminar1 INIS Training Seminar 2-6 June 2003 Subject Analysis Thesaurus and Indexing Alexander Nevyjel Subject Control Unit INIS.
Order Out of Chaos: Creating and Valuing Taxonomies Information Highways Conference e-Content Institute April 6, 2005
Revising ANSI/NISO Z39.19 Updates for the 21 st Century.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
ORGANIZATION OF ELEMENTS OF INFORMATION The Thesaurus.
Subject Access to Your Information Sandy Tucker Texas A&M University Libraries August 1, 2006 Second International Symposium on Transportation Technology.
Charlyn P. Salcedo Instructor Types of Indexing Languages.
분류론 대학원 CLASSIFICATION AS A SEARCH TOOL Do you have anything on ‘stegosaurus’? stegosaurus dinosaurs prehistoric animals, prehistoric.
Slide 6 HMD1SPI376 - Slide 6. What is the Relationship Between BT and NT?  Normally, BT and NT are "inverse" links. In other words, if X is a broader.
Controlling values for information organization 384C – Organizing Information Spring 2016 Karen Wickett School of Information University of Texas at Austin.
1 How do we describe something? n What something is about? –What the content of an object is “about”? n Different methods (Wilson, 1968) –counting terms.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Information Organization
Subject Access: Indexing and Abstracting
Information Organization
Introduction to Semantic Metadata & Semantic Web
THESAURUS CONSTRUCTION: GROUND WATER
Presentation transcript:

1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files for named entities (people, places, structures, organizations) –Classification –Keyword lists –Natural language systems (broad interpretation)

2 Subject Analysis n What something is about? –What the content of an object is “about”? n Different methods (Wilson, 1968) –Counting (objective method) –Purposive method –Method appealing to unity –What stands out n Challenges –Non-text

3 Aboutness: How to do it! n Read the document [Intellectual reading] –look for key features –many indexers mark up the items –rarely have time to read the whole document n Determine aboutness [Conceptual analysis] n Translate aboutness into the vocabulary or scheme you are using –In general: Subject headings: 1-3 headings –Descriptors, 5-8 descriptors –Classification: 1 notation (should it only be one!?).

4 Features of indexing languages: n Involve rules and require maintenance n Can be generated via automatic, human, or auto-human processes n Different processes generally display different strengths and weaknesses.

5 Features of indexing languages: n With the exception of a few general domain tools, they are generally domain specific. –MeSH –NASA Thesaurus –Astronomy Thesaurus –ERIC thesaurus n Concepts (or concept representations) are arranged in a discernable order 

6 Language schema designs n Classified--grouping –Hierarchies and facets MeSH Browser Art and Architecture (Getty AAT) n Alphabetical -- horizontal –Verbal/Alphabetical (ordering/filing challenges)

7 Controlled Vocabulary n A list or a database of subject terms in which each concept has a preferred terms or phrase that will be used to represent it in the retrieval tool; the terms not used have references (syndetic structure), and often scope notes.

8 Thesaurus (structured thesaurus) n Lexical semantic relationships n Composed of indexing terms/descriptors n Descriptors = representations of concepts n Concepts = Units of meaning (Svenonius)

9 Thesaurus n Preferred terms n Non-preferred terms n Semantic relations between terms n How to apply terms (guidelines, rules) n Scope notes n Adding terms (How to produce terms that are not listed explicitly in the thesaurus)

10 Preferred Terms n Control form of the term Spelling, grammatical form Theatre / Theater MLA / Modern language association n Choose preferred term between synonyms Brain cancer or Brain Neoplasms?

11 Common thesaural identifiers n SN Scope Note –Instruction, e.g. don’t invert phrases n USE Use (another term in preference to this one) n UF Used For n BT Broader Term n NT Narrower Term n RT Related Term

12 Semantic Relationships n Hierarchy n Equivalence n Association

13 Hierarchies of Meaning ‘Glass’ ‘Beer Glass’ ‘Wine Glass’ ‘Red wine glass’ ‘White wine glass’ From: Controlled Vocabularies/ Paul Miller Interoperability Focus UKOLN

14 Hierarchy n Level of generality – both preferred terms n BT (broader term) –Robins BT Birds n NT (narrower term) –Birds NT Robins –Inheritance, very specific rules

15 Equivalence n When two or more terms represent the same concept n One is the preferred term (descriptor), where all the information is collected n The other is the non-preferred and helps the user to find the appropriate term

16 Equivalence n Non-preferred term USE Preferred term –Nuclear Power USE Nuclear Energy –Periodicals USE Serials n Preferred term UF (used for) Non-preferred term –Nuclear Energy UF Nuclear Power –Serials UF Periodicals

17 Association n One preferred term is related to another preferred term n Non-hierarchical n “See also” function n In any large thesaurus, a significant umber of terms will mean similar things or cover related areas, without necessarily being synonyms or fitting into a defined hierarchy

18 Association n Related Terms (RT) can be used to show these links within the thesaurus –Bed RT Bedding –Paint Brushes RT Painting –Vandalism RT Hostility –Programming RT Software

19 Thesauri Guides n National Information Standards Organization. (1993). Guidelines for the construction, format, and management of monolingual thesauri. ANSI/NISO Z Bethesda, MD: NISO Press.[SILS reference Z695.N or 19.pdf] 19.pdf n Aitchison, Jean & Gilchirist, Alan. Thesaurus Construction: A Practical Guide. 3 rd ed. London: Aslib, n Willpower Information Management Consultants

20 Thesauri Directory n Indexing Resources on the WWW – se1.htmhttp:// se1.htm –-- explore ASIST Thesaurus n Controlled vocabularies – n Web Compendium –

21 n Thesauri/Keywords n Created according to standards n Z39.19 (Ansi) n Single term concepts/postcoordination  “Wireless network” & “home computer”  “Terrorism” “Attacks” & “United States” n More popular in the online environment n Lend to recall n Lend to multilingual environment n Subject Heading Lists n Rules and guidelines n “Thesaurification” n multi-word concepts/pre- coordination  “Wireless home computer network”  $y Terrorism attacks $z United States n STRINGS n Lend to precision