Order Out of Chaos: Creating and Valuing Taxonomies Information Highways Conference e-Content Institute April 6, 2005

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 Metadata Registry Standards: A Key to Information Integration Jim Carpenter Bureau of Labor Statistics MIT Seminar June 3, 1999 Previously presented.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group.
United Nations Statistics Division Principles and concepts of classifications.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Taxonomies in Electronic Records Management Systems May 21, 2002.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
SchemaLogic Workshop Part 2 Tools for Enterprise Metadata Management and Synchronization Prepared for the University of Washington Information School Applied.
Kristin Eberle Monica Hampton Carmen Velasquez Kristin Eberle Monica Hampton Carmen Velasquez Knowledge Management.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
Federal Controlled Vocabularies Data Architecture Sub-Committee (DAS) April 8, 2010 Brand K. Niemann.
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
Sunday May 4 – 5 PM Bradford, Hlava, McNaughton
Vocabulary & languages in searching
The NICE taxonomy: a case study of developing a corporate taxonomy Sadia Mughal Health Libraries Conference 19 th July 2010.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
The Relational Database Model
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Indexing Knowledge Daniel Vasicek 2014 March 27 Introduction Basic topic is : All Human Knowledge Who Cares? Simple Examples.
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
H. Lundbeck A/S3-Oct-151 Assessing the effectiveness of your current search and retrieval function Anna G. Eslau, Information Specialist, H. Lundbeck A/S.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Basics of Information Retrieval Lillian N. Cassel Some of these slides are taken or adapted from Source:
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
DACS Describing Archives: A Content Standard. The Background  Archives, Personal Papers & Manuscripts, 1980s –New Technologies with Web, XML, EAD –Revision.
Tommie Curtis SAIC January 17, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2023.
Indexing Jyothi Jandhyala. Disclaimer! Indexing cannot be reduced to a set of steps that can be followed! It is not a mechanical process. Indexing books.
Confidential 111 Financial Industry Business Ontology (FIBO) [FIBO– Business Entities] Understanding the Business Conceptual Ontology For FIBO-Business.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
Semantic Data & Ontologies CMPT 455/826 - Week 5, Day 2 Sept-Dec 2009 – w5d21.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Libraries and Museums Jenn Riley Metadata Librarian Indiana University Digital Library.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Controlled Vocabulary & Thesaurus Design Course Introduction and Background.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Information Retrieval
June 2003INIS Training Seminar1 INIS Training Seminar 2-6 June 2003 Subject Analysis Thesaurus and Indexing Alexander Nevyjel Subject Control Unit INIS.
Revising ANSI/NISO Z39.19 Updates for the 21 st Century.
Software Reuse Course: # The Johns-Hopkins University Montgomery County Campus Fall 2000 Session 4 Lecture # 3 - September 28, 2004.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
LIS 204: Introduction to Library and Information Science Week Nine Kevin Rioux, PhD.
Subject Description LIS 571 The Organization and Control of Recorded Information.
ORGANIZATION OF ELEMENTS OF INFORMATION The Thesaurus.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Charlyn P. Salcedo Instructor Types of Indexing Languages.
FIND IT! USING LIBRARY CATALOGING CONCEPTS TO ORGANIZE AND MAKE RECORDS FINDABLE DIONNE L. MACK, INTERIM DIRECTOR OF QUALITY OF LIFE DEPARTMENTS.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
1 How do we describe something? n What something is about? –What the content of an object is “about”? n Different methods (Wilson, 1968) –counting terms.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Theoretical Perspectives: Information, Language and Cognition Week 14 Lecture notes INF 380E: Perspectives on Information Spring
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Information Organization
Information Organization: Overview
Introduction to Metadata
Taxonomies & Classification for Organizing Content
Taxonomies, Lexicons and Organizing Knowledge
Transportation Research Thesaurus:
2. An overview of SDMX (What is SDMX? Part I)
COURSE DEVELOPMENT PROCESS OVERVIEW AND GUIDELINES
SDMX Information Model: An Introduction
Attributes and Values Describing Entities.
Outlook and Shared Drives
Information Organization: Overview
Presentation transcript:

Order Out of Chaos: Creating and Valuing Taxonomies Information Highways Conference e-Content Institute April 6, 2005

Information Highways, April 6, 2005© Denise Bruno2 Agenda Fun! Controlled vocabularies Value of taxonomies Types of taxonomies Taxonomy development

Information Highways, April 6, 2005© Denise Bruno3 “The value of knowledge is largely tied to the way in which that knowledge is organized. If you can’t find it, it’s not likely to be of much use to you.” Marc Rapport Unfolding Knowledge Knowledge Management E-zine

Exercise Put the slips in some sort of order so that they are of use to you.

Information Highways, April 6, 2005© Denise Bruno5 Taxonomies, Metadata and Classification 8-week course Professional Learning Centre Faculty of Information Studies University of Toronto Bonus: Intranet Taxonomy Resource Centre

Controlled Vocabularies

Information Highways, April 6, 2005© Denise Bruno7 Definitions Controlled Vocabulary An indexing language, i.e., a standardized set of terms and phrases authorized for use in an indexing system to describe a subject area or information domain. A collection of preferred and non-preferred terms that are used to assist in more precise retrieval of content.

Information Highways, April 6, 2005© Denise Bruno8 Purpose Translation From natural language of authors and users into a vocabulary used for indexing and retrieval Consistency In the assignment of index terms Indication of Relationships Semantic relationships among terms Retrieval Searching aid in retrieval of documents (source: ANSI/NISO Z , Guidelines for the Construction, Format, and Management of Monolingual Thesauri)

Information Highways, April 6, 2005© Denise Bruno9 Types of Controlled Vocabularies dd (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno10 Pick List A list of words Most basic of controlled vocabularies No synonyms identified No guidance provided

Information Highways, April 6, 2005© Denise Bruno11 Synonym Ring A list of words to be treated as equivalent in meaning for the purposes of searching Every term in the ring in synonymous to the others

Information Highways, April 6, 2005© Denise Bruno12 Synonym Ring (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno13 Authority File Provides higher level of control than a synonym ring Designates one term as being preferred Includes references from synonyms, abbreviations, and acronyms to the preferred term AKA a subject list

Information Highways, April 6, 2005© Denise Bruno14 Authority File (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno15 Taxonomy Defines hierarchical relationships between the terms Goes from the general to the specific Strict taxonomy is a Genus/species relationship, i.e. “is a” relationship e.g. russet “is a” type of potato

Information Highways, April 6, 2005© Denise Bruno16 Taxonomy (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno17 Taxonomy “Taxis” – arrange, put in order “Onoma” – name Is the end result of the science, laws, or principles of classification

Information Highways, April 6, 2005© Denise Bruno18 Taxonomy (F rom Greek “taxis” meaning arrangement or division and “nomos” meaning law) is the science of classification according to a pre-determined system, with the resulting catalog used to provide a conceptual framework for discussion, analysis, or information retrieval. In theory, the development of a good taxonomy takes into account the importance of separating elements of a group (“taxon”) into subgroups (“taxa”) that are mutually exclusive, unambiguous, and taken together, include all possibilities. In practice, a good taxonomy should be simple, easy to remember, and easy to use. (source:

Information Highways, April 6, 2005© Denise Bruno19 Taxonomy “Structures that provide a way of classifying things – living organisms, products, books – into a series of hierarchical groups to make them easier to identify, study, or locate. Taxonomies consist of two parts – structures and applications. Structures consist of the categories (or terms) themselves and the relationships that link them together. Applications are the navigation tools available to help users find information.” (source: Jean Graef, Montague Institute)

Information Highways, April 6, 2005© Denise Bruno20 Thesaurus A type of controlled vocabulary that shows the following relationships among terms: hierarchical (e.g. parent-child BT, NT) associative (e.g. related RT) equivalent (e.g. synonymous U, UF) Also includes scope notes (definitions)

Information Highways, April 6, 2005© Denise Bruno21 Thesaurus (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno22 User Warrant Justification for the representation of a concept in an indexing language or for the selection of a preferred term because of frequent requests for information on the concept or free-text searches on the term by users of an information storage and retrieval system (source: ANSI/NISO Z , Guidelines for the Construction, Format, and Management of Monolingual Thesauri)

Information Highways, April 6, 2005© Denise Bruno23 Classification Classification refers to the systematic grouping of like things or objects into classes or categories according to some shared quality or characteristic. Implies the separation of things according to their degree of unlikeness. The term “classification” can refer either to the process of defining the categories and structure of a classification scheme or to the process of assigning documents to their appropriate categories. (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno24 Classification Scheme A scheme for arranging a collection of information in a hierarchical order using a controlled vocabulary to express the categories. Frequently referred to as a “taxonomy”. Also known as a file plan. (source: U of T, Professional Learning Centre, Intranet Taxonomy Resource Centre)

Information Highways, April 6, 2005© Denise Bruno25 Metadata Data about data “Metadata is structured information that describes, explains, locates, or otherwise makes is easier to retrieve, use, or manage an information resource.” (source: ITRC)

Information Highways, April 6, 2005© Denise Bruno26 Important A taxonomy describes the domain (e.g. subject) being used for classification, but is not itself metadata However, it can be used in metadata Does not address naming conventions for individual files (records) Separate policy/procedure

Information Highways, April 6, 2005© Denise Bruno27

Value of Taxonomies

Information Highways, April 6, 2005© Denise Bruno29 “…the primary motives for developing an internal taxonomy were to improve information access and to save time by streamlining the search process.” Taxonomies for Business:Access and Connectivity in a Wired World, TFPL Ltd.

Information Highways, April 6, 2005© Denise Bruno30 Information Environment Paper Facsimiles Electronic docs Chat boards White boards Legacy databases Instant messaging Intranet materials Internet materials Workflow Video Audio Microforms

Information Highways, April 6, 2005© Denise Bruno31 Information Environment No standards for info design or else too vague or incapable of being enforced Separate offices/divisions, many with own IT shops, build separate info systems Cultures of competitiveness or mistrust Legacy systems difficult to change Managers still looking for silver bullet

Information Highways, April 6, 2005© Denise Bruno32 Value of Taxonomies Identification – Controls the glut of information by filtering, categorizing and labeling information Navigation – Reduces the likelihood of becoming lost by moving along logical paths; facilitates browsing Discovery – Aids the serendipitous find, new associations via inference Searching – Provides context, reduces search time, improves search engine performance Delivery – Improves retrieval, for both browsing and free text searches

Types of Taxonomies

Information Highways, April 6, 2005© Denise Bruno34 Structural Model - Hierarchies Generic (Genus/Species) “is – a” kind of relationship Mutual exclusivity Strictest of hierarchies (source: Barbara Kwasnik, The Role of Classification in Knowledge Representation and Discovery, Library Trends, Summer 1999, pp.22-47) Eye Diseases Conjunctival Diseases Conjunctival Neoplasm Conjunctivitis Keratoconjunctivitis Corneal Diseases (from MeSH)

Information Highways, April 6, 2005© Denise Bruno35 Structural Model - Hierarchies Whole-Part Does not assume genus/species One-way flow of information Websites/directories Automobile Body Engine Block Pistons Valves Interior Upholstery

Information Highways, April 6, 2005© Denise Bruno36 Structural Model - Hierarchies Musical Instruments Stringed Percussion Instruments Pianos Polyhierarchical Concepts belong to more than one category

Information Highways, April 6, 2005© Denise Bruno37 Emphasis of Taxonomy Department Subject/Topic For a discrete body of knowledge Familiar to most users Product/Services Internal or external focus Audience User-centric Geography/Location

Information Highways, April 6, 2005© Denise Bruno38

Information Highways, April 6, 2005© Denise Bruno39

Information Highways, April 6, 2005© Denise Bruno40

Information Highways, April 6, 2005© Denise Bruno41

Information Highways, April 6, 2005© Denise Bruno42

Information Highways, April 6, 2005© Denise Bruno43

Information Highways, April 6, 2005© Denise Bruno44

Information Highways, April 6, 2005© Denise Bruno45 Emphasis of Taxonomy Function Functions represent the major responsibilities that are managed by the organization to fulfill its goals Source of information Government of Canada, Information Management Services, BASCS (Business Activity Structure Classification System) (

Information Highways, April 6, 2005© Denise Bruno46 Function Taxonomy Example Collection Part Section Primary Secondary Collection 2: ABC Company Management Part 3: Financial Management Section 05: Financial Reporting and Auditing Primary 03: Audit Working Papers ( ) Secondary 01: Audit Confirmations ( ) Whole-Part Example: Function-based

Information Highways, April 6, 2005© Denise Bruno47 “Though figuring out where to start can be frustrating, a good taxonomy is recognized as a central part of a knowledge management system.” Thomas Trimmer President, GrapeVine Technologies

Taxonomy Development

Information Highways, April 6, 2005© Denise Bruno49 High-level Overview Domain and Purpose Rules Data Gathering Develop Draft Taxonomy Consult & Test Refine & Finalize Document Train & Educate Users Ensure Continued Development

Information Highways, April 6, 2005© Denise Bruno50 IMPORTANT! Project  Process There is no “end”. A taxonomy is never “finished”.

Information Highways, April 6, 2005© Denise Bruno51 Denise Bruno Associate CONDAR Consulting Inc