Module 9a: Classification Schemes

Module 9a: Classification Schemes IMT530: Organization of Information Resources Winter, 2007 Michael Crandall

Recap
- Hierarchical and faceted approaches are not mutually exclusive
- You can use hierarchies under facets to help with entry vocabulary and cross-references
- You may not always be able to apply mutual exclusion and exhaustivity to facets, but you should use these principles to help clarify them
- Spiteri’s Idea Plane is where you do this work
- Try to apply terms from all facets to each object (webpage) you’re tagging to see what happens
- If it doesn’t make sense, you probably need to rethink your facets

Module 9a Outline
- What is a classification system?
- Types of classifications
- Characteristics of classifications
- Taxonomies
- Classification vs. categorization
- Purposes of classification

What Makes a Classification?
- A verbal description of the concepts represented in the scheme
- An arrangement of these descriptions in a classed or logical order for users (schedules)
- A notation that shows the logical order
- References to guide users to related aspects in the scheme
- An alphabetical index that leads to the notations
- Instructions for use
- A maintenance organization

Schedules (hierarchies) are the sum of the hierarchies devoted to a particular discipline; a discipline may consist of more than one hierarchy. Schedules represent a set of hierarchical relationships.

Notation indicates the meaning of a particular class (e.g., 677 means “Textiles” in DDC; its meaning also includes its context, which is Technology / Manufacturing). Notation is frequently made up of numbers, or it may be a combination of numbers, letters, and punctuation.

The third part of a traditional classification scheme is the alphabetical index to the schedules. An index that covers an entire classification scheme is called a “relative index.” DDC and UDC have relative indexes; LCC does not, but it has a separate index for each schedule.
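The three parts described above (schedule, notation, alphabetical index) can be sketched as a small data structure. This is a toy illustration only: the three DDC entries come from the 677 example on the slide, while the names `SCHEDULE`, `context`, and `build_index` are hypothetical and not part of any real classification tool.

```python
# Toy schedule: notation -> (caption, parent notation).
# The three entries follow the slide's DDC example for 677.
SCHEDULE = {
    "600": ("Technology", None),
    "670": ("Manufacturing", "600"),
    "677": ("Textiles", "670"),
}


def context(notation):
    """Return the captions from the top of the hierarchy down to `notation`."""
    captions = []
    while notation is not None:
        caption, parent = SCHEDULE[notation]
        captions.append(caption)
        notation = parent
    return list(reversed(captions))


def build_index(schedule):
    """Build an alphabetical index that leads from a caption to its notation."""
    pairs = ((caption.lower(), notation) for notation, (caption, _) in schedule.items())
    return dict(sorted(pairs))


INDEX = build_index(SCHEDULE)

print(" / ".join(context("677")))  # the class in its full context
print(INDEX["textiles"])           # index entry leading to the notation
```

The parent pointers stand in for the schedule's hierarchical relationships; a real scheme would also carry references, instructions, and many more entries.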

Two Types of Classification Systems
- Classed: e.g., classification schemes such as Dewey Decimal Classification (DDC)
- Alphabetico-classed: e.g., taxonomies such as Yahoo’s categories
- Both share common characteristics:
  - Classes: the smallest component of a subject hierarchy (each node is a class, composed of subclasses)
  - Arrays: all “sister” classes that have the same “mother”; e.g., “roller chairs” and “straight back chairs” make up one array
  - Chains: a string of classes that moves up and down a hierarchy (the breadcrumb trail on the web)
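Classes, arrays, and chains can be made concrete with a small tree. The sketch below reuses the two chair classes from the slide; the parent classes "chairs" and "furniture" and the sister class "tables" are assumptions added only to give the hierarchy some depth.

```python
# Each class points to its "mother" class; None marks the top of the hierarchy.
# "roller chairs" and "straight back chairs" come from the slide; the rest is assumed.
PARENT = {
    "furniture": None,
    "chairs": "furniture",
    "tables": "furniture",
    "roller chairs": "chairs",
    "straight back chairs": "chairs",
}


def array_of(cls):
    """Return the array: every sister class that shares `cls`'s mother."""
    mother = PARENT[cls]
    return sorted(c for c, p in PARENT.items() if p == mother)


def chain_of(cls):
    """Return the chain: the string of classes from the top down to `cls`."""
    chain = []
    while cls is not None:
        chain.append(cls)
        cls = PARENT[cls]
    return chain[::-1]


print(array_of("roller chairs"))
print(" > ".join(chain_of("roller chairs")))  # reads like a breadcrumb trail
```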

Classed Systems: Definition
- Classed systems use notation as terms
- Notation frequently includes combinations of numbers, letters, and punctuation
- Notations always have a meaning, which is frequently not understandable from the notation itself (e.g., the DDC number 517 has a meaning that the digits alone do not reveal)
- Notation may be expressive or hospitable
- The Library of Congress Classification (LCC) is an example of a classed system

Expressiveness refers to how much you can tell about the placement and relationships of a class by looking at its notation, as in Dewey, which uses decimals to show depth in the classification structure. LCC’s notation is very unexpressive: you can’t tell much about a class from the letters and numbers used to identify it.

Hospitality refers to the ability to expand the notation, either in chains or in arrays. DDC is very hospitable in chain (because it is decimal based) but not in array (you can only ever have 10 sisters in an array).
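Expressiveness and hospitality in a decimal notation can be illustrated with a few lines of code. This is a simplified sketch that treats a DDC number purely as a string of digits; it ignores the special rules real DDC numbers follow, and the function names are hypothetical.

```python
def significant_digits(notation):
    """Strip the decimal point and the zero padding of three-digit numbers."""
    digits = notation.replace(".", "")
    if "." not in notation:
        digits = digits.rstrip("0")  # "700" -> "7", "740" -> "74"
    return digits


def to_notation(digits):
    """Pad to three digits and put the decimal point after the third digit."""
    padded = digits.ljust(3, "0")
    return padded if len(padded) == 3 else padded[:3] + "." + padded[3:]


def chain(notation):
    """Expressiveness: each broader class is readable from a shorter prefix."""
    digits = significant_digits(notation)
    return [to_notation(digits[:i]) for i in range(1, len(digits) + 1)]


def possible_sisters(notation):
    """Hospitality in array: a class has room for at most ten one-digit subclasses."""
    digits = significant_digits(notation)
    return [to_notation(digits + str(k)) for k in range(10)]


print(chain("745.542"))                 # depth is visible in the notation
print(len(possible_sisters("745.54")))  # the array can never exceed ten
```

A chain can always grow by one more digit, which is the sense in which the notation is hospitable in chain; the fixed base of ten is what limits the array.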

Classification Example: DDC

The meaning of a number is created by the context of the hierarchy. For example, “inherent features” means “inherent features of photographs”; this is implied by the hierarchical structure.

Alphabetico-Classed Systems: Definition
- Alphabetico-classed systems are classifications that use words as terms instead of notation
- As in classed systems, hierarchical structure and meaning are inherent in each term
- Because the terms are made of words, they are usually understandable without interpretation by an expert

Alphabetico-classed Systems
- Web-based taxonomies are a current example of alphabetico-classed systems
- However, alphabetico-classed systems have a long history of use in knowledge organization, in encyclopedias and library catalogs
- They did not work well in most manual environments because subjects quickly got buried and were difficult to find if you didn’t know the hierarchy in which they were embedded

Contrasting Alphabetico-classed and Classed Terms
- Sample alphabetico-classed term for papier-mâché: Arts and Humanities > Visual Arts > Sculpture > Papier Maché
- Sample DDC classification number (term) for papier-mâché: 745.542 (its meaning, or hierarchy, in Dewey is: Arts > Drawing and Decorative Arts > Decorative Arts > Handicrafts > In papers > Papier-mâché)

Classifications are Pre-Coordinated
- Notice that in the two papier-mâché examples, each term carries the meaning of an entire hierarchy
- Because of this, all classifications are pre-coordinated: almost every term contains multiple concepts, in that it carries with it the concepts inherent in an entire subject hierarchy

Classification in Libraries
- Driven by the need to arrange materials and provide systematic access
- Traugott Koch’s types of classification schemes reflect this:
  - Universal schemes: DDC, UDC, LCC
  - National schemes
  - Subject-specific schemes: NLM, EI
- The issues discussed in Taylor relate to this function of classification

Snoopy
- Bertolucci points out the need to move beyond the general classification schemes used in libraries for most local applications
- Points out that local schemes have different drivers and different user populations
- They are usually constrained by narrower needs
- They are often built for temporary use
- These smaller classification structures are often lumped under the term “taxonomies”
- They are most likely to be the model you will encounter

Yahoo Taxonomy
- Yahoo Directory is a large taxonomy: http://dir.yahoo.com
- Although each term in the taxonomy stands for a particular subject or class (e.g., “Disabled pets”), the term itself carries the meaning or context of the entire domain in which it is situated (“Science > Biology > Zoology > Animals, Insects, and Pets > Pets > Disabled Pets”)
- Cross-references to related classes outside the current sequence are indicated by the @ symbol (similar to a “see also” reference in a controlled vocabulary)
- In web taxonomies, you can either browse to a term embedded in a hierarchy (e.g., disabled pets) or search on it to retrieve it immediately
- Try these explorations on your own to see how a large taxonomy works:
  - Go to Yahoo Directory (http://dir.yahoo.com)
  - Click on “Science”
  - Click on “biology”
  - Click on “zoology”
  - Click on “animals, insects, and pets”
  - What kind of category is “Complete list of animals by name”?
  - Under additional categories, click on “Pets”
  - Again, what kind of category is “by Animal”?
  - Click on “disabled pets” and notice the taxonomic string at the top beginning with “Directory>…”
  - Go back to the main directory page
  - Click on “Science” again
  - Click this time on “Animals, Insects, and Pets@” (note the @!)
  - Look at the taxonomic string (term) at the top of your screen. It is: Science > Biology > Zoology > “Animals, Insects, and Pets”, so the @ indicates a class shown outside its hierarchical position, used as a reference (check out Booksellers under Science: what is its hierarchical position?)
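The behaviour of the @ cross-reference can be modelled in a few lines. The sketch below is an assumption-laden toy, not how Yahoo Directory was implemented: it stores only the one path quoted on the slide, and the names `PARENT`, `CROSS_REFS`, `term`, `listing`, and `click` are hypothetical.

```python
# Canonical position of each class (child -> parent), taken from the path on the slide.
PARENT = {
    "Science": None,
    "Biology": "Science",
    "Zoology": "Biology",
    "Animals, Insects, and Pets": "Zoology",
    "Pets": "Animals, Insects, and Pets",
    "Disabled Pets": "Pets",
}

# Classes listed under another class as a reference only (shown with a trailing @).
CROSS_REFS = {"Science": ["Animals, Insects, and Pets"]}


def term(cls):
    """Return the full taxonomic string, which carries the whole hierarchy."""
    path = []
    while cls is not None:
        path.append(cls)
        cls = PARENT[cls]
    return " > ".join(reversed(path))


def listing(cls):
    """Return what a user sees under `cls`: true subclasses plus @ references."""
    children = [c for c, p in PARENT.items() if p == cls]
    refs = [ref + "@" for ref in CROSS_REFS.get(cls, [])]
    return sorted(children + refs)


def click(label):
    """Following a label, with or without @, always lands on the canonical term."""
    return term(label.rstrip("@"))


print(listing("Science"))
print(click("Animals, Insects, and Pets@"))
print(term("Disabled Pets"))  # what a direct search would retrieve
```

Clicking the @ entry under "Science" yields the same string as browsing down through Biology and Zoology, which is why the breadcrumb at the top of the page does not match the path the user clicked.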

Classification vs. Categorization
- Classification is different from categorization
- Categorization is grouping based on common characteristics (ordering and establishing hierarchical relationships are not necessarily involved)
- Categorization is used often in the information professions: pamphlet files, vertical files, and websites
- If there are very few files and all are visible, there is usually no particular need for ordering or hierarchical structuring

Sample categorization – the “Idea Index”

Ways of Building Classifications
- Bottom-up: classes are developed empirically, through observation of cases (sometimes called taxonomy; in LIS, bottom-up systems are often a result of literary warrant policies)
- Top-down: classes are developed conceptually; observation of cases is not involved (sometimes called typology)

Purposes of Classification Systems
- To group topics or physical items on the same subject together (collocation) to promote browsing
  - Classification by discipline or domain makes it possible to browse in the context of that discipline or domain
  - Browsing helps us become aware of items that we did not previously know about
- To fix a topic or item at a particular location and context in the universe of knowledge
  - All items are placed in context with all other items: each item has a location that may be contrasted or compared with the location of every other item

Purposes of Classification Systems
- Placing an item in a classification gives us rich information about that item and its context
- James Welch, Fools Crow (assigned the LCC class number PS3573.E44)
  - PS3573.E44 = American Literature, 1960-, “W” authors, James Welch
- In a taxonomy, Welch’s novel might be placed in this kind of context: American literature > 20th century > Northwest fiction
- In a physical arrangement, a person can learn something about Fools Crow from the items it sits next to
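The way a call number packs context into its parts can be sketched with a small parser. This is a minimal illustration that handles only the simple letters + number + Cutter shape of the example above; real LCC call numbers have more variants, and the regular expression and function name are assumptions made for this sketch.

```python
import re

# One to three class letters, a class number, then a Cutter (letter + digits).
CALL_NUMBER = re.compile(r"^([A-Z]{1,3})\s*(\d+(?:\.\d+)?)\s*\.([A-Z]\d+)$")


def parse_call_number(text):
    """Split a simple LCC-style call number into its three components."""
    match = CALL_NUMBER.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized call number: {text!r}")
    letters, number, cutter = match.groups()
    return {"class_letters": letters, "class_number": number, "cutter": cutter}


parts = parse_call_number("PS 3573.E44")
print(parts["class_letters"])  # the slide glosses this part as American Literature
print(parts["class_number"])   # glossed as 1960-, "W" authors
print(parts["cutter"])         # glossed as James Welch
```

Unlike a decimal notation, none of these parts reveals its meaning on inspection; the gloss has to be looked up in the schedules, which is what the earlier slide means by calling LCC notation unexpressive.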

More Purposes of Classification
- To provide a location device, based on subject content, that helps users locate a particular item
  - In libraries, classification numbers are used for storing physical documents and for returning them to their place after they have been removed
  - In taxonomies, you can guide users to similar content through breadcrumb trails
- Note that these purposes are suspiciously familiar: they are essentially Cutter’s Objects of the Catalog

Questions?

Exercise 9a
- Spend the next 45 minutes exploring the examples in Exercise 9a
- Ask questions and talk!!!
- Be sure to hand in completed work at the end of class for credit!!!