Automatic vs manual indexing

Presentation transcript:

Automatic vs manual indexing
Focus on subject indexing
Not a relevant question?
–Wherever full text is available, automatic methods predominate
   Simple inverted index of text words
   More sophisticated vector-based models with weighting/ranking facilities
   Assignment systems which map to controlled terms
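
As a rough illustration of the first two automatic methods named on this slide, the sketch below builds a simple inverted index of text words and then ranks documents with a TF-IDF weight. It is not taken from the presentation: the sample documents, the function names and the unnormalized scoring are assumptions chosen to keep the example short.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def build_inverted_index(docs):
    """Map each word to a dict of {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for word, tf in Counter(tokenize(text)).items():
            index[word][doc_id] = tf
    return index


def rank(query, index, n_docs):
    """Score documents by summed tf * idf over the query words.

    This is a simplified weighting scheme (no length normalization),
    used here only to show the idea of weighted ranking.
    """
    scores = defaultdict(float)
    for word in tokenize(query):
        postings = index.get(word, {})
        if not postings:
            continue  # word does not occur in any document
        idf = math.log(n_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    # Highest score first; ties broken by document id
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical sample collection
docs = {
    "d1": "manual indexing of library documents",
    "d2": "automatic indexing of full text",
    "d3": "ranking documents by weight",
}
index = build_inverted_index(docs)
results = rank("automatic indexing", index, len(docs))
```

Here `d2` ranks above `d1` because it contains both query words, and "automatic" is rarer in the collection than "indexing", so it carries more weight.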

Automatic vs manual indexing
Manual indexing
–Gather documents across languages, vocabularies etc.
–Adapt to retrieval needs of particular user groups
–Offer vocabulary assistance to users in the search process
–Adapt to needs for varying degrees of specificity
–Allow consistent retrieval over time
–Allow for navigation, hierarchically or to related topics

Automatic vs manual indexing
User problems
–Express need (problem) in terminology appropriate to
   Problem
   System terminology
Changing user habits
–Searching vs. browsing

Automatic vs manual indexing
Problems with manual indexing
–Cost
–Capacity
–Consistency & quality
–Mapping between systems
–Constructing & maintaining vocabularies

Automatic vs manual indexing
Integration automatic / manual
–Automatic categorization
–Term mapping ("topics")
–Ranking mechanisms
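
One way to picture term mapping is a lookup from free-text words to the controlled terms of a manually built vocabulary, with the matched terms ranked by how often they are hit. The sketch below is only an assumption about how such a step might look: the mapping table, the term names and the `assign_topics` function are invented for illustration and do not come from the presentation.

```python
import re
from collections import Counter

# Hypothetical mapping from text words to controlled vocabulary terms
CONTROLLED_TERMS = {
    "car": "Automobiles",
    "cars": "Automobiles",
    "automobile": "Automobiles",
    "doctor": "Physicians",
    "physician": "Physicians",
}


def assign_topics(text, mapping=CONTROLLED_TERMS):
    """Return the controlled terms matched in the text, most frequent first."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(mapping[w] for w in words if w in mapping)
    # Sort by descending count, then alphabetically for stable output
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ordered]


topics = assign_topics("The doctor bought two cars and another car")
```

A real assignment system would need stemming, phrase matching and disambiguation; the point here is only that the vocabulary is maintained by hand while the assignment itself is automatic.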