Scanned Documents LBSC 796/INFM 718R Douglas W. Oard Week 8, October 29, 2007.

Slides:



Advertisements
Similar presentations
Relevance Feedback Limitations –Must yield result within at most 3-4 iterations –Users will likely terminate the process sooner –User may get irritated.
Advertisements

Review of AI from Chapter 3. Journal May 13  What advantages and disadvantages do you see with using Expert Systems in real world applications like business,
Sociological Abstracts Author search with composite name University Library next = click.
Internet Research Techniques Graham Seibert Copyright 2006 This is a segment of the draft version of a large syllabus. I need your feedback to improve.
THE DEFINITION OF “READING” IN GRADUATE SCHOOL Dr. Nicole Benedek.
How does a web search engine work?. search  google (started 1998 … now worth $365 billion)  bing  amazon  web, images, news, maps, books, shopping,
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Information Retrieval Review
ISP 433/533 Week 2 IR Models.
April 23, 2001LBSC 878 Text Data Mining Douglas W. Oard.
INFORMATION ORGANIZATION LAB NOVEMBER 10, 2009 LAST WEEK ON IO LAB We thought you should see these one more time.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Searching with Lucene Chapter 2. For discussion Information retrieval What is Lucene? Code for indexer using Lucene Pagerank algorithm.
Evidence from Behavior LBSC 796/INFM 719R Douglas W. Oard Session 7, October 22, 2007.
INFO 624 Week 3 Retrieval System Evaluation
Information Access Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies Design Understanding.
Search Engines Session 11 LBSC 690 Information Technology.
Scanned Documents LBSC 796/INFM 718R Douglas W. Oard Week 8, March 30, 2011.
Search is not only about the Web An Overview on Printed Documents Search and Patent Search Walid Magdy Centre for Next Generation Localisation School of.
4 OFFICE WEEKLY MEETING Why 4 Office DMS?. Challenge Companies today are overwhelmed with information that comes to them on many formats: , electronic.
CC 2007, 2011 attrbution - R.B. Allen Text and Text Processing.
1 The BT Digital Library A case study in intelligent content management Paul Warren
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
ArchiveGrid home page ArchiveGrid search results.
Boring People Can Make Spectacular Discoveries
Similar Document Retrieval and Analysis in Information Retrieval System based on correlation method for full text indexing.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Text Based Information Retrieval Text Based Information Retrieval H02C8A H02C8B Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven.
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
Relevance Feedback Hongning Wang What we have learned so far Information Retrieval User results Query Rep Doc Rep (Index) Ranker.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
Voting with Their Fingers: What Research Libraries Can Learn from User Behavior Anne R. Kenney Columbia Reference Symposium March 2004.
9 July 2015 Currie Colket Slides at: Genealogical Searching for Information on.
Modern Information Retrieval Presented by Miss Prattana Chanpolto Faculty of Information Technology.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Recuperação de Informação Cap. 01: Introdução 21 de Fevereiro de 1999 Berthier Ribeiro-Neto.
Information Retrieval
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
Problem Query image by content in an image database.
ONLINE SEARCH AND REDACTION SYSTEM Many concepts of digitalization which aim is to present datas on internet are faced with two main subjects and problems:
1 Pioneer Investments Legal and Compliance System Assessment Weekly Status Update June 23, 2005.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Archiving.Net® Document Management System rchiving.Net® is a bi-lingual (Arabic/English) document management system that lets you capture, index, organize,
Lucene Jianguo Lu.
Relevance Feedback Hongning Wang
November 8, 2005NSF Expedition Workshop Supporting E-Discovery with Search Technology Douglas W. Oard College of Information Studies and Institute for.
INPUT DEVICES. Keyboard & Mouse  Keyboard: Enter text and commands  Mouse: Point, Select & enter Commands.
Capture This! PO105 James Green. Table of Contents Capture Overview Laserfiche Tools Case Scenarios Questions and Answers.
Improving E-Book Access via a Library Developed Full-Text Search Tool Jill E. Foust, MLS Phillip Bergen, MA, MS Gretchen L. Maxeiner, MA, MS Health Sciences.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval Representation  Retrieval Thanks for David Doermann for most of these.
Topicality “That sounds good. That’s a good skill to have.” –Julia Marshall “Naw. Advantages don’t matter when it comes to Topicality.” –Humza Tahir.
How to make a brochure Go to Microsoft word 2007 and select any online template on brochure.
Xiaoying Sharon Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Scanning to Google Drive and Docs™ with ccScan®
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Digital Video Library - Jacky Ma.
Rapidly Retargetable Translingual Detection
Relevance Feedback Hongning Wang
WIRED Week 2 Syllabus Update Readings Overview.
What Shapes Can You See in this Picture?
Automatic Speaker Identification Using Sentinel Word Discrimination
Authors: Peiling Wang and Dagobert Soergel Reviewer: Douglas W. Oard
موضوع پروژه : بازیابی اطلاعات Information Retrieval
CSE 635 Multimedia Information Retrieval
CS246: Leveraging User Feedback
Retrieval Utilities Relevance feedback Clustering
Recuperação de Informação
Presentation transcript:

Scanned Documents LBSC 796/INFM 718R Douglas W. Oard Week 8, October 29, 2007

Expanding the Search Space Scanned Docs Identity: Harriet “… Later, I learned that John had not heard …”

High Payoff Investments Searchable Fraction Transducer Capabilities OCR MT Handwriting Speech

The Big Picture Find the words Index the words Do ranked retrieval Use that system to find what you want

Some Issues Language-based search without language! –Shape codes Accuracy-selection effect of ranked retrieval –Poor recognition scatters in the query-term space Blind relevance feedback – Based on clean text Image-domain summaries

Some Applications Case management for litigation Duplicate detection for declassification productivity and anti-tiling Knowledge management from everything I have ever xeroxed or faxed

Some Applications Legacy Tobacco Documents Library – Google Books – George Washington’s Papers –