Download presentation
Presentation is loading. Please wait.
Published byAya Abdo Modified over 7 years ago
1
Information Storage and Retrieval Fall 2017-2018 Lecture 1: Introduction and History
2
Lecture Overview Introduction to the Course Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
3
Lecture Overview Introduction to the Course (re)Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
4
Introduction to Course Grading: Final Paper 60% Class Work 30% Midterm Exam 10% Participation in class discussions and presentations is also expected.
5
Course Outline Week 1 :introduction to information retrieval Week 2 : Basic IR system components Week 3 : IR capabilities Week4 : Boolean model Week 5 :Inverted index: dictionary and posting lists Week 6 : Index construction Week 7 : Term weighting Week 8 : Vector space model Week 9 : Scoring and Ranking Week 10 : IR evaluation Week 11 : Relevant feedback Week 12 : Web Search Basics and conclusion
6
Readings INFORMATION STORAGE AND. RETRIEVAL SYSTEMS. Theory and Implementation. Second Edition. Gerald J. Kowalski. Information Retrieval Architecture and Algorithms. Kowalski, Gerald, 2011, springer.
7
Lecture Overview Introduction to the Course Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
8
WHAT DO YOU THINK IS INFORMATION RETRIEVAL? 8 ?? INFORMATION RETRIEVAL
9
Information Retrieval Information Retrieval (IR) is finding material of an unstructured nature that satisfies an information need from within large collections.
10
Introduction Goal of IR is to retrieve all and only the “relevant” documents in a collection for a particular user with a particular need for information.
11
Information Retrieval The goal is to search large document collections to retrieve small subsets relevant to the user’s information need Examples are: Internet search engines (Google, Yahoo! web search, etc.) Digital library catalogues (MELVYL, GLADYS)
12
Other IR tasks Clustering: Given a set of docs, group them into clusters based on their contents. Classification: Given a set of topics, plus a new doc D, decide which topic(s) D belongs to (eg spam-nospam). Information Extraction: Find all snippets dealing with a given topic (e.g. company merges) Question Answering: deal with a wide range of question types including: list, definition, How, and Why questions. Opinion Mining: Analyze/summarize sentiment in a text (e.g. TripAdvisor) All the above, applied to images, video, audio 12
14
Lecture Overview Introduction to the Course (re)Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
15
The Standard Retrieval Interaction Model
16
IR is an Iterative Process Repositories Workspace Goals
17
IR is a Dialog The exchange doesn’t end with first answer. Users can recognize elements of a useful answer, even when incomplete. Questions and understanding changes as the process continues.
18
Lecture Overview Introduction to the Course (re)Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
19
IR History Overview Non-Computer IR (mid 1950’s) Interest in computer-based IR from mid 1950’s Modern IR – Large-scale evaluations, Web-based search and Search Engines -- 1990’s
20
Computer-Based Systems Bagley’s 1951 MS thesis from MIT suggested that searching 50 million item records, each containing 30 index terms would take approximately 41,700 hours
21
Lecture Overview Introduction to the Course (re)Introduction to Information Retrieval The Information Seeking Process Information Retrieval History and Developments Discussion
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.