Multimedia Information Retrieval

Slides:



Advertisements
Similar presentations
Relevance Feedback User tells system whether returned/disseminated documents are relevant to query/information need or not Feedback: usually positive sometimes.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Chapter 5: Introduction to Information Retrieval
Text Databases Text Types
Multimedia Database Systems
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Search Strategies Online Search Techniques. Universal Search Techniques Precision- getting results that are relevant, “on topic.” Recall- getting all.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Computer comunication B Information retrieval. Information retrieval: introduction 1 This topic addresses the question on how it is possible to find relevant.
Evaluating the Performance of IR Sytems
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Vector Space Model CS 652 Information Extraction and Integration.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
CH 11 Multimedia IR: Models and Languages
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Visual Information Systems visual information retrieval.
1 CS 502: Computing Methods for Digital Libraries Lecture 11 Information Retrieval I.
Chapter 5: Information Retrieval and Web Search
Multimedia Information Retrieval
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Multimedia Information Retrieval
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
1 CS 430: Information Discovery Lecture 3 Inverted Files.
Chapter 6: Information Retrieval and Web Search
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Information Retrieval Model Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Survey of Approaches to Information Retrieval of Speech Message Kenney Ng Spoken Language Systems Group Laboratory for Computer Science Massachusetts Institute.
Chapter 23: Probabilistic Language Models April 13, 2004.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Knowledge and Information Retrieval Dr Nicholas Gibbins 32/4037.
Information Retrieval in Practice
Information Retrieval in Practice
An Efficient Algorithm for Incremental Update of Concept space
Visual Information Retrieval
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Modern Information Retrieval
Introduction Multimedia initial focus
Text Based Information Retrieval
Information Retrieval and Web Search
CS 430: Information Discovery
Naming and Directories
Information Retrieval
Naming and Directories
Naming and Directories
Information Retrieval
Representation of documents and queries
Introduction into Knowledge and information
موضوع پروژه : بازیابی اطلاعات Information Retrieval
Evaluation of IR Performance
Multimedia Information Retrieval
CSE 635 Multimedia Information Retrieval
Chapter 5: Information Retrieval and Web Search
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Spreadsheets, Modelling & Databases
Ying Dai Faculty of software and information science,
Retrieval Utilities Relevance feedback Clustering
Naming and Directories
Information Retrieval and Web Design
Information Retrieval and Web Design
Presentation transcript:

Multimedia Information Retrieval Unlike alphanumeric data, multimedia data do not have any semantic structure Achieving symmetry between annotation and query is difficult Retrieval is based on similarity between query and stored information instead of exact match Stored information is represented using indexing

IR Model Information is preprocessed to extract features and semantic contents Indexed based on these features and semantics User’s query is processed and main features are extracted Query’s features are then compared with features or index of each information item in the database Information item whose features are similar to those of the query are retrieved and presented to the user

Design Issues Indexing a mechanism that reduces the search space of an operator without losing any relevant information Similarity Computation easy to compute and should conform to human judgement

Performance Measures Retrieval speed, recall, precision Recall measures the ability of retrieving relevant information items from the database defined as the ratio between the number of retrieved relevant items and the total number of relevant items in the database Precision measures retrieval accuracy defined as the ratio between the number of retrieved relevant items and the number of total retrieved items Recall and precision are usually considered together high recall and low precision high precision and low recall

Text Retrieval Text may be used to annotate other media such as audio, images and video and conventional IR techniques used to retrieve multimedia information Boolean IR systems or text-pattern search systems Substantial effort is spent in analyzing the contents of the documents and in generating keywords and indices Boolean queries are keywords connected with logical operators (AND, OR, NOT)

File Structures Flat files Inverted files for each term a separate index is constructed that stores the document identifiers for all documents containing the term each term and the document IDs containing the term are organized into one row searching and retrieval is fast because only rows containing the query terms need to be retrieved and there is no need to search the whole database

Extensions Nearness parameters used in query specification help define the topic more precisely and therefore increase probable relevance of the retrieved item Within Sentence and Adjacency specification in queries Term location information is included in the inverted file Term i : document id, paragraph no., sentence no., word no. For example, if an inverted file has the following entries: information: R99, 10, 8, 3; R155, 15, 3, 6; R166, 2, 3,1 retrieval: R77, 9, 7, 2; R99, 10, 8, 4; R166, 10, 2, 5

Indexing Stop words -- grammatical functional words, such as “of,” “the,” and “a.” Stemming -- reducing words to a common root form Thesaurus -- list of synonyms Weighting -- term significance derived from occurrence frequency within a document and among different documents

Relevance Feedback Query modification Document modification terms occurring in documents previously identified as relevant are added to the original query or the weight of such terms is increased terms occurring in documents previously identified as irrelevant are deleted from the query or the weight of such terms is reduced Document modification terms in the query, but not in the user-judged relevant documents, are added to the document index list with an initial weight weights of index terms in the query and also in relevant documents are increased by a certain amount weights of index terms not in the query but in the relevant documents are decreased by a certain amount

Audio Search and Retrieval Keywords can be highly subjective because of a different perspective or even a different taxonomy Hard to browse directly since it must be heard in real-time (unlike video which can be keyframed) Two categories : Speech and Non-speech with speech, indexing and retrieval is based on obtaining spoken words either manually or by speech recognition technique with non-speech, indexing and retrieval may be based on text annotation (but will it help a query like “find the first occurrence of the note G-sharp.”)