HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Search Engines and Information Retrieval
Distributed Search over the Hidden Web Hierarchical Database Sampling and Selection Panagiotis G. Ipeirotis Luis Gravano Computer Science Department Columbia.

LOCATING RESOURCES  Get to know the library  Know the library’s technology.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Information Retrieval in Practice
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.
How do I know the differences and uses of keyword versus subject searching in a database?
Library Instruction in North America Library Orientation (before 1980) –Tour of library, instruction in using card catalog, print indexes, reference works.
Quality-Aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim, Aixin Sun, and Roger H. L. Chiang. In Proceedings.
What difference a good tool? using Endeca for a faceted catalog Emily Lynema NCSU Libraries ACRL Delaware Valley Chapter Fall Program November 3, 2006.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
The Research Process Mr. Burt—Southwest HS—El Centro, CA.
Search Engines and Information Retrieval Chapter 1.
Multimedia Databases (MMDB)
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Clustering User Queries of a Search Engine Ji-Rong Wen, Jian-YunNie & Hon-Jian Zhang.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
TRECVID 2004 Search Task by NUS PRIS Tat-Seng Chua, et al. National University of Singapore.
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 8.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
NCSU Libraries Andrew Pace & Emily Lynema NCSU Libraries May 24, 2006.
IL Step 3: Using Bibliographic Databases Information Literacy 1.
1 Relevance Ranking in the Scholarly Domain Dr. Tamar Sadeh LIBER Conference Tartu, Estonia, June 2012 Dr. Tamar Sadeh LIBER Conference Tartu, Estonia,
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
Search Engine Architecture
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
21/11/20151Gianluca Demartini Ranking Clusters for Web Search Gianluca Demartini Paul–Alexandru Chirita Ingo Brunkhorst Wolfgang Nejdl L3S Info Lunch Hannover,
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Mr. P’s Class Term Paper All the Steps on the Path to an “A” Term Paper in World History.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Implementation of a faceted catalog search solution Kristin Antelman & Emily Lynema NCSU Libraries Feb. 7, 2006.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Survey on Long Queries in Keyword Search : Phrase-based IR Sungchan Park
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Graphic Organizers EDUC 307. Graphic Organizers  Graphic organizers are mental maps that represent key skills like sequencing, comparing and contrasting,
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
A Faceted Interface to the Library Catalog Tito Sierra NCSU Libraries ALA Midwinter Meeting January 20, 2007.
Selecting Relevant Documents Assume: –we already have a corpus of documents defined. –goal is to return a subset of those documents. –Individual documents.
MULTIMEDIA SYSTEMS CBIR & CBVR. Schedule Image Annotation (CBIR) Image Annotation (CBIR) Video Annotation (CBVR) Video Annotation (CBVR) Few Project Ideas.
Searching the Web for academic information Ruth Stubbings.
Theory, Tools, History: A Brief Introduction August 17, 2016.
Information Retrieval in Practice
Information Organization: Overview
Visual Information Retrieval
Search Engine Architecture
Multimedia Information Retrieval
Introduction into Knowledge and information
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Panagiotis G. Ipeirotis Luis Gravano
Search Engine Architecture
Information Retrieval and Web Design
Information Organization: Overview
Information Retrieval and Web Design
Introduction to Search Engines
Presentation transcript:

HYP Progress Update By Zhao Jin

Outline Background Progress Update

Background Query (Text-based) –The set of keywords to be entered into the system to retrieve the desired information or resources –Main category Traditional IR Web (ex. Google) OPAC (ex. LINC) Video (ex. TRECVID)

Background Query Analysis –To analyze the pattern and hidden information in the queries –To efficiently classify and support such queries.

Progress update Mid-May to Early June –Background reading –Around 30 to 40 papers on various topic –Summarizing of key points in the paper

Progress update Mid-June to late-June –Log analysis BBC Video Query NUS OPAC Query –Background reading on OPAC and TRECVID

Progress update July to now –Follow up on two main topics Query classification and division on content-based and feature-based keywords (OPAC)Query classification and division on content-based and feature-based keywords Identifying ASR-oriented keywords in a video query (TRECVID)Identifying ASR-oriented keywords in a video query –Background reading on MARC, wordnet and LOC subject heading

Progress update Plan for the near future –Refine and experiment with the current ideas –Log analysis –Background reading (Textbook & Related paper) –Preparation for implementation

Q&A?

End of progress update Thank you for your attention!

Two types of keywords Content-Based Keyword (CBK) –The keywords that concern what the item is about –Ex. title, subject heading, etc Feature-Based Keyword (FBK) –The keywords that concern the features of the item. –Ex. author, publisher, genre, medium

Benefits Benefits: –Faster retrieval –More precise retrieval –Help in relevance ranking

Possible implementation Possible implementation: –term co-occurrence for concept division –list of special words and machine learning for FBK and CBK division –wordnet for classification among CBKs

Possible implementation Possible implementation: –CL and IL search algorithms for actual searching with CBKs. –list of special words and machine learning for classification among FBKs. –Marc record search algorithms for actual searching with FBKs. Back

Means to retrieve shots Example: –To find shots of “Bill Clinton” Face recognition Closed-caption Automatic Speech Recognition (ASR)

Metrics Common VS Special (In reality) –How common in reality is the concept represented by the keyword. Generic VS Specific –How generic is the concept represented by the keyword.

Metrics Concrete VS Abstract –Whether the keyword represented is concrete or abstract Topic frequency (Low VS High) –How often the keyword becomes (closely related to) a topic.

Metrics Formal VS Informal –Whether the keyword is in formal or informal language Written VS spoken –Whether the keyword is in spoken or written language

Metrics Feature-level VS Content-level –Whether the keyword is about the feature of the video (ex. camera motion) or the content of the video Back