1 Evaluation of Multi-Media Data QA Systems AQUAINT Breakout Session – June 2002 Howard Wactlar, Carnegie Mellon Yiming Yang, Carnegie Mellon Herb Gish,

Slides:



Advertisements
Similar presentations
National Technical University of Athens Department of Electrical and Computer Engineering Image, Video and Multimedia Systems Laboratory
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
User Interface Design Yonsei University 2 nd Semester, 2013 Sanghyun Park.
DL:Lesson 11 Multimedia Search Luca Dini
Richard Yu.  Present view of the world that is: Enhanced by computers Mix real and virtual sensory input  Most common AR is visual Mixed reality virtual.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
1 CS 430 / INFO 430 Information Retrieval Lecture 22 Metadata 4.
1 Discussion Class 10 Informedia. 2 Discussion Classes Format: Question Ask a member of the class to answer. Provide opportunity for others to comment.
Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Access to News Audio User Interaction in Speech Retrieval Systems by Jinmook Kim and Douglas W. Oard May 31, th Annual Symposium and Open House.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Presented by Zeehasham Rasheed
Common Core State Standards Professional Learning Module Series
ISP 433/633 Week 6 IR Evaluation. Why Evaluate? Determine if the system is desirable Make comparative assessments.
Digital Video Library Experience in Large Scale Content Management VIEW Technologies Symposium – CUHK – August 2002 Howard Wactlar Carnegie Mellon University,
1 Incremental Detection of Text on Road Signs from Video Wen Wu Joint work with Xilin Chen and Jie Yang.
Information Retrieval in Practice
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Copyright © 2004 Informedia Contexture: Analyzing and Synthesizing Video and Verbal Context for Intelligence Analysis Dialogues Alex Hauptmann, Howard.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
1 Lessons Learned From Building a Terabyte Digital Video Library Presented by Jia Yao Multimedia Communications and Visualization Laboratory Department.
JAVELIN Project Briefing 1 AQUAINT Year I Mid-Year Review Language Technologies Institute Carnegie Mellon University Status Update for Mid-Year Program.
Evaluation Experiments and Experience from the Perspective of Interactive Information Retrieval Ross Wilkinson Mingfang Wu ICT Centre CSIRO, Australia.
Visualization, analysis and mining of geo- spatial information in educational data sets using web-based tools Aniruddha Desai |Winter 2013 Presentation.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.
Finding Better Answers in Video Using Pseudo Relevance Feedback Informedia Project Carnegie Mellon University Carnegie Mellon Question Answering from Errorful.
MULTIMEDIA DEFINITION OF MULTIMEDIA
1 CS430: Information Discovery Lecture 18 Usability 3.
Grade 8 – Writing Standards Text Types and Purposes (1b) Write arguments to support claims with clear reasons and relevant evidence. Support claim(s) with.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Analysis and definition of classes Practice 1. ATD. Duties  Abstract type of data defines a set of objects, which fully describes a set of operations.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.
Image and Video Retrieval INST 734 Doug Oard Module 13.
Accessing News Video Libraries through Dynamic Information Extraction, Summarization, and Visualization Mike Christel Carnegie Mellon University, USA June.
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.
Colby Smart, E-Learning Specialist Humboldt County Office of Education
Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects. MAO Yuhang, DING Xiao-Qing, NI Yang, LIN Shiuan-Sung, Laurence LIKFORMAN,
MULTIMEDIA DATA MODELS AND AUTHORING
Justification/ExplanationEvaluation Breakout Session 6/13/02 Stefano BertoloRichard Fikes AQUAINT PI Meeting Monterey, California June 11-13, 2002.
Describe Relationship between illustrations and the text in which they appear With prompting K Explain How specific images contribute to and clarify a.
Task-Based Video Quality Update Or, what has Carolyn been up to this past year? VQEG, June 2009.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Boxes. boxes- learning targets o I will be able to display buttons (boxes) o I will be able to organize boxes o I will be able to create an animation.
TRECVID IES Lab. Intelligent E-commerce Systems Lab. 1 Presented by: Thay Setha 05-Jul-2012.
Evaluation Issues: June 2002 Donna Harman Ellen Voorhees.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Digital Video Library - Jacky Ma.
Visual Information Retrieval
CS 430: Information Discovery
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
CSE 635 Multimedia Information Retrieval
SIRS Issues Researcher School Subscribed Database
Discussion Class 9 Informedia.
Week 7 Presentation Ngoc Ta Aidean Sharghi
Presentation transcript:

1 Evaluation of Multi-Media Data QA Systems AQUAINT Breakout Session – June 2002 Howard Wactlar, Carnegie Mellon Yiming Yang, Carnegie Mellon Herb Gish, BBN

2 Extracting Answers From … Web-based charts and graphs – Yiming Yang Continuous speech – Herb Gish Video – Howard Wactlar

3 Video Extraction Long term goal: Ability to respond to same challenge and metrics as current and forthcoming TREC-QA –But with different source data –Multiple channels of information (with temporal and spatial dimensions) simultaneously Many of the speech goals and measures are applicable Errors inherent in extraction and interpretation of what is seen and heard –…but parallel sources of data may corroborate or refute

4 Multimedia Data Extraction Long term goal: Ability to respond to same challenge and metrics as current and forthcoming TREC-QA –But with different source data –Multiple channels of information (with temporal and spatial dimensions) simultaneously Problem: Errors inherent in extraction and interpretation –…but parallel sources of data may corroborate or refute

5 Examples of Parallel Data Mention of person’s names with people in imagery Appearance of face in image with name and affiliation overlaid in text (interpreted visually) Description of scene (or activity or event) and corresponding imagery Spoken location name and appearance of corresponding map Spoken location name and appearance of street or store sign Spoken location, appearance of street sign, and embedded GPS (forthcoming in all cameras)

6 Answering Questions from Video Factoid answers may be contained only within an image (while audio track gets you to the region of discovery) Generally, collateral video information provides context and background Finding a relevant image in addition to the textual answer Answering a query that specifically seeks an image –who was accompanying Muhammed Atef when he was visiting in Ankara and was this the same person who was in Rihyad with him

7 Some measures Did we find a relevant image related to the textual answer (precision measures) Did we correctly answer a question that specifically seeks an image –Video TREC precision and recall* measures (*on fully truthed data) How long does it take to interactively extract a useful answer –Number of steps, total elapsed time –Qualitatively different from textual search Extraction performance –e.g., How does named entity and factoid extraction degrade with speech recognition accuracy

8 Evaluation Task Proposal: Multi-modal Biography Generation Task Description: –Given a name, generate a multi-modal summary of key characteristics and events of person’s life in a specified time frame. Response: –Natural language passages/sentences of key events –Recent/historical pictures of individual –video/radio segments (e.g., speech,interview) augmenting key events –Org chart, map of movements Data: –Web images and text –Video and radio from news sources Eval: –Subjective (analyst / other user) assessment –Objective: precision, recall, recency, coherence Goal for next meeting: Define application-based task and metrics

9 Attendees who indicated further interest Robert Anita Jay Mark John Yiming Donna BerryNGIC Robert Hecht-Nielsen Herbert John Carol Van Ess-Dykema Howard Alex 5-6 others attended