Digital Video Library Experience in Large Scale Content Management. VIEW Technologies Symposium, CUHK, August 2002. Howard Wactlar, Carnegie Mellon University, USA.

Presentation transcript:

Digital Video Library Experience in Large Scale Content Management
  VIEW Technologies Symposium, CUHK, August 2002
  Howard Wactlar, Carnegie Mellon University, USA

Video Life Cycle
  Acquisition: Surveillance, Radio, Broadcast TV, Training Film, Satellite
  Analysis and Organization: Speech Recognition, Image Analysis, Natural Language Interpretation, Segmentation, Digital Compression, Database
  Distribution: Cable, PDA, Cell Phone, Internet

Informedia Overview
  Mission: Enable Search and Discovery in the Video Medium
  REQUIREMENTS:
    Automated process for information extraction from video
    Full-content search and retrieval from all spoken language and visual documents
    Establishment of large video libraries as a network searchable information resource
  APPROACH:
    Integration of machine speech, image and natural language understanding for library creation and exploration

Sample Corpora
  CNN News Broadcasts (2050 hours)
    68,000 segments/stories
    1.7 Million “shots”
  China Historical and Cultural Documentaries (100 hours)
    English language
    Western perspective

Some Examples

Why is Multimedia Difficult?

Challenges of Data Extraction

Recognizing Scene Text and Faces
  Scene Text Detection

Interpreting Images Containing Similar Content

Understanding Speech in Natural Settings
  Style Variations: careful, clear, articulated, formal, casual, spontaneous, normal, read, dictated, intimate
  Voice Quality: breathy, creaky, whispery, tense, lax, modal
  Context: sport, professional, interview, free conversation, man-machine dialogue
  Speaking Rate: normal, slow, fast, very fast
  Stress: in noise, with increased vocal effort (Lombard reflex), emotional factors (e.g. angry), under cognitive load

Gathering Information with Faulty Technology
  Retrieval performance in the presence of inaccuracy and ambiguity in the underlying cognitive processing
  Approximate match in meaning and visualization
  Presentation and reuse of library content
  New data type with space and time dimensions
  Restricted use intellectual property
  Interoperability in the absence of standards
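To make "retrieval in the presence of inaccuracy" concrete, the following is a minimal sketch of approximate matching against an errorful speech-recognition transcript. It is not the Informedia implementation: the function names, the sample transcripts, and the 0.8 threshold are all assumptions chosen for illustration, and character-level similarity from Python's standard `difflib` stands in for whatever matching the real system uses.

```python
from difflib import SequenceMatcher

def best_window_score(query: str, transcript: str) -> float:
    """Best similarity (0..1) between the query and any run of
    transcript words with the same word count as the query."""
    q_words = query.lower().split()
    t_words = transcript.lower().split()
    n = len(q_words)
    if n == 0 or len(t_words) < n:
        return 0.0
    q = " ".join(q_words)
    best = 0.0
    for i in range(len(t_words) - n + 1):
        window = " ".join(t_words[i:i + n])
        best = max(best, SequenceMatcher(None, q, window).ratio())
    return best

def search(query, transcripts, threshold=0.8):
    """Return segment ids whose best window clears the (assumed)
    threshold, highest score first."""
    scored = [(best_window_score(query, text), seg_id)
              for seg_id, text in transcripts.items()]
    hits = [(score, seg_id) for score, seg_id in scored if score >= threshold]
    return [seg_id for score, seg_id in sorted(hits, reverse=True)]

# Hypothetical transcripts; "neno" imitates a recognition error.
SAMPLE_TRANSCRIPTS = {
    "seg1": "flooding caused by el nino hit peru",
    "seg2": "the el neno effect brought drought to australia",
    "seg3": "stock markets closed higher today",
}

results = search("el nino", SAMPLE_TRANSCRIPTS)
print(results)
```

An exact-match search would miss the misrecognized segment. Lowering the threshold trades precision for recall, which is the tension the slide describes.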

Challenge of Continuous Production

Annual Video and Audio Production
  Commercial
    4500 motion pictures -> 9,000 hours/year (4.5 TB)
    33,000 TV stations x 4 hrs/day -> 48,000,000 hrs/yr (24,000 TB)
    44,000 radio stations x 4 hrs/day -> 65,500,000 hrs/yr (3,275 TB)
  Personal
    Photographs: 80 billion images -> 410,000 TB/yr
    Home videos: 1.4 billion tapes -> 300,000 TB/yr
    X-rays: 2 billion -> 17,000 TB/yr
  Surveillance
    Airports: 14,000 terminals x 140 cameras x 24 hrs/day -> 48 M hrs/day
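The arithmetic behind these estimates can be checked with a short sketch. It assumes a 365-day year and decimal terabytes (1 TB = 1000 GB), neither of which the slide states. Under those assumptions the TV figure reproduces closely, while the radio and surveillance products come out slightly below the slide's rounded values.

```python
DAYS_PER_YEAR = 365  # assumption; the slide does not state the basis

def hours_per_year(sources: int, hours_per_day: int) -> int:
    """Yearly hours produced by `sources` outlets at a fixed daily rate."""
    return sources * hours_per_day * DAYS_PER_YEAR

def implied_gb_per_hour(terabytes: float, hours: float) -> float:
    """Storage rate implied by a (TB, hours) pair, using 1 TB = 1000 GB."""
    return terabytes * 1000 / hours

tv_hours = hours_per_year(33_000, 4)             # slide: 48,000,000 hrs/yr
radio_hours = hours_per_year(44_000, 4)          # slide: 65,500,000 hrs/yr
surveillance_hours_per_day = 14_000 * 140 * 24   # slide: 48 M hrs/day

print(f"TV:           {tv_hours:,} hrs/yr")
print(f"Radio:        {radio_hours:,} hrs/yr")
print(f"Surveillance: {surveillance_hours_per_day:,} hrs/day")

# Storage rates implied by the slide's own (TB, hours) pairs.
print(f"Film:  {implied_gb_per_hour(4.5, 9_000):.2f} GB/hr")
print(f"TV:    {implied_gb_per_hour(24_000, 48_000_000):.2f} GB/hr")
print(f"Radio: {implied_gb_per_hour(3_275, 65_500_000):.2f} GB/hr")
```

The implied rates suggest the estimates assume roughly 0.5 GB per hour of video and 0.05 GB per hour of audio.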

Annual Print Production
  Commercial
    22,600 newspapers x 30 pgs/day -> 124 TB/year
    80,000 periodicals x 5,000 pgs/yr -> 52 TB/yr
    40,000 scholarly journals x 1,700 pgs/yr -> 9 TB/yr

Video Visualization: Summarizing and Visualizing the Result Set

Summarizing Thousands of Videos
Example: Map Collage
  Map collage summarizing “El Niño effects”, showing distribution by nation with overlaid thumbnails
  Map labels: North Pacific Ocean, South Pacific Ocean, Drought, Fire, Floods

The Need for Visualization Strategies
  As digital video assets grow, so do possible result sets
  We transmit with limited bandwidths to limited screen “real estate”
  As automated processing improves, more metadata enables more dimensions and interfaces into the video content
  Users want to apply multiple perspectives interchangeably
  Direct manipulation interfaces are required to place the user in control

Some Examples

Video Digests
  Overview first, zoom and filter, then details-on-demand
  Concatenate scene elements into a single panoramic view
  Visualize word-based relationships
  Establish timelines showing trends against time
  Present maps (or diagrams) showing geographic (or spatial) correlations
  Combine digests into a single view or animated into a temporal presentation (the auto-documentary)

Content-based Metadata Extraction Enables Video Visualization and Summarization
  Components: Metadata Extractor, Summarizer, Personalized Presentation, User Perspective Templates
  Metadata: People, Event, Affiliation, Location, Topics, Time

Information Goals
  Generate information perspectives on-demand: e.g., by time, location, personalities, events
  Eliminate redundancy
  Link all the way back to source content to interactively and dynamically provide any level of detail and summarization
  Communicate results
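One way to picture "perspectives on demand" is as a regrouping of the same extracted metadata by whichever facet the user picks. The sketch below is purely illustrative: the `Segment` record, its field names, the sample data, and the rule for what counts as redundant are all assumptions, not the Informedia schema.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class Segment:
    """Hypothetical metadata record for one video story."""
    source_id: str   # link back to the source content
    start_sec: int   # offset of the story within the source
    date: str        # ISO date of the broadcast
    location: str
    event: str

def drop_redundant(segments):
    """Keep the first segment for each (date, location, event) triple.
    Treating equal triples as redundant is an assumed rule."""
    seen = set()
    unique = []
    for seg in segments:
        key = (seg.date, seg.location, seg.event)
        if key not in seen:
            seen.add(key)
            unique.append(seg)
    return unique

def perspective(segments, facet):
    """Group segments by one metadata field, e.g. 'date' or 'location'."""
    groups = defaultdict(list)
    for seg in segments:
        groups[getattr(seg, facet)].append(seg)
    return dict(groups)

# Invented sample data for illustration only.
SEGMENTS = [
    Segment("cnn-001", 120, "1997-11-02", "Peru", "flood"),
    Segment("cnn-002", 45, "1997-11-02", "Peru", "flood"),
    Segment("cnn-003", 300, "1998-01-15", "Australia", "drought"),
    Segment("cnn-004", 10, "1998-02-20", "Indonesia", "fire"),
]

unique = drop_redundant(SEGMENTS)
by_location = perspective(unique, "location")
by_event = perspective(unique, "event")
print(sorted(by_location))
print(sorted(by_event))
```

Because every group still holds the full records, each summary entry keeps its `source_id` and `start_sec`, so a viewer can drill from any perspective back to the original footage.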

Knowledge Goals
  Detect trends
  Reveal relationships
  Infer causality
  Discover anomalies
  ...

Video Life Cycle
  Acquisition: Surveillance, Radio, Broadcast TV, Training Film, Satellite
  Analysis and Organization: Speech Recognition, Image Analysis, Natural Language Interpretation, Segmentation, Digital Compression, Database
  Distribution: Cable, PDA, Cell Phone, Internet
  $$ $$

Application Space
  Consumer and Business
    Evolving and archived news and information
    Education and training
    Sports and entertainment
    Interactive television
    Personal memory aids
  Professional and Enterprise
    Conventions and tradeshows
    Meetings/corporate memory

Digital Video Library Thank you