2003.09.23 - SLIDE 1IS 202 – FALL 2003 Lecture 10: Multimedia Information Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.

Slides:



Advertisements
Similar presentations
Chapter 11 Designing the User Interface
Advertisements

1 CS 501 Spring 2002 CS 501: Software Engineering Lecture 11 Designing for Usability I.
Integrating Educational Technology into the Curriculum
Ying Wang EDN 303 Fall Objectives Define curriculum-specific learning Explain the difference between computer, information, and integration literacy.
Your Interactive Guide to the Digital World Discovering Computers 2012.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Information Retrieval in Practice
SLIDE 1IS246 - SPRING 2003 Lecture 18: Final Project Overview IS246 Multimedia Information (FILM 240, Section 4) Prof. Marc Davis UC Berkeley.
Learning in the Wild Satanjeev “Bano” Banerjee Dialogs on Dialog March 18 th, 2005 In the Meeting Room Scenario.
ADVISE: Advanced Digital Video Information Segmentation Engine
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002
SLIDE 1IS246 - SPRING 2003 Lecture 13: Computational Media Theory IS246 Multimedia Information (FILM 240, Section 4) Prof. Marc Davis UC Berkeley.
Living in a Digital World Discovering Computers 2010.
SLIDE 1IS 202 – FALL 2003 Lecture 10: Metadata for Media Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am.
SLIDE 1IS 202 – FALL 2004 Lecture 23: Multimedia Information Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.
Overview What is Content Creation Software (CCS) Types of Software CCS Applied in the Classroom Summary References.
Knowledge Portals and Knowledge Management Tools
Discovering Computers Fundamentals, 2012 Edition Your Interactive Guide to the Digital World.
Libraries and Institutional Content Management Systems
Living in a Digital World Discovering Computers 2011.
Overview of Search Engines
Discovering Computers Fundamentals, 2011 Edition Living in a Digital World.
The Urge to Merge Kathleen A. Hansen, Professor University of Minnesota School of Journalism and Mass Communication SLA, Toronto, June 8, 2005 Kathleen.
 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Multimedia Databases (MMDB)
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
The Yellow Group Design Informatics (Regli, Stone, Kusiak, Leifer, Gupta, Chung, Fenves, Law, Kopena)
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Sharad Oberoi and Susan Finger Carnegie Mellon University DesignWebs: Towards the Creation of an Interactive Navigational Tool to assist and support Engineering.
Business Software What is database software? p. 145 Allows you to create, access, and manage data Add, change, delete, sort, and retrieve data Next.
Master Thesis Defense Jan Fiedler 04/17/98
1 Application Software What is application software?  Programs that perform specific tasks for users.
Video Media Center (VMC 1000 ™ ) Turn communications into content.
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002
New Ideas for IA Readings review - How to manage the process Content Management Process Management - New ideas in design Information Objects Content Genres.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
1 Week 5 Application Software. Objectives Overview Identify the four categories of application software Describe characteristics of a user interface Identify.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
IBM Software Group ® Managing Reusable Assets Using Rational Suite Shimon Nir.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Software Reuse Course: # The Johns-Hopkins University Montgomery County Campus Fall 2000 Session 4 Lecture # 3 - September 28, 2004.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
SAPIR Search in Audio-Visual Content using P2P Information Retrival For more information visit: Support.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 CS 501 Spring 2003 CS 501: Software Engineering Lecture 13 Usability 1.
SLIDE 1IS 202 – FALL 2004 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 pm Fall 2004
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Discovering Computers Fundamentals, 2010 Edition Living in a Digital World.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
MPEG 7 &MPEG 21.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
Discovering Computers 2011: Living in a Digital World Chapter 3
Objectives Overview Identify the four categories of application software Describe characteristics of a user interface Identify the key features of widely.
Digital Video Library - Jacky Ma.
Visual Information Retrieval
Application Software Chapter 6.
SPECIALIZED APPLICATION SOFTWARE
An Overview of MPEG-21 Cory McKay.
Presentation transcript:

SLIDE 1IS 202 – FALL 2003 Lecture 10: Multimedia Information Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 pm Fall SIMS 202: Information Organization and Retrieval

SLIDE 2IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 3IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 4IS 202 – FALL 2003 Global Media Network Digital media produced anywhere by anyone accessible to anyone anywhere Today’s media users become tomorrow’s media producers Not 500 Channels — 500,000,000 multimedia Web Sites

SLIDE 5IS 202 – FALL 2003 Media Asset Management and Reuse Media Asset Management –Corporate Media companies, media archives, training, sales, catalogs, etc. –Government Military, surveillance, law enforcement, etc. –Academia Libraries, research, instruction, etc. –Consumer Home video and photos, fan reuse of popular content, etc.

SLIDE 6IS 202 – FALL 2003 Applications of Analysis and Retrieval Professional and educational applications –Automated authoring of Web content –Searching and browsing large video archives –Easy access to educational materials –Indexing and archiving multimedia presentations –Indexing and archiving multimedia collaborative sessions Consumer domain applications –Video overview and access –Video content filtering –Enhanced access to broadcast video

SLIDE 7IS 202 – FALL 2003 The Media Opportunity Vastly more media will be produced Without ways to manage it (metadata creation and use) we lose the advantages of digital media Most current approaches are insufficient and perhaps misguided Great opportunity for innovation and invention Need interdisciplinary approaches to the problem

SLIDE 8IS 202 – FALL 2003 What is the Problem? Today people cannot easily find, edit, share, and reuse media Computers don’t understand media content –Media is opaque and data rich –We lack structured representations Without content representation (metadata), manipulating digital media will remain like word- processing with bitmaps

SLIDE 9IS 202 – FALL 2003 The Semantic Gap “[…] the semantic gap between the rich meaning that users want when they query and browse media and the shallowness of the content descriptions that we can actually compute is weakening today’s automatic content-annotation systems.”

SLIDE 10IS 202 – FALL 2003 M E T A D A T A Traditional Media Production Chain PRE-PRODUCTIONPOST-PRODUCTIONPRODUCTIONDISTRIBUTION Metadata-Centric Production Chain

SLIDE 11IS 202 – FALL 2003 Chang: Content-Based Media Analysis “Traditional views of content-based technologies focus on search and retrieval—which is important but relatively narrow.” “[…] emphasizing the end-to-end content chain and the many issues evolving around it. What’s the best way to integrate manual and automatic solutions in different parts of the chain?”

SLIDE 12IS 202 – FALL 2003 Computational Media Aesthetics “ […] the algorithmic study of a variety of image and aural elements in media (based on their use in film grammar). It is also the computational analysis of the principles that have emerged underlying their manipulation in the creative art of clarifying, intensifying, and interpreting an event for an audience.” “Our research systematically uses film grammar to inspire and underpin an automated process of analyzing, characterizing, and structuring professionally produced videos.”

SLIDE 13IS 202 – FALL 2003 Chang: Content-Based Media Technology Practical impact criteria for evaluating multimedia research directions –Generating metadata not available from production –Providing metadata that humans aren’t good at generating –Focusing on content with large volume and low individual value –Adopting well-defined tasks and performance metrics

SLIDE 14IS 202 – FALL 2003 Chang: Content-Based Media Technology Areas of research –Reverse engineering of the media capturing and editing processes –Extracting and matching objects –Meaning decoding and automatic annotation –Analysis and retrieval with user feedback –Generating time-compressed skims –Efficient indexing for large databases –Content adaptation for accessing, multimedia over heterogeneous devices –Standards for specifying content description language and scheme like MPEG-7

SLIDE 15IS 202 – FALL 2003 CMA Challenges Can we dynamically detect successful aesthetic principles with accuracy and consistency using computational analysis? Can we build new postproduction tools based on this analysis for rapid, cost-efficient, and effective moviemaking and consistent evaluation? How can we use these successful audio–visual strategies for improved training and education in mass communication? How do we raise the quality of media annotation and improve the usability of content-based video search and retrieval systems?

SLIDE 16IS 202 – FALL 2003 Asset Retrieval and Reuse Automated Media Production Process Web Integration and Streaming Media Services Flash Generator WAP HTML Print/Physical Media Active Capture 1 Automatic Editing 3 Personalized/ Customized Delivery 4 Adaptive Media Engine 2 Annotation and Retrieval Reusable Online Asset Database Annotation of Media Assets

SLIDE 17IS 202 – FALL 2003 Technology Summary Media Streams provides a framework for creating metadata throughout the media production cycle to make media assets searchable and reusable Active Capture automates direction and cinematography using real-time audio-video analysis in an interactive control loop to create reusable media assets Adaptive Media uses adaptive media templates and automatic editing functions to mass customize and personalize media and thereby eliminate the need for editing on the part of end users Together, these technologies will automate, personalize, and speed up media production, distribution, and reuse

SLIDE 18IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 19IS 202 – FALL 2003 New Solutions for Creating Metadata After CaptureDuring Capture

SLIDE 20IS 202 – FALL 2003 New Solutions for Creating Metadata After CaptureDuring Capture

SLIDE 21IS 202 – FALL 2003 After Capture: Media Streams

SLIDE 22IS 202 – FALL 2003 Media Streams Features Key features –Stream-based representation (better segmentation) –Semantic indexing (what things are similar to) –Relational indexing (who is doing what to whom) –Temporal indexing (when things happen) –Iconic interface (designed visual language) –Universal annotation (standardized markup schema) Key benefits –More accurate annotation and retrieval –Global usability and standardization –Reuse of rich media according to content and structure

SLIDE 23IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 24IS 202 – FALL 2003 New Solutions for Creating Metadata After CaptureDuring Capture

SLIDE 25IS 202 – FALL 2003 Creating Metadata During Capture New Capture Paradigm 1 Good Capture Drives Multiple Uses Current Capture Paradigm Multiple Captures To Get 1 Good Capture

SLIDE 26IS 202 – FALL 2003 Active Capture Processing CaptureInteraction Active Capture Computer Vision/ Audition Human Computer Interaction Direction/ Cinematography

SLIDE 27IS 202 – FALL 2003 Active Capture Active engagement and communication among the capture device, agent(s), and the environment Re-envision capture as a control system with feedback Use multiple data sources and communication to simplify the capture scenario Use HCI to support “human- in-the-loop” algorithms for computer vision and audition

SLIDE 28IS 202 – FALL 2003 Human-In-The-Loop Algorithms Leverage what humans and computers are respectively good at –Example: Object recognition and tracking Leverage interaction with the situated human agent –Examples: Activity recognition (Jump detector with “Simon Says” interaction) Object recognition (Car finder with “Treasure Hunt” interaction)

SLIDE 29IS 202 – FALL 2003 Active Capture

SLIDE 30IS 202 – FALL 2003 Active Capture: Reusable Shots

SLIDE 31IS 202 – FALL 2003 Marc Davis in Godzilla Scene

SLIDE 32IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 33IS 202 – FALL 2003 Evolution of Media Production Customized production –Skilled creation of one media product Mass production –Automatic replication of one media product Mass customization –Skilled creation of adaptive media templates –Automatic production of customized media

SLIDE 34IS 202 – FALL 2003 Editing Paradigm Has Not Changed

SLIDE 35IS 202 – FALL 2003 Computational Media More intimately integrate two great 20 th century inventions

SLIDE 36IS 202 – FALL 2003 Movies change from being static data to programs Shots are inputs to a program that computes new media based on content representation and functional dependency (US Patents 6,243,087 & 5,969,716) Central Idea: Movies as Programs Parser Producer Media Content Representation Content Representation

SLIDE 37IS 202 – FALL 2003 Content Not Author- Generated Author- Generated Author- Generated Structure Compilation Movie Making Traditional Movie Making Historical Documentary Movie Making Adaptive Media Design Space

SLIDE 38IS 202 – FALL 2003 Video Lego (structure is constrained) Video MadLibs (structure is determined) Content Not Author- Generated Author- Generated Author- Generated Structure Compilation Movie Making Traditional Movie Making Historical Documentary Movie Making Adaptive Media Design Space

SLIDE 39IS 202 – FALL 2003 The Blank Page Approach

SLIDE 40IS 202 – FALL 2003 Captain Zoom IV MadLib™

SLIDE 41IS 202 – FALL 2003 Constructing With Lego™ Blocks

SLIDE 42IS 202 – FALL 2003 Video MadLibs and Video Lego Video MadLibs –Adaptive media template with open slots –Structure is fixed –Content can be varied Video Lego –Reusable media components that know how to fit together –Structure is constrained –Content can be varied

SLIDE 43IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 44IS 202 – FALL 2003 Moore’s Law for Cameras 2000 Kodak DC40 Nintendo GameBoy Camera $400 $ Kodak DX4900 SiPix StyleCam Blink

SLIDE 45IS 202 – FALL 2003 Capture+Processing+Interaction+Network

SLIDE 46IS 202 – FALL 2003 Mobile Media Metadata Project Leverage the context and community of media capture in mobile devices –Gather all automatically available information at the point of capture (time, spatial location, phone user, etc.) –Use metadata similarity and media analysis algorithms to find similar media that has been annotated before –Take advantage of this previously annotated media to make educated guesses about the content of the newly captured media –Interact in a simple and intuitive way with the phone user to confirm and augment system-supplied metadata for captured media

SLIDE 47IS 202 – FALL 2003 Campanile Scenario

SLIDE 48IS 202 – FALL 2003 MMM Architecture Phone Browser Application Server Annotation Process Handling xhtml Facade Metadata Service DB Desktop Browser Phone Gateway html Web Server Metadata Processing Object Factory Flamenco Dumper Flamenco Dialog Logic

SLIDE 49IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 50IS 202 – FALL 2003 Discussion Questions (Davis) Ryan Shaw on Davis on “Editing Out Video Editing” –There are obviously a lot of legal and political barriers to be overcome in order to realize the type of media production and re-use that Marc envisions. Current metadata schemes, if they have elements related to intellectual property at all, seem to be focused on ownership and restrictions, i.e. they spell out what we can't do. How might we design metadata that tell us and our media production machines what we *can* do legally? Are things like Creative Commons licenses a step in the right direction?

SLIDE 51IS 202 – FALL 2003 Discussion Questions (Davis) Ryan Shaw on Davis on “Editing Out Video Editing” –To what extent can computer games like Grand Theft Auto: Vice City be considered machines for mass customization of narrative media? –A couple of weeks ago we discussed the issue of authors categorizing and annotating their works themselves vs. having professionals such as librarians categorize and annotate them. If we look at what librarians do as being similar to annotation of media assets in postproduction, how might Marc's ideas about using software to capture metadata throughout the production process apply to textual information assets like books, articles, and web pages?

SLIDE 52IS 202 – FALL 2003 Discussion Questions (CMA) Margaret Spring on “Computational Media Aesthetics” –The article discusses a means of referencing film grammar (shot, lighting, recording distance, etc.) to visual, aural and thematic patterns in order to develop media analysis. Does this seem like it would be exclusively a production & annotation tool? If a catalog of film techniques where created, could a legitimate and comprehensible film be made purely utilizing technique references? While it might not be useful for dramatic films, could such technology be effective for training films, annotating events with rules (such as sports) or user guide type vignettes?

SLIDE 53IS 202 – FALL 2003 Discussion Questions (CMA) Margaret Spring on “Computational Media Aesthetics” –There is a reference to “reverse-engineering intent and meaning from available content.” What would this require? Do people believe that such technology could at one point be applied to summarizing a television program? If such a tool were created, would it be able to keep up with pop culture/connotative references in order to be effective? Can you teach an annotation system to understand sarcasm?

SLIDE 54IS 202 – FALL 2003 Discussion Questions (Chang) Andrea La Pietra on “The Holy Grail of Content-Based Media Analysis” –Chang discusses that one of the criteria that is important in order for content-based analysis to have a “high impact” is “providing metadata that humans aren’t good at generating.” But, what is the right balance of human perception/annotation and automatic annotation to be used in each domain? And can it be dangerous to use more automatic elements in some domains as opposed to others?

SLIDE 55IS 202 – FALL 2003 Discussion Questions (Chang) Andrea La Pietra on “The Holy Grail of Content- Based Media Analysis” –In the medical video indexing example he claims each piece of content “does not have significant value.” But, isn’t this subjective? Medical knowledge is dynamic. Is it possible that they may stumble upon a medical case similar to a prior medical case but do not make the connection because they chose to throw out some element common to both cases which seemed unimportant at one time? Chang even states himself, “A significant skill threshold exists for personnel qualified for the annotation process.”

SLIDE 56IS 202 – FALL 2003 Discussion Questions (Dimitrova) Dan Perkel on “Applications of Video- Content Analysis and Retrieval” –“We perceive a video program as a document” (p. 42) How does this assertion facilitate the development of useful video- content analysis and retrieval applications? How does this limit those applications? Are there other ways of treating a video (or multimedia) program that would lead to different types of applications than the ones described in the article?

SLIDE 57IS 202 – FALL 2003 Today’s Agenda Problem Setting New Solutions –Media Streams –Active Capture –Adaptive Media –Mobile Media Metadata Project Discussion Questions Action Items for Next Time

SLIDE 58IS 202 – FALL 2003 Next Time Metadata for Motion Pictures: Media Streams Readings for next time –“Media Streams: An Iconic Visual Language for Video Representation” (M. Davis)