Image and Video Retrieval INST 734 Doug Oard Module 13.

Slides:



Advertisements
Similar presentations
Creating the Television Ad
Advertisements

Motion Capture The process of recording movement and translating that movement onto a digital model Games Fast Animation Movies Bio Medical Analysis VR.
Games, Movies and Virtual Worlds – An Introduction to Computer Graphics Ayellet Tal Department of Electrical Engineering Technion.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
Video Definitions Necessary Multimedia Vocabulary #5 Journalism 108.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
CIS 581 Course Project Heshan Lin
Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
Automated Shot Boundary Detection in VIRS DJ Park Computer Science Department The University of Iowa.
SmartPlayer: User-Centric Video Fast-Forwarding K.-Y. Cheng, S.-J. Luo, B.-Y. Chen, and H.-H. Chu ACM CHI 2009 (international conference on Human factors.
Clustering & image segmentation Goal::Identify groups of pixels that go together Segmentation.
Detecting Cartoons a Case Study in Automatic Video-Genre Classification Tzvetanka Ianeva Arjen de Vries Hein Röhrig.
Segmentation Course web page: vision.cis.udel.edu/~cv May 2, 2003  Lecture 29.
Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
Broadcast News Parsing Using Visual Cues: A Robust Face Detection Approach Yannis Avrithis, Nicolas Tsapatsoulis and Stefanos Kollias Image, Video & Multimedia.
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
ADVISE: Advanced Digital Video Information Segmentation Engine
SWE 423: Multimedia Systems
Multimedia IR LBSC 796/INFM 718R Session 11 November 9, 2011.
T.Sharon 1 Internet Resources Discovery (IRD) Video IR.
Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set.
Image and Video Retrieval LBSC 796/INFM 718R Session 13, May 4, 2011 Douglas W. Oard.
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
1 Motion in 2D image sequences Definitely used in human vision Object detection and tracking Navigation and obstacle avoidance Analysis of actions or.
Video Trails: Representing and Visualizing Structure in Video Sequences Vikrant Kobla David Doermann Christos Faloutsos.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Motion Detection in UAV Videos by Cooperative Optical Flow and Parametric Analysis Masaharu Kobashi.
Beyond Text INFM 718X/LBSC 708X Session 10 Douglas W. Oard.
© Anselm SpoerriDigital Media Production Digital Media Production Digital Media Production Anselm Spoerri PhD (MIT) Rutgers University
Video Classification By: Maryam S. Mirian
Information Extraction from Cricket Videos Syed Ahsan Ishtiaque Kumar Srijan.
CA0932a Multimedia Development Lecture 11 Language of film and the visual narrative.
Image and Video Retrieval INST 734 Doug Oard Module 13.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Università degli Studi di Modena and Reggio Emilia Dipartimento di Ingegneria dell’Informazione Prototypes selection with.
Introduction EE 520: Image Analysis & Computer Vision.
Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES
Image and Video Retrieval INST 734 Doug Oard Module 13.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
Detection of Illicit Content in Video Streams Niall Rea & Rozenn Dahyot
Using Cross-Media Correlation for Scene Detection in Travel Videos.
Pascal Kelm Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Video Key Frame Extraction for image-based Applications.
MACHINE VISION GROUP MOBILE FEATURE-CLOUD PANORAMA CONSTRUCTION FOR IMAGE RECOGNITION APPLICATIONS Miguel Bordallo, Jari Hannuksela, Olli silvén Machine.
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Representing Moving Images with Layers J. Y. Wang and E. H. Adelson MIT Media Lab.
SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.
Representation in Vision Derek Hoiem CS 598, Spring 2009 Jan 22, 2009.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
What is Windows Movie Maker? Windows Movie Maker is an easy to use video editing software that allows you to make home movies, automated photo albums,
Learning color and locality cues for moving object detection and segmentation Yuan-Hao Lai Feng Liu and Michael Gleicher University of Wisconsin-Madison.
Visual Information Retrieval
Automatic Video Shot Detection from MPEG Bit Stream
Multimedia Content Based Retrieval
Presenter: Ibrahim A. Zedan
Announcements Final is Thursday, March 20, 10:30-12:20pm
Motion Picture Language
Editing and Camera Terms
Representing Moving Images with Layers
Motion Picture Language
Image Segmentation Techniques
Representing Moving Images with Layers
Filtering Things to take away from this lecture An image as a function
Announcements Final is Thursday, March 16, 10:30-12:20
Filtering An image as a function Digital vs. continuous images
Presentation transcript:

Image and Video Retrieval INST 734 Doug Oard Module 13

Agenda Image retrieval  Video retrieval Multimedia retrieval

Video Structures Image structure –Absolute positioning, relative positioning Object motion –Translation, rotation Camera motion –Pan, zoom, perspective change Shot transitions –Cut, fade, dissolve, …

Shot Boundary Detection Create a color histogram for each image Segment at discontinuities (cuts) –Cuts are easy, other transitions are also detectable Cluster representative histograms for each shot –Identifies cuts back to a prior shot Build a time-labeled transition graph

Shot Classification Shot-to-shot structure correlates with genre –Reflects accepted editorial conventions Some substructures are informative –Frequent cuts to and from announcers –Periodic cuts between talk show participants –Wide-narrow cuts in sports programming Simple image features can reinforce this –Head-and-shoulders, object size, …

Object Motion Detection Hypothesize objects as in image retrieval –Segment based on color and texture Examine frame-to-frame pixel changes Classify motion –Translation Linear transforms model unaccelerated motion –Rotation Creation & destruction, elongation & compression –Merge or split

Camera Motion Detection Do global frame-to-frame pixel analysis Classify the resulting patterns –Central tendency -> zoom out –Balanced exterior destruction -> zoom in –Selective exterior destruction -> pan –Coupled rotation and translation -> perspective

Some Concept Detectors Outdoor –Road->Cul-de-Sac Vehicle –Airplane->Airplane Flying->Airplane Landing Person –Politicians->Tony Blair –Person_Using_ATM Election_Campaign –Election_Campaign_Greeting Animal –Arthropod->Crustacean->Lobster

Key Frame Extraction First frame of a shot is easy to select –But it may not be the best choice Genre-specific cues may be helpful –Minimum optical flow for director’s emphasis –Face detection for interviews –Presence of on-screen captions This may produce too many frames –Color histogram clusters can reveal duplicates

Full Motion Extracts Extracted shots, joined by cuts –The technique used in movie advertisements Conveys more information using motion –Optionally aligned with extracted sound as well Hard to build a coherent extract –Movie ads are constructed by hand

“Salient Stills” Abstractive Summary

Agenda Image retrieval Video retrieval  Multimedia retrieval