Creating Music Videos using Automated Media Analysis. Authored by Jonathan Foote, Matthew Cooper, and Andreas Girgensohn. Presented by Sukhyung Shin and Ninad Dewal.

Presentation transcript:

Creating Music Videos using Automated Media Analysis Authored by Jonathan Foote, Matthew Cooper, and Andreas Girgensohn Presented by Sukhyung Shin, Ninad Dewal

One neat usage…
Home videos are LONG
–… AND generally have poor quality video & audio
  Video has fast motion
  Video has moments of extreme brightness
–Too tedious to watch
–Too precious to throw away
Solution:
–Automatic Music Video Creation

Key Guidelines to Keep in Mind
Soundtrack quality → video quality
–You think the video is better
Synchronization helps both
–Enhanced perception of quality
Users choose clips
–Fully automated not optimal
–Need mix of both

What they did, in a Nutshell
Automatic/Semi-automatic creation:
–Source video
–Arbitrary audio soundtrack
Video clips aligned w/ audio changes
–Audio: looked for tempo
–Video: looked for unsuitability
High level of synchronization

Audio Parameterization
Self-similarity (SS) analysis
–Independent of type of music
–Past and future regions
–Novel point between high SS regions
–Standard spectral parameterization:
  Based on STFT (short-time Fourier transform)
  Sampled at 22 kHz, quantized into 30 bins
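The parameterization step can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the frame length, hop size, Hann window, and the reading of "quantized into 30 bins" as pooling STFT magnitudes into 30 frequency groups are all assumptions, and `spectral_features` is a hypothetical helper name.

```python
import numpy as np

def spectral_features(signal, frame_len=1024, hop=512, n_bins=30):
    """Return one n_bins-long log-magnitude feature vector per STFT frame."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    feats = np.empty((n_frames, n_bins))
    # Edges that split the rfft output into n_bins roughly equal groups.
    edges = np.linspace(0, frame_len // 2 + 1, n_bins + 1).astype(int)
    for t in range(n_frames):
        frame = signal[t * hop : t * hop + frame_len] * window
        mag = np.abs(np.fft.rfft(frame))
        # Average the magnitudes inside each group, then compress with a log.
        pooled = np.add.reduceat(mag, edges[:-1]) / np.diff(edges)
        feats[t] = np.log1p(pooled)
    return feats

sr = 22050  # close to the 22 kHz rate quoted on the slide
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 440 * t)  # one second of a 440 Hz test tone
features = spectral_features(tone)
```

Each row of `features` is the vector for one audio frame; these rows are what the self-similarity analysis compares.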

Audio Self-Similarity Analysis
Parameterized audio → 2D representation
Key = Dis-similarity measurement (cosine)
–Can yield large scores for low magnitude vectors
–Similarity Matrix S
–Serves as visualization of audio file structure
  High similarity: bright
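A small sketch of the similarity matrix, assuming the cosine measure named on the slide is applied to every pair of frame vectors. The function name and the toy vectors are illustrative only.

```python
import numpy as np

def similarity_matrix(feats, eps=1e-12):
    """Cosine similarity between every pair of rows in feats."""
    norms = np.linalg.norm(feats, axis=1, keepdims=True)
    unit = feats / np.maximum(norms, eps)  # guard against all-zero frames
    return unit @ unit.T

feats = np.array([
    [1.0, 0.0],
    [2.0, 0.0],    # same direction as row 0, different magnitude
    [0.0, 3.0],    # orthogonal to rows 0 and 1
    [0.0, 0.001],  # tiny magnitude, same direction as row 2
])
S = similarity_matrix(feats)
```

Rows 2 and 3 score close to 1 even though row 3 is nearly silent, which is the "large scores for low magnitude vectors" caveat: the cosine measure normalizes magnitude away.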

Not similar regions: darker
Look for regions of:
–Low cross-similarity
–Then high self-similarity
Compare with a checkerboard kernel to obtain novelty N(i) for frame i.
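The novelty computation can be sketched as below, assuming a plain (untapered) checkerboard kernel slid along the main diagonal of S. The kernel width and the zero padding at the edges are assumptions made for this illustration.

```python
import numpy as np

def checkerboard_kernel(half_width):
    """+1 on the past-past and future-future quadrants, -1 on the cross quadrants."""
    signs = np.r_[-np.ones(half_width), np.ones(half_width)]
    return np.outer(signs, signs)

def novelty(S, half_width):
    """Kernel-weighted sum of S in a window centred on each frame i."""
    n = S.shape[0]
    kernel = checkerboard_kernel(half_width)
    padded = np.pad(S, half_width, mode="constant")  # zeros outside the matrix
    scores = np.zeros(n)
    for i in range(n):
        # Window covers frames i - half_width .. i + half_width - 1.
        window = padded[i : i + 2 * half_width, i : i + 2 * half_width]
        scores[i] = np.sum(kernel * window)
    return scores

# Toy matrix: frames 0-3 are mutually similar, frames 4-7 are mutually similar.
S = np.zeros((8, 8))
S[:4, :4] = 1.0
S[4:, 4:] = 1.0
N = novelty(S, half_width=2)
```

The score is highest at frame 4, where two internally similar regions with low cross-similarity meet. Zero padding inflates the score at the very first frame, so a real implementation would likely taper the kernel or ignore the edges.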

Segmenting and Editing Video
Split video at boundaries into takes and clips
Discarding Unsuitable Video
–Excessive camera motion or poor exposure
–Unsuitability score
  First estimate camera speed and direction
  Compare this estimate vs. current camera motion
  Test exposure/brightness
  Discard clips with score > 0.5
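The discard step can be illustrated with a short filter. How the unsuitability score is derived from camera motion and exposure is not given in the transcript, so the scores below are made-up inputs and `Clip` is a hypothetical container; only the 0.5 threshold comes from the slide.

```python
from dataclasses import dataclass

@dataclass
class Clip:
    start: float          # seconds into the source video
    end: float
    unsuitability: float  # assumed to lie in [0, 1]; higher means worse footage

def keep_suitable(clips, threshold=0.5):
    """Drop every clip whose unsuitability score exceeds the threshold."""
    return [c for c in clips if c.unsuitability <= threshold]

clips = [
    Clip(0.0, 4.0, 0.2),
    Clip(4.0, 9.5, 0.8),   # e.g. a fast pan
    Clip(9.5, 15.0, 0.5),
    Clip(15.0, 21.0, 0.6), # e.g. overexposed
]
kept = keep_suitable(clips)
```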

Aligning Audio and Video
So far, you have:
–Peaks from audio
–Clips from video boundaries
Simple solution:
–Rank audio peaks and match w/ video boundaries
–Assuming: video longer than audio (what if not?)
–Trim video clips even further if too long
  Assuming: High suitability score w/ audio region
–Focus on audio segmenting; video usually poor
–For fully automated:
  Algorithms used: sort, dynamic programming (DP)
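A rough sketch of the "simple solution": keep the strongest audio peaks as cut points and trim one video clip to fit each resulting audio segment. This is a greedy illustration under the slide's assumption that the video is longer than the audio; it is not the dynamic-programming variant, and the function and its arguments are hypothetical.

```python
def align(peaks, audio_len, clip_lens):
    """Assign clips to audio segments.

    peaks     -- list of (time_sec, novelty_score) tuples
    audio_len -- soundtrack length in seconds
    clip_lens -- lengths of the usable video clips, in playback order
    Returns (segment_start, segment_end, clip_index, seconds_used) tuples.
    """
    n_cuts = len(clip_lens) - 1
    # Rank peaks by novelty, keep the strongest, then restore time order.
    strongest = sorted(peaks, key=lambda p: p[1], reverse=True)[:n_cuts]
    cuts = sorted(t for t, _ in strongest)
    bounds = [0.0] + cuts + [audio_len]
    plan = []
    for k, (start, end) in enumerate(zip(bounds[:-1], bounds[1:])):
        # Trim the clip when it is longer than its audio segment.
        used = min(clip_lens[k], end - start)
        plan.append((start, end, k, used))
    return plan

peaks = [(10.0, 0.9), (4.0, 0.2), (25.0, 0.7), (18.0, 0.4)]
plan = align(peaks, audio_len=30.0, clip_lens=[12.0, 20.0, 9.0])
```

With three clips, the two strongest peaks (at 10 s and 25 s) become the cut points. The sketch does not handle a clip that is shorter than its segment, which is the "what if not?" case raised on the slide.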

User Control –Hitchcock System:

More Uses…
Home Videos → Music Videos
–Precious but tedious
Music artists
–MTV, VH1
Movie, TV Show, Anime Fans
–Creating free MV as hobbies

Improvements
Rhythmic synchronization
–Distinctive tempo or beat
Combining source and soundtrack audio
–Has issues with edit boundaries

Conclusions
Preliminary studies had positive outlook
–Users could interact w/ Hitchcock
Authors realized that…
–…source video’s audio should be used
Hitchcock interface combined w/ automated ordering worked well.