
Multimedia Interfaces
What is a multimedia interface?
  – Almost anything where users do not just interact with text
  – E.g., audio, speech, images, faces, video, sensor data, …

Working with Multimedia
Symbolic vs. non-symbolic content
  – How can users search and browse for the content they need?
  – What is represented and what is not?
  – Important that interface design be appropriate to the particular content processing techniques
Static vs. dynamic content
  – How can users locate particular states within a piece of content?
  – Need visualizations that enable state/segment-based indexing and visualization

General Audio
Mapping audio cues to events
  – Recognizing sounds related to particular events (e.g., gunshot, falling, scream)
Mapping events to audio cues
  – Audio debugger to speed up stepping through code
Spatialized audio
  – Provides an additional geographic/navigational channel
  – Example: Michael Joyce’s Interactive Central Park

Spatialized Audio
Spatialized audio is easier when assuming headphones, because the signal reaching each ear can be controlled
Head-related transfer function (HRTF)
  – Differences in timing and signal strength determine how we identify the position of a sound
Beamforming
  – Time the signals for constructive interference, creating a stronger signal at the desired location
Crosstalk cancellation
  – Use destructive interference to remove parts of the signal at the desired location
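The timing idea behind beamforming can be sketched as follows. This is a minimal illustration, not taken from the slides: it assumes point-source speakers at known 2-D positions, a fixed speed of sound of 343 m/s, and a hypothetical helper named steering_delays. Each speaker is delayed so that all wavefronts reach the target at the same instant and add constructively there.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for room-temperature air

def steering_delays(speakers, target):
    """Return one delay (seconds) per speaker so all signals reach target together."""
    distances = [math.dist(s, target) for s in speakers]
    farthest = max(distances)
    # The farthest speaker plays immediately (delay 0); nearer speakers wait
    # for the extra travel time the farthest one needs.
    return [(farthest - d) / SPEED_OF_SOUND for d in distances]

# Hypothetical three-speaker line array, one metre wide, target two metres away.
speakers = [(-0.5, 0.0), (0.0, 0.0), (0.5, 0.0)]
target = (0.0, 2.0)
delays = steering_delays(speakers, target)

# Arrival time at the target = travel time + applied delay; all should match.
arrivals = [math.dist(s, target) / SPEED_OF_SOUND + d
            for s, d in zip(speakers, delays)]
```

Crosstalk cancellation relies on the same geometry, but chooses timing and polarity so the signals cancel rather than reinforce at the chosen location.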

Audio Signal Analysis
Fast Fourier Transform (FFT) and Discrete Wavelet Transform (DWT)
  – Transforms commonly used on audio signals
  – Allow for analysis of frequency features across time (e.g., power contained in a frequency interval)
  – FFTs use equal-sized windows, whereas wavelet windows can vary with frequency
Mel-frequency cepstral coefficients (MFCC)
  – Based on FFTs
  – Map results into bands approximating the human auditory system
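As a rough sketch of the "power contained in a frequency interval" feature, the code below computes the spectrum of one fixed-size window and sums the squared magnitudes of the bins inside a band. For readability it uses a direct discrete Fourier transform; an FFT produces the same values, only faster. The function names, the 800 Hz sample rate, and the test tone are assumptions made for illustration.

```python
import cmath
import math

def dft(frame):
    """Direct O(n^2) discrete Fourier transform of one window of samples."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def band_power(frame, sample_rate, low_hz, high_hz):
    """Sum of squared bin magnitudes for frequencies with low_hz <= f < high_hz."""
    n = len(frame)
    spectrum = dft(frame)
    power = 0.0
    for k in range(n // 2 + 1):          # bins up to the Nyquist frequency
        freq = k * sample_rate / n       # centre frequency of bin k
        if low_hz <= freq < high_hz:
            power += abs(spectrum[k]) ** 2
    return power

# Hypothetical input: a 100 Hz sine sampled at 800 Hz, one 64-sample window.
sample_rate = 800
frame = [math.sin(2 * math.pi * 100 * t / sample_rate) for t in range(64)]
low_band = band_power(frame, sample_rate, 50, 150)    # contains the tone
high_band = band_power(frame, sample_rate, 200, 400)  # essentially empty
```

Sliding the window along the signal and repeating the calculation gives the frequency feature across time; MFCCs add a further step that groups the bins into perceptually spaced bands.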

Speech
Speaker segmentation
  – Identify when a change in speaker occurs
  – Useful for basic indexing or summarization of speech content
Speaker identification
  – Identify who is speaking during a segment
  – Enables search (and other features) based on speaker
Speech recognition
  – Identify the content of speech

Speech Recognition
Start by segmenting utterances and characterizing phonemes
  – Use gaps to segment
  – Group segments into words
Limited vocabulary of commands
  – Classifiers for limited vocabulary (HMMs)
Continuous speech
  – Language models for disambiguation
  – Speaker dependent or not
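One simple way to "use gaps to segment" is to treat runs of low-energy frames as silence. The sketch below is an assumption-laden illustration rather than the method of any particular recognizer: the frame length, energy threshold, minimum gap length, and the function name segment_by_gaps are all hypothetical parameters chosen for the example.

```python
def segment_by_gaps(samples, frame_len, energy_threshold, min_gap_frames):
    """Return (start_frame, end_frame) pairs, end exclusive, for runs of loud
    frames separated by at least min_gap_frames consecutive quiet frames."""
    # Mean squared amplitude of each non-overlapping frame.
    energies = []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)

    segments = []
    start = None      # first frame of the segment currently open, if any
    quiet_run = 0     # consecutive quiet frames seen inside the open segment
    for idx, energy in enumerate(energies):
        if energy >= energy_threshold:
            if start is None:
                start = idx
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_gap_frames:
                # The gap is long enough: close the segment before the gap.
                segments.append((start, idx - quiet_run + 1))
                start = None
                quiet_run = 0
    if start is not None:
        # Close a segment still open at the end, dropping trailing quiet frames.
        segments.append((start, len(energies) - quiet_run))
    return segments

# Hypothetical signal: silence, a long burst, a two-frame gap, a short burst.
signal = [0.0] * 4 + [1.0] * 8 + [0.0] * 8 + [1.0] * 4 + [0.0] * 4
utterances = segment_by_gaps(signal, frame_len=4, energy_threshold=0.5,
                             min_gap_frames=2)
```

Raising min_gap_frames merges bursts separated by short pauses, which is one way to group segments into larger units.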

Music
Music processing can support a variety of activities
Composition
  – From traditional to interactive
Selection
  – Examples: iTunes, Pandora
  – Use for shared spaces
Playback
  – Interactive playback, social playback
Management & Summarization
  – Example: MusicWiz
Games
  – Guitar Hero, Rock Band, etc.

MobiLenin
Enable interaction with music in a public space
  – Not karaoke
Voting like in many pub/bar games
Audience can affect which version of music and video is shown

MusicWiz
[Figure: MusicWiz architecture diagram. Labels: Music Collection, Songs & Metadata, MusicWiz Interface, Workspace Status, Related Song Titles, Inference Engine, Metadata Module, Audio Signal Module, Lyrics Module, Workspace Expression Module, Artist Module, Relatedness Table, Relatedness Assessment, Similarity Values, Internet, Lyrics, Statistics of Artist Similarity]
Music management environment that combines:
  – explicit information
  – implicit information
  – non-verbal expression of personal interpretation
Two basic components:
  – interface for interacting with the music collection
  – inference engine for assessing music relatedness

Image Processing: Color
Color histograms – how much of each color is in the image
  – Probability of a pixel in the image being a particular color
Color correlograms – how close colors are to each other in the image
  – Probability of finding a pixel of a particular color at a specific distance from a pixel of a known color
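Both probabilities can be estimated by counting. The sketch below assumes the image has already been quantized to a small set of color indices and is stored as a list of rows; the function names are hypothetical. For the correlogram, "distance" is taken to be the chessboard distance (the larger of the row and column offsets), which is a common choice but an assumption here.

```python
from collections import Counter

def color_histogram(image):
    """Probability of a pixel being each color."""
    counts = Counter(color for row in image for color in row)
    total = sum(counts.values())
    return {color: n / total for color, n in counts.items()}

def color_correlogram(image, distance):
    """Map (known_color, other_color) to the probability that a pixel at the
    given chessboard distance from a known_color pixel has other_color."""
    height, width = len(image), len(image[0])
    # All offsets lying exactly on the square ring at this distance.
    offsets = [(dy, dx)
               for dy in range(-distance, distance + 1)
               for dx in range(-distance, distance + 1)
               if max(abs(dy), abs(dx)) == distance]
    pair_counts = Counter()
    base_counts = Counter()
    for y in range(height):
        for x in range(width):
            for dy, dx in offsets:
                ny, nx = y + dy, x + dx
                if 0 <= ny < height and 0 <= nx < width:
                    pair_counts[(image[y][x], image[ny][nx])] += 1
                    base_counts[image[y][x]] += 1
    return {pair: n / base_counts[pair[0]] for pair, n in pair_counts.items()}

# Hypothetical 2x4 image: color 0 on the left half, color 1 on the right half.
image = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
hist = color_histogram(image)
corr = color_correlogram(image, distance=1)
```

Two images with identical histograms can still differ in their correlograms, because the correlogram also captures how the colors are arranged spatially.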

Image/Video Processing: Subdividing
Region subdivision
  – Sometimes we subdivide images into regions
  – Spread observed features at edges for a more continuous model
Temporal subdivision
  – Video is subdivided into segments
  – Spread features into neighboring segments

Image Processing: Foreground-Background Separation
Background modeling
  – Convert to greyscale
  – Dynamic model (to cope with changes in signer body position and lighting): $BP_t = 0.96 \cdot BP_{t-1} + 0.04 \cdot P$
Foreground object detection
  – Pixels that differ from the background model by more than a threshold are foreground pixels
  – Spatial filter removes regions of foreground pixels smaller than a minimum threshold
Face location is used to determine the position of the foreground relative to the face
Videos without a single main face are not considered as potential sign language (SL) videos
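The background update and foreground test can be sketched on greyscale frames stored as lists of rows. This is an illustration under assumptions: P is taken to be the pixel's value in the current frame, connected regions use 4-connectivity, and the function names and threshold values are hypothetical. Only the 0.96/0.04 weights come from the slide.

```python
ALPHA = 0.04  # weight of the current frame; 1 - ALPHA = 0.96 keeps the old model

def update_background(background, frame, alpha=ALPHA):
    """Running average: BP_t = (1 - alpha) * BP_(t-1) + alpha * P, per pixel."""
    return [[(1 - alpha) * b + alpha * p for b, p in zip(b_row, p_row)]
            for b_row, p_row in zip(background, frame)]

def foreground_mask(background, frame, threshold):
    """True where a pixel differs from the background model by more than threshold."""
    return [[abs(p - b) > threshold for b, p in zip(b_row, p_row)]
            for b_row, p_row in zip(background, frame)]

def remove_small_regions(mask, min_size):
    """Spatial filter: keep only 4-connected foreground regions with >= min_size pixels."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    cleaned = [[False] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if mask[y][x] and not seen[y][x]:
                # Flood fill to collect one connected region.
                stack, region = [(y, x)], []
                seen[y][x] = True
                while stack:
                    cy, cx = stack.pop()
                    region.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < height and 0 <= nx < width
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((ny, nx))
                if len(region) >= min_size:
                    for ry, rx in region:
                        cleaned[ry][rx] = True
    return cleaned

# Hypothetical data: flat background, one isolated bright pixel, one 2x2 bright block.
background = [[100.0] * 4 for _ in range(3)]
frame = [[100, 100, 100, 200],
         [200, 200, 100, 100],
         [200, 200, 100, 100]]
mask = remove_small_regions(foreground_mask(background, frame, threshold=30), min_size=2)
background = update_background(background, frame)
```

Because each update moves the model only 4% of the way toward the current frame, a signer who shifts position is absorbed into the background gradually rather than immediately.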

Image Processing: Other Features
Edge detection
  – Sobel filter
Object and face detection
  – Skin-tone models
Face recognition
Open Source Computer Vision (OpenCV)
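A Sobel filter estimates the horizontal and vertical intensity gradients with two 3x3 kernels and combines them into an edge strength. The pure-Python sketch below is only meant to show the arithmetic; it assumes a greyscale image stored as a list of rows, leaves border pixels at zero, and uses a hypothetical function name. In practice a library such as OpenCV would normally be used instead.

```python
import math

SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1],
           [0, 0, 0],
           [1, 2, 1]]

def sobel_magnitude(image):
    """Gradient magnitude per pixel; border pixels are left at 0.0."""
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0
            for ky in range(3):
                for kx in range(3):
                    pixel = image[y + ky - 1][x + kx - 1]
                    gx += SOBEL_X[ky][kx] * pixel
                    gy += SOBEL_Y[ky][kx] * pixel
            out[y][x] = math.hypot(gx, gy)
    return out

# Hypothetical 4x4 image with a vertical edge between the second and third columns.
image = [[0, 0, 10, 10] for _ in range(4)]
edges = sobel_magnitude(image)
```

Thresholding the resulting magnitudes gives a binary edge map.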

MediaGLOW: Interpreting User Action
Evolving notion of similarity via user expression
  – Photos are presented in a graph-based workspace with “springs” between each pair of photos
  – Spring lengths are initially set by a default distance metric based on the photos' time, location, tags, or visual features
  – Users can pin photos in place and create piles of photos
  – The distance metric to a pile changes as new members are added, resulting in a dynamic layout of the unpinned photos in the workspace

DOTS: Supporting Use of Surveillance Video
The problem
  – Number and size of surveillance systems are increasing, but human attention is the limiting factor
Approach
  – Provide summaries of action
  – Build interfaces knowing the limits of automation

DOTS: The Main Interface
Components
  – Rotating camera bank with activity graphs
  – Mixed-initiative main viewer
  – Map with tracking data
  – Timeline with automatic events

DOTS: Tracking Layout
Difficulty in tracking is that camera views are often similar
Tracking layout places cameras around the main viewer to aid tracking
Study showed significant improvement in tracking success over the traditional viewer
In either layout, the map can be used to find activity near a location and time

Hyper-Hitchcock: Interactive Video
Issue
  – Vision: seamlessly interact with characters in the show
  – Reality: difficult to author even simple interactive videos
Today, video is included within pages of content, but links between playing videos are not common

Support for Hypervideo Authoring
Links in video can lead to other video segments
  – Short main video with branches providing additional detail
  – Hyperlinks to branches just like in Web pages
  – Making of a scene in a movie, biography of an actor, different camera angle
General hypervideo is difficult to author
  – Simple hypervideo format with only a single active link
Novel approach: use automatic video analysis, create an easy-to-use interface, and support a simple hypervideo format

Hierarchical Video with Links
Video sequences are represented as a containment hierarchy of video elements
  – Elements are video clips or composites grouping other video elements
  – Elements are played in sequence
Each element can be a link anchor or link destination
Anchor for the innermost element is available while the element is playing
After the link destination video is played, playback continues at the link anchor
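The containment hierarchy and the "innermost anchor wins" rule can be sketched with a small tree structure. This is an illustrative model only, not Hyper-Hitchcock's actual data format: the class names, fields, and the active_link helper are hypothetical, and durations are in seconds.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Link:
    label: str
    destination: "Element"

@dataclass
class Element:
    name: str
    duration: float = 0.0                      # used only by clips (leaves)
    children: List["Element"] = field(default_factory=list)
    link: Optional[Link] = None                # set if this element is a link anchor

    def total_duration(self) -> float:
        """A clip's own duration, or the sum of a composite's children."""
        if not self.children:
            return self.duration
        return sum(child.total_duration() for child in self.children)

def active_link(element: Element, t: float) -> Optional[Link]:
    """Link of the innermost anchor playing at time t, or None if there is none."""
    if not 0 <= t < element.total_duration():
        return None
    offset = 0.0
    for child in element.children:             # children play in sequence
        length = child.total_duration()
        if t < offset + length:
            inner = active_link(child, t - offset)
            if inner is not None:
                return inner                   # a deeper anchor takes priority
            break
        offset += length
    return element.link

# Hypothetical video: a 10 s intro, then a composite of two scenes.
detail = Element("making-of", duration=5)
bio = Element("actor-bio", duration=3)
scene_a = Element("scene-a", duration=4, link=Link("Making of", detail))
scene_b = Element("scene-b", duration=6)
scenes = Element("scenes", children=[scene_a, scene_b], link=Link("Biography", bio))
root = Element("root", children=[Element("intro", duration=10), scenes])
```

With this structure there is at most one active link at any moment, which matches the simple hypervideo format described earlier.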

Detail-on-demand Links
Any video clip or composite can be a link anchor or link destination
Optional link offsets into destination
Links have labels
Link return behaviors control the purpose of the link
  – Play from where the viewer left the video
  – Play from the end of the source anchor sequence
  – Play from the beginning of the source anchor sequence
  – Stop playback
Different behaviors for destination completion or aborted playback
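The four return behaviors, and the choice between a completion behavior and an abort behavior, can be modelled as a small lookup. The enum member names and function signatures below are hypothetical, chosen only to mirror the list above; times are positions in the source video, in seconds.

```python
from enum import Enum

class ReturnBehavior(Enum):
    RESUME = "play from where the viewer left the video"
    AFTER_ANCHOR = "play from the end of the source anchor sequence"
    RESTART_ANCHOR = "play from the beginning of the source anchor sequence"
    STOP = "stop playback"

def return_position(behavior, anchor_start, anchor_end, left_at):
    """Source-video time at which playback continues, or None to stop."""
    if behavior is ReturnBehavior.RESUME:
        return left_at
    if behavior is ReturnBehavior.AFTER_ANCHOR:
        return anchor_end
    if behavior is ReturnBehavior.RESTART_ANCHOR:
        return anchor_start
    return None  # ReturnBehavior.STOP

def on_destination_finished(completed, on_complete, on_abort,
                            anchor_start, anchor_end, left_at):
    """Pick the behavior for a completed or aborted destination, then apply it."""
    behavior = on_complete if completed else on_abort
    return return_position(behavior, anchor_start, anchor_end, left_at)

# Hypothetical link: the anchor spans 10-20 s and the viewer followed it at 14 s.
after_watching = on_destination_finished(
    True, ReturnBehavior.AFTER_ANCHOR, ReturnBehavior.RESUME, 10.0, 20.0, 14.0)
after_aborting = on_destination_finished(
    False, ReturnBehavior.AFTER_ANCHOR, ReturnBehavior.RESUME, 10.0, 20.0, 14.0)
```

In this example a viewer who watches the whole detail skips ahead past the anchor, while one who backs out early resumes exactly where they left off.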

Hyper-Hitchcock Editor
Hyper-Hitchcock evolved from the Hitchcock video editor
Video clips grouped in piles by similarity (e.g., recording time)
Workspace to arrange clips
  – Resize keyframes to trim clips
  – Clips ordered as horizontal or vertical lists
  – Place links between clips
  – Group clips into composites
Tree view to visualize the containment hierarchy of composites

Trimming Clips in the Workspace
Best five seconds of clip selected by default
Resizing keyframe changes length of clip
  – Picks the best portion around the initial five-second portion
  – Start and end can jump to sentence-boundary silence
Clip start and/or end can be locked in timeline
Locked ends can be dragged
Audio energy visualized in timeline to spot words and sentences

Attaching Links to Clips and Composites
Link anchors and destinations can be clips, composites, or elements inside composites
Color-coding and position indicate link attachment in the workspace
Links in and out of a composite
  – Blue: attached to composite
  – Red: attached to element
  – Dashed: between composite and element

Hypervideo Player
Video player with controls for following and returning from links
Several improvements based on user feedback
  – First version indicated links in timeline and showed the label for the active link
  – Next version showed labels in timeline
  – Current version includes keyframes for active link and for link history
User study suggests further improvements

Today’s Topics
General audio
  – Audio cues, spatialized audio
Speech
  – Segmentation, speaker ID, recognition
Music
  – Interactive music, summarization, organization
Image and video processing
  – Color-oriented representations
  – Region and temporal segmentation
  – Foreground-background separation
  – Edge and face detection
Image and video applications
  – MediaGLOW: image selection
  – DOTS: surveillance
  – Hyper-Hitchcock: interactive video