8th Annual CSIS Research Conference 1 Client Server Browsing of Sound Resources: Classification and Browsing E. Brazil Interaction Design Centre University.

Slides:



Advertisements
Similar presentations
1 Copyright © 2002 Pearson Education, Inc.. 2 Chapter 1 Introduction to Perl and CGI.
Advertisements

Remote Visualisation System (RVS) By: Anil Chandra.
Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall
KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
MULTIMEDIA DEVELOPMENT 4.3 : AUTHORING TOOLS. At the end of the lesson, students should be able to: 1. Describe different types of authoring tools Learning.
1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
EE442—Multimedia Networking Jane Dong California State University, Los Angeles.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Multimedia Search and Retrieval: New Concepts, System Implementation, and Application Qian Huang, Atul Puri, Zhu Liu IEEE TRANSACTION ON CIRCUITS AND SYSTEMS.
Soft. Eng. I, Fall 2006Dr Driss Kettani, from I. Sommerville1 CSC-3324: Chapter 6 Software Design Section 10.3 (except )
Tree-Maps: A Space-Filling Approach to the Visualization of Hierarchical Information Structures Brian Johnson Ben Shneiderman (HCIL TR 91-06) Steve Betten.
ITEC810 Project By: P. M. Mathindri Nilushika Pathiraja 1.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
MUSCLE movie data base is a multimodal movie corpus collected to develop content- based multimedia processing like: - speaker clustering - speaker turn.
Final Year Student Projects: Prelude Michael R. Lyu.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Presented by Zeehasham Rasheed
Semantic Video Classification Based on Subtitles and Domain Terminologies Polyxeni Katsiouli, Vassileios Tsetsos, Stathes Hadjiefthymiades P ervasive C.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
What is Asset Bank? Asset Bank is an enterprise-scale Digital Asset Management system A fully searchable, categorised library of digital images, videos.
Chapter 4 Product Design. Objectives of Design  As all other aspects of object-oriented system development, design can be deployed in an iterative or.
Sound Applications Advanced Multimedia Tamara Berg.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Adaptive 3D Web Sites by by Luca Chittaro and Roberto Ranon MAJ(P) Charles E. Grindle 7 Dec 05.
Chapter 11-Multimedia Authoring Tools. Overview Introduction to multimedia authoring tools. Types of authoring tools. Cross-platform authoring notes.
Audio classification Discriminating speech, music and environmental audio Rajas A. Sambhare ECE 539.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
You are probably surrounded by music whether you realize it or not. It may be by playing an instrument, listening to songs on the radio, studying music,
Visual User Interfaces David Rashty. “Grasping the whole is a gigantic theme. Arguably, intellectual history’s most important. Ant-vision is humanity’s.
DYNAMIC WAP BASED VOTING SYSTEM Bertrand COLAS Submission date: May 2002 School of Computing Bachelor of Engineering with Honours in Computer.
I Copyright © 2004, Oracle. All rights reserved. Introduction Copyright © 2004, Oracle. All rights reserved.
ICAD-01, Espoo, Finland, July 29 - August 1, 2001 Sonic Browsing: An Auditory Tool For Multimedia Asset Management Mikael Fernström & Eoin Brazil Interaction.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
1 Welcome to CSC 301 Web Programming Charles Frank.
Introduction to Making Multimedia
School of Informatics School Research Conference 2003 Noise and other stuffDr Paul Vickers1 Noise, mobility, accessibility, and other stuff Paul Vickers.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
A NOVEL PREFETCHING METHOD FOR SCENE-BASED MOBILE SOCIAL NETWORK SERVICE 作者 :Song Li, Wendong Wang, Yidong Cui, Kun Yu, Hao Wang 報告者 : 饒展榕.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Interaction design centre ICAD-03, Boston, USA, July 6-9, 2003 Experiments with the Sonic Browser Eoin Brazil, Mikael Fernström Interaction Design Centre.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
Performance Comparison of Speaker and Emotion Recognition
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System Lian Mo, Alan Jiang, Junhua Ding April, 2001.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
Web Server By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Unit 19 Computer Music Systems 1 Examine the hardware options available for the composition and production of music using computer technology assess the.
What is Multimedia Anyway? David Millard and Paul Lewis.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
CS 445/656 Computer & New Media
MATLAB Distributed, and Other Toolboxes
CHAPTER 8 Multimedia Authoring Tools
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Chapter 10 Development of Multimedia Project
Overview What is Multimedia? Characteristics of multimedia
Audio and Speech Computers & New Media.
Realtime Recognition of Orchestral Instruments
Realtime Recognition of Orchestral Instruments
Presentation transcript:

8th Annual CSIS Research Conference 1 Client Server Browsing of Sound Resources: Classification and Browsing E. Brazil Interaction Design Centre University of Limerick Ireland

8th Annual CSIS Research Conference 2 Introduction ?- how to classify sound resources and how to provide an interface to browse these resources. !- provide a browsable sound database for users via intranet / Internet environments

8th Annual CSIS Research Conference Overview of Research Areas Sound Classification Sound Representation Sound Browsing

8th Annual CSIS Research Conference Sound Classification Two levels of classification Course level –Distinguish whether Speech, Music, Environmental, Silence or Other category Fine level –Use human perceptual features

8th Annual CSIS Research Conference Coarse-level classification of audio (1) –Audio signals are classified into basic types, including speech, music, several types of environmental sounds, and silence –Take morphological and statistical analyses of short- time feature curves (energy function, average zero- crossing rate, fundamental frequency), as well as a rule- based heuristic classification procedure

8th Annual CSIS Research Conference Coarse-level classification of audio (2) Short-time energy function –Short-time energy of audio signal reflects the amplitude variations over time Short-time average zero-crossing rate –ZCR is the number of times the signal passes through zero in a given time interval Spectral Centroid

8th Annual CSIS Research Conference Fine-level classification of audio Further classification will be conducted within each basic type: –music: classify music played by different instruments, different types of music, singing, plain song –speech: differentiate voices of man, woman, and child, speech with music background –environmental sound: divide them into classes such as applause, bell ring, footstep, windstorm, laughter, bird’s sound, and so on

8th Annual CSIS Research Conference Sound Representation Previous work has concentrated on –Visual star-field type display New novel visual representations –Visualisations on spheres (non-Euclidean spaces) –Hyper tree –Excentric labeling

8th Annual CSIS Research Conference Star-field Display Virtual University - Uni. Vienna

8th Annual CSIS Research Conference Visualisations on Spheres H3: Laying Out Large Directed Graphs in 3D Hyperbolic Space - Munzer

8th Annual CSIS Research Conference Hyper Tree

8th Annual CSIS Research Conference Excentric Labeling HCIL – Uni. Maryland

8th Annual CSIS Research Conference Sound Browsing Iterative & Interactive Activity: –Opportunistic & Serendipitous Enable users’ to explore a data set External & internal properties of objects: –Context & Content Evaluate and revise understanding of relationships

8th Annual CSIS Research Conference 14 The Sonic Browser Application Audio: Direct representation of tunes (exploting the cocktailparty effect) Sounds are panned out in a stereo field controlled by the visual location of the tunes nearest to the cursor. The volume of the tunes playing concurrently is proportional to the visual distance between the objects and the cursor

8th Annual CSIS Research Conference 16 The Sonic Browser Application

8th Annual CSIS Research Conference Client – Server Issues let the server do the mixing and spatialisation analysis and classification on server lightweight client - Java. different network topologies and protocols. –Latency issues –Use of a floating ‘Aura’

8th Annual CSIS Research Conference Cue Points Use Cue Points as Marker Points –Mark a specific point or section of a sound Play only significant portion of sound while browsing Reduce time to identify sound by playing characteristic or significant part Found in many common sound file formats *Technical Report UL-IDC-01-02

8th Annual CSIS Research Conference 22 Application Platform: HW & OS Normal Multimedia PC –(Pentium II/III w. SB Live, etc) Server –MS Windows 98/2000 Client –Any O/S with Java Runtime

8th Annual CSIS Research Conference Conclusion Facilitate different visualisation tools, e.g. for non-Euclidean space. Address payment and copyright issues Investigate other file types, e.g. MPEG-7.

8th Annual CSIS Research Conference References (1) Brazil, E. (2001). Cue Points: An Examination Of Common Sound File Formats. Limerick, University of Limerick. Fekete, J. D., Plaisant, C. (1999). Excentric Labeling: Dynamic Neighborhood Labeling for Data Visualization. Conference on Human factors in Computer Systems, New York, ACM. Fernström, M., Brazil, E. (2001). Sonic Browsing: An Auditory Tool For Multimedia Asset Management. International Conference on Auditory Display, Espoo, Finland. Ó Maidín, D. and M. Fernström (2000). The Best of Two Worlds: Retrieving and Browsing. COST-G6 Conference on Digital Audio Effects DAFx-00, Verona, Universita degli Studi Verona.

8th Annual CSIS Research Conference References (2) Shneiderman, B. (1996). The eyes have it: A task by data type taxonomy for information visualizations. IEEE, Visual Languages, Boulder, CO, USA. Zhang, T., Kuo, C.C. (1998). Content-based Classification and Retrieval of Audio. SPIE's 43rd Annual Meeting - Conference on Advanced Signal Processing Algorithms, Architectures, and Implementations VIII, San Diego. Zhang, T., Kuo, C.C. (1998). Hierarchical System for Content- Based Audio Classification and Retrieval. SPIE's Conference on Multimedia Storage and Archiving Systems III, Boston.