Audio Fingerprinting as a New Task for MIREX-2014
Chung-Che Wang, Jyh-Shing Roger Jang

Presentation transcript:

2 AFP Dataset
Database
  - 589 songs (*.mp3 and *.wav)
  - Languages: English, Chinese, Japanese
  - […] mono, 554 stereo
  - […] hours long
Query set
  - 305 songs (*.wav)
  - 166 mono, 139 stereo
  - […] hours long
  - Recorded by smartphones in various environments
  - Various sampling rates and bit resolutions

3 AFP As a New Task for MIREX
Database
  - Database size: 10K or more
  - Open set: the first 30-sec clips corresponding to the queries
  - Restriction on the indexed database:
      - 40 KB for every 10 seconds of audio
      - […] MB overhead
Query set
  - Query length: 10 sec
  - Chop all recordings into 10-sec query segments
  - Open and hidden sets: the first 2 clips of each recording are open, and all the others are hidden
  - Restriction on the features for each query example: 40 KB for every 10 seconds of recording
Performance measure
  - Top-1 accuracy
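
The evaluation protocol on this slide can be illustrated with a minimal Python sketch. It is not the organizers' evaluation code. All function names are hypothetical, and several details are assumptions the slide does not state: a trailing remainder shorter than 10 seconds is dropped, 1 KB is taken as 1024 bytes, and the 40 KB limit is scaled linearly with audio length. The database overhead allowance is not modeled because its value is missing from the transcript.

```python
from typing import Dict, List, Tuple

QUERY_SEC = 10                      # query length stated on the slide
BUDGET_BYTES_PER_10S = 40 * 1024    # 40 KB per 10 s; 1 KB = 1024 bytes is an assumption
OPEN_CLIPS_PER_RECORDING = 2        # first 2 clips of each recording are open


def chop(duration_sec: float, seg_sec: int = QUERY_SEC) -> List[Tuple[int, int]]:
    """Return (start, end) offsets in seconds of the full-length segments.

    Assumption: a trailing remainder shorter than seg_sec is discarded.
    """
    n_segments = int(duration_sec // seg_sec)
    return [(i * seg_sec, (i + 1) * seg_sec) for i in range(n_segments)]


def split_open_hidden(
    segments: List[Tuple[int, int]], n_open: int = OPEN_CLIPS_PER_RECORDING
) -> Tuple[List[Tuple[int, int]], List[Tuple[int, int]]]:
    """Split one recording's segments into the open set and the hidden set."""
    return segments[:n_open], segments[n_open:]


def within_budget(feature_bytes: int, audio_sec: float) -> bool:
    """Check the size restriction, scaled linearly with duration (assumption)."""
    allowed = BUDGET_BYTES_PER_10S * audio_sec / QUERY_SEC
    return feature_bytes <= allowed


def top1_accuracy(predictions: Dict[str, str], truth: Dict[str, str]) -> float:
    """Fraction of queries whose top-ranked song matches the ground truth."""
    if not truth:
        return 0.0
    hits = sum(1 for query, song in truth.items() if predictions.get(query) == song)
    return hits / len(truth)


if __name__ == "__main__":
    segments = chop(35)                       # a hypothetical 35-second recording
    open_set, hidden_set = split_open_hidden(segments)
    print(len(open_set), len(hidden_set))
    print(within_budget(30_000, 10))
    print(top1_accuracy({"q1": "a", "q2": "b"}, {"q1": "a", "q2": "c"}))
```

A query missing from `predictions` counts as a miss, so a system that returns nothing for a segment is penalized the same way as one that returns the wrong song.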