Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16.

Slides:



Advertisements
Similar presentations
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Advertisements

Multimedia Database Systems
Presented by Xinyu Chang
Int 2 Multimedia Revision. Digitised Sound Analogue sound recorded from person, or real instruments.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
ARNOLD SMEULDERS MARCEL WORRING SIMONE SANTINI AMARNATH GUPTA RAMESH JAIN PRESENTERS FATIH CAKIR MELIHCAN TURK Content-Based Image Retrieval at the End.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
Multimedia Search and Retrieval: New Concepts, System Implementation, and Application Qian Huang, Atul Puri, Zhu Liu IEEE TRANSACTION ON CIRCUITS AND SYSTEMS.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
A. Frank Multimedia Multimedia/Video Search. 2 A. Frank Contents Multimedia (MM) and search/retrieval Text-based MM search in General SEs Text-based MM.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Content-Based Video Retrieval System Presented by: Edmund Liang CSE 8337: Information Retrieval.
The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung.
Multimedia Databases (MMDB)
1 Lecture 12: Multimedia Not in Web 101 Text  Important Multimedia Issues  Audio  Movies and Video  Multimedia and HTML Documents.
Information Technologies for Presentation of Bulgarian Folk Songs with Music, Notes and Text in a Digital Library Lozanka Peycheva, Nikolay Kirov, Maria.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
Computer Vision – Overview Hanyang University Jong-Il Park.
Sound or Audio, whichever you prefer –MIDI Files.midi or.mid (Musical Instrument Digital Interface) use for instrumental music. –This format is supported.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A Comparison of SOM Based Document Categorization Systems.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
By Chung-Hong Lee ( 李俊宏 ) Assistant Professor Dept. of Information Management Chang Jung Christian University 資料庫與資訊檢索系統的整合 - 一個文件資料庫系統的開發研究.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Chapter 6: Information Retrieval and Web Search
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Content-Based Image Retrieval Using Fuzzy Cognition Concepts Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung University.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Melodic Similarity Presenter: Greg Eustace. Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation.
Content-Based Image Retrieval Using Block Discrete Cosine Transform Presented by Te-Wei Chiang Department of Information Networking Technology Chihlee.
MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.
CSCI-100 Introduction to Computing Hardware Part II.
QBSH Corpus The QBSH corpus provided by Roger Jang [1] consists of recordings of children’s songs from students taking the course “Audio Signal Processing.
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
Yixin Chen and James Z. Wang The Pennsylvania State University
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
SAPIR Search in Audio-Visual Content using P2P Information Retrival For more information visit: Support.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Content-Based Image Retrieval Using Color Space Transformation and Wavelet Transform Presented by Tienwei Tsai Department of Information Management Chihlee.
Looking for New, Not Known Music Only : Music Retrieval by Melody Style Fang-Fei Kuo Dept. of Computer Science and Information Engineering National Chiao.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
Naifan Zhuang, Jun Ye, Kien A. Hua
Introduction Multimedia initial focus
Introduction to Music Information Retrieval (MIR)
Ying Dai Faculty of software and information science,
Multimedia Information Retrieval
Color Image Retrieval based on Primitives of Color Moments
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16

Outline 1. Introduction 2. System Architecture 3. Proposed Methods 4. Experimental Results 5. Conclusions

1. Introduction Multimedia Information Retrieval Traditional multimedia database — File name or keywords Content-Based Retrieval (CBR) The musical files of the computer are mainly included WAV and MIDI MP3 (MPEG Audio Layer-3) –Tone quality is like CD music –Storage space is small Text 、 Picture and Audio showing the information description of the information content Waveform Audio : Recording the real musical data Musical Instrument Digital Interface : Recording the simplified musical data WAVMIDI tone quality memory space

1. Introduction cont. Query By Humming (QBH) is getting more and more attention Using QBH to search MP3 music, we have to consider the following issues: –Background music –Inaccuracy of humming –Inconsistent length of phrase

2. System Architecture MP3 Database Feature Extraction and Transformation Feature Extraction and Transformation MP3 Segmentation MP3 Segmentation MP3 Phase Feature Database Feature Extraction and Transformation Feature Extraction and Transformation Similarity Matching MP3 Phase Feature Database Establishment of MP3 Feature Database. Music Content-Based Retrieval. MP3 M P 3 (1,2,3) 1.xxxxxx 2. xxxxxx 3. xxxxxx

3. Proposed Methods Relative note is defined as the pitch change between the current musical note and its preceding one The N-gram approach is according to the relationship of near N musical notes Using N-gram to defined mapping functions (N = 2, 3 and 4) 79, 76, 76, 77, 74, 74, 72, 74, 76, 77, 79, 79, > 76 = 76 < 77 > 74 = 74 > 72 < 74 <76 < 77 < 79 = 79 = 79

3. Proposed Methods cont. Relative Note Mapping Function Distance of near musical notes Distance of near musical notes Euclidean distance Method I Method II MIDI Note Similarity Matching of the MP3 Phase N-gram : Bi-gram 、 Tri-gram 、 Four-gram Bi-gram 、 Tri-gram 、 Four-gram by proper proportion

3. Proposed Methods Method I <1 =2 >3 Bi-gram Tri-gram Four-gram <<1=<4><7 <=2==5>=8 <>3=>6>>9 N = 2 N = 3 N = 4 <<<1=<<10><<19 <<=2=<=11><=20 3=<>12><>21 <=<4==<13>=<22 <==5===14>==23 6==>15>=>24 <><7=><16>><25 <>=8=>=17>>=26 <>>9=>>18>>>27 Method I : Method Based on Relative Note

3. Proposed Methods Method I <1 =2 >3 79 > 76 = = 74 > 72 < 74 < 76 < 77 < 79 = 79 = <<1=<4><7 <=2==5>=8 <>3=>6>> Bi-gramTri-gram Method I : Method Based on Relative Note

3. Proposed Methods Method I Euclidean distance N is the smaller dimension between and = The vector of query i by singer The vector of phrase j in MP3 phrase feature database Mapping Function (3,2,1) (2,1,3) (3,2,1,3,3) (2,1,3) N = 3 Min,,

3. Proposed Methods Method I w B + w T + w F = 1 Bi-gram mapping function Tri-gram mapping function Four-gram mapping function

3. Proposed Methods Method II Singer humming and the data of paragraph of MP3 phases Euclidean distance 79, 76, 76, 77, 74, 74, 72, 74, 76, 77, 79, 79, 79 -3, 0, +1, -3, 0, -2, +2, +2, +1, +2, 0, 0 Method II : Method Based on Note Difference

Original <=> <123 =456 >789 Sample11<2<3 Diff. <<16 Correct 3>2<3 ><7 Sample23>2>1 2 >>9 Method III : Modified Mapping Function 3. Proposed Methods Method III

Improve <=> <T1T2T3 =T4T5T6 >T7T8T9 Sample11<2<3Diff T7-T12 Correct 3>2< T7 Sample23>2>12 T7-T

4. Experimental Results The aim of these experiments are –Investigate the performance of the matching method –Probe into some influence factors in order to achieve better performance Data Collection –Cashbox KTV billboard most hot 20 songs –675 phases

4. Experimental Results cont. 8.5% 38.3% 53.2% 27.6% 66.0% 76.6% 14.9% 78.7% 91.5% 0.0% 10.0% 20.0% 30.0% 40.0% 50.0% 60.0% 70.0% 80.0% 90.0% 100.0% TOP 5TOP 10TOP 15 Method IMethod IIMethod III Method I : Using N-gram to do melody similarity comparison (the baseline one). Method II : Using near note difference to do melody similarity comparison. Method III : Using N-gram to do melody similarity comparison (modified mapping function). Mapping Function Something Wrong MP3 music extraction Did not sing the songs correctly Describe the characteristic of the music

4. Experimental Results cont. Method 1 Method 2 More flexible Too rigid

5. Conclusions The aim of this paper is to develop a system for users to retrieve the MP3 music object by simply humming Matching methods –Using N-gram and Mapping Function to do melody similarity comparison (Method I 、 III) –Using near note difference to do melody similarity comparison (Method II) Our experimental results show that –Encoding based on N-gram method and using Mapping Function to the music melody matching is feasible

5. Conclusions cont. The proportion of Bi-gram, Tri-gram and Four-gram –Bi-gram information was naturally loose, but it was more flexible and fault-tolerant. –Four-gram was more rigid and strong, but it was less flexible. –The behaviors of Tri-gram were between those of Bi-gram and Four-gram. Tri-gram > Bi-gram > Four-gram

5. Conclusions cont. Method IIMethod III The up and down relationship The characteristic of pitch difference The up and down relationship Too rigid in a senseMore flexible Almost correctSome mistakes

5. Conclusions cont. Find out more reasonable Mapping Functions Merge Method III and Method II Use better fault-tolerant techniques Consider different kinds of characteristics of music Develop a better method of extracting the melody of MP3 object Segment MP3 phrases automatically

Thank You !!!