Audio Content Description

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

A Brief Overview of Neural Networks By Rohit Dua, Samuel A. Mulder, Steve E. Watkins, and Donald C. Wunsch.
1 Image Classification MSc Image Processing Assignment March 2003.
Designing Facial Animation For Speaking Persian Language Hadi Rahimzadeh June 2005.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Perceptron.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Applications of Wavelet Transform and Artificial Neural Network in Digital Signal Detection for Indoor Optical Wireless Communication Sujan Rajbhandari.
Equine Gait Analysis and Visualization Methods Dr. Marjorie Skubic Samer Arafat Justin Satterley Computer Engineering & Computer Science Dr. Kevin Keegan.
Wavelet Transform. What Are Wavelets? In general, a family of representations using: hierarchical (nested) basis functions finite (“compact”) support.
Machine Learning Motivation for machine learning How to set up a problem How to design a learner Introduce one class of learners (ANN) –Perceptrons –Feed-forward.
Optimal Adaptation for Statistical Classifiers Xiao Li.
Signal Processing of Germanium Detector Signals David Scraggs University of Liverpool UNTF 2006.
A Wavelet-Based Approach to the Discovery of Themes and Motives in Melodies Gissel Velarde and David Meredith Aalborg University Department of Architecture,
Introduction to Wavelets -part 2
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Database Construction for Speech to Lip-readable Animation Conversion Gyorgy Takacs, Attila Tihanyi, Tamas Bardi, Gergo Feldhoffer, Balint Srancsik Peter.
CSC 4510 – Machine Learning Dr. Mary-Angela Papalaskari Department of Computing Sciences Villanova University Course website:
Functional Brain Signal Processing: EEG & fMRI Lesson 8 Kaushik Majumdar Indian Statistical Institute Bangalore Center M.Tech.
Electrical and Computer Systems Engineering Postgraduate Student Research Forum 2001 WAVELET ANALYSIS FOR CONDITION MONITORING OF CIRCUIT BREAKERS Author:
Multiple-Layer Networks and Backpropagation Algorithms
Kumar Srijan ( ) Syed Ahsan( ). Problem Statement To create a Neural Networks based multiclass object classifier which can do rotation,
Appendix B: An Example of Back-propagation algorithm
Lecture 13 Wavelet transformation II. Fourier Transform (FT) Forward FT: Inverse FT: Examples: Slide from Alexander Kolesnikov ’s lecture notes.
CAPTCHA solving Tianhui Cai Period 3. CAPTCHAs Completely Automated Public Turing tests to tell Computers and Humans Apart Determines whether a user is.
Wavelet transform Wavelet transform is a relatively new concept (about 10 more years old) First of all, why do we need a transform, or what is a transform.
Multi-Layer Perceptron
Chapter 5: Neighborhood Processing
CCN COMPLEX COMPUTING NETWORKS1 This research has been supported in part by European Commission FP6 IYTE-Wireless Project (Contract No: )
Music Genre Classification Alex Stabile. Example File
Prediction of the Foreign Exchange Market Using Classifying Neural Network Doug Moll Chad Zeman.
1 Wavelet Transform. 2 Definition of The Continuous Wavelet Transform CWT The continuous-time wavelet transform (CWT) of f(x) with respect to a wavelet.
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
1 Chapter 02 Continuous Wavelet Transform CWT. 2 Definition of the CWT The continuous-time wavelet transform (CWT) of f(t) with respect to a wavelet 
The Wavelet Tutorial: Part2 Dr. Charturong Tantibundhit.
WAVELET AND IDENTIFICATION WAVELET AND IDENTIFICATION Hamed Kashani.
MULTIMEDIA DATA MODELS AND AUTHORING
Neural Network Recognition of Frequency Disturbance Recorder Signals Stephen Tang REU Final Presentation July 22, 2014.
Neural Networks Lecture 4 out of 4. Practical Considerations Input Architecture Output.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Pattern Recognition Lecture 20: Neural Networks 3 Dr. Richard Spillman Pacific Lutheran University.
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
CSE343/543 Machine Learning Mayank Vatsa Lecture slides are prepared using several teaching resources and no authorship is claimed for any slides.
Today’s Lecture Neural networks Training
Wavelet Transform Advanced Digital Signal Processing Lecture 12
Multiple-Layer Networks and Backpropagation Algorithms
Neural Networks.
Artificial neural networks
Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.
Wavelets : Introduction and Examples
IIS for Image Processing
Final Year Project Presentation --- Magic Paint Face
Multi-resolution analysis
An Introduction to Support Vector Machines
Machine Learning Today: Reading: Maria Florina Balcan
Tomasz Maszczyk and Włodzisław Duch Department of Informatics,
Introduction To Wavelets
Artificial Neural Network & Backpropagation Algorithm
Object Classes Most recent work is at the object level We perceive the world in terms of objects, belonging to different classes. What are the differences.
Neural Network - 2 Mayank Vatsa
Wavelet transform Wavelet transform is a relatively new concept (about 10 more years old) First of all, why do we need a transform, or what is a transform.
Creating Data Representations
Wavelet Transform Fourier Transform Wavelet Transform
Yi Zhao1, Yanyan Shen*1, Yanmin Zhu1, Junjie Yao2
John H.L. Hansen & Taufiq Al Babba Hasan
Introduction to Radial Basis Function Networks
Govt. Polytechnic Dhangar(Fatehabad)
Research Institute for Future Media Computing
Chapter 15: Wavelets (i) Fourier spectrum provides all the frequencies
Ch4: Backpropagation (BP)
Presentation transcript:

Audio Content Description with Wavelets Neural Nets and Diploma Thesis Stephan Rein Prof. Dr.-Ing. Thomas Sikora Prof. Dr. Martin Reisslein Dr. Nicolas Moreau

Overview Next Generation Internet Search Machine MPEG-7: Multimedia Content Description Why Wavelets? Statistical Analysis of Wavelet Coefficients Neural Nets for Audio Content Classification Results Summary

Next Generation Internet Search Machine identifiy classical movements So.1 iii Men 57 feature extraction similarity measure So.1 iv Men 57 So.1 iv

Moving Pictures Expert Group MPEG-1, 2, 4: Compression of Multimedia Data MPEG-7: Description of Multimedia Data Idea of Multimedia content description: key to completely novel and futuristic applications Content description tools Platform for Descriptive Data Encourage research on content description

6 Sonatas & Partitas for the Test Data Base: J. S. Bach 6 Sonatas & Partitas for the Solo Violin BWV 1001-1006 1934-36 1957 1952 1973 Current today, current in 100 years from now Recordings differ in time, frequency, quality and sound environments Polyphonic and non separable phenomena

Problem Short-Term Fourier Analysis: Trade-off between Time and Frequency (Heisenberg Uncertainty Principle) Short analysis window: high frequencies can be well located, but low frequency components can not be measured Long analysis window: low frequencies can be measured, but high frequencies can not be resolved in time coarser time resolution when? ? ? ?

Solution: Wavelet Time-Scale Approach lower scale high frequency convolution higher scale low frequency time(position) scale

Wavelet Mother Functions must satisfy admissibility conditions (Farge 1992). Decrease quickly towards 0 Zero mean Localized in time and frequency domain Family of shifts and dilations of must allow for signal reconstruction

Analysis of Wavelet Coefficients Gaussian Wavelet Envelope Descriptor Statistical Data Summarization Tools arithmetic mean geometric mean harmonic mean standard deviation variation mean absolute deviation median interquartile range range skewness Scale Frequency Measure Percentile Correlations

A novel Wavelet Dispersion Measure time a) scale b) rank d) c) e)

Performance Wavelet Disp. Measure Identify pieces of novel recording of Menuhin 1934 recordings employed by the search system user query: recording of Menuhin 1934

Neural Nets for Audio Classification training Perceptron Neural Net Backpropagation Net Probabilistic Radial Basis Net next slides answer user query Mil 75 Wavelet dispersion vectors target vectors Men34 Men57 Hei57 class 1 1 Neural Net class 2 example 2 vectors class 32 32 class x

Single Layer Perceptron Network net output net input 3 b transfer function training algorithm net error

Backpropagation Network Hidden layers and output layer Different transfer functions Gradient decent algorithm: learning rate minimum

Performance Perceptron Net Perfectly learned example recordings Was not able to generalize

Performance Radial Neural Net best performance with biorthogonal wavelet good performance with Morlet wavelet

Summary Analysis of Wavelets and Neural Nets for identification of classical movements Novel Wavelet Dispersion Measure Novel Methodology with 78 % success rate: a) biorthogonal Wavelet b) Dispersion Measure c) Radial Basis Neural Net Readily applicable for next generation Internet Search Machine

Publication & Contact Pending U.S. patent application: ask Prof. Martin Reisslein (reisslein@asu.edu) for details Diploma Thesis available at www.fulton.asu.edu/~mre S. Rein, M. Reisslein, T. Sikora (sikora@nue.tu-berlin.de), Audio Content Description with Wavelets and Neural Nets, 4-page version submitted to ICASSP’ 04, available on request: rein@cs.tu-berlin.de

Thank You for your help Prof. Dr.-Ing. Thomas Sikora Prof. Dr. Martin Reisslein Dr. Nicolas Moreau Birgit Boldin Dr.-Ing. Frank Fitzek