Analysis of Audio Using PCA

Slides:

Advertisements

Similar presentations

Frequency analysis.

Advertisements

Tamara Berg Advanced Multimedia

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.

CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The Linear Prediction Model The Autocorrelation Method Levinson and Durbin.

Lecture 7: Basis Functions & Fourier Series

Tensors and Component Analysis Musawir Ali. Tensor: Generalization of an n-dimensional array Vector: order-1 tensor Matrix: order-2 tensor Order-3 tensor.

CORRECTING RMS VALUE OF A SINE WAVEFORM SAMPLED DUE LIMITED NUMBER OF PERIODS AND DETERMINATE APERTURE TIME ON DMM Keywords: Digital Multimeter (DMM),

Harmonic Series and Spectrograms 220 Hz (A3) Why do they sound different? Instrument 1 Instrument 2Sine Wave.

Learning Objectives Static and Dynamic Characteristics of Signals

CEN352, Dr. Ghulam Muhammad King Saud University

School of Computing Science Simon Fraser University

Principal Component Analysis

7th IEEE Technical Exchange Meeting 2000 Hybrid Wavelet-SVD based Filtering of Noise in Harmonics By Prof. Maamar Bettayeb and Syed Faisal Ali Shah King.

Subband-based Independent Component Analysis Y. Qi, P.S. Krishnaprasad, and S.A. Shamma ECE Department University of Maryland, College Park.

Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.

Database Construction for Speech to Lip-readable Animation Conversion Gyorgy Takacs, Attila Tihanyi, Tamas Bardi, Gergo Feldhoffer, Balint Srancsik Peter.

EE513 Audio Signals and Systems Noise Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

Audio Fundamentals Lesson 2 Sound, Sound Wave and Sound Perception

Where we’re going Speed, Storage Issues Frequency Space.

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

Lecture 1 Signals in the Time and Frequency Domains

Summarized by Soo-Jin Kim

Harmonic Series and Spectrograms

Fourier Concepts ES3 © 2001 KEDMI Scientific Computing. All Rights Reserved. Square wave example: V(t)= 4/  sin(t) + 4/3  sin(3t) + 4/5  sin(5t) +

Feature extraction 1.Introduction 2.T-test 3.Signal Noise Ratio (SNR) 4.Linear Correlation Coefficient (LCC) 5.Principle component analysis (PCA) 6.Linear.

Digital Signal Processing

CISE315 SaS, L171/16 Lecture 8: Basis Functions & Fourier Series 3. Basis functions: Concept of basis function. Fourier series representation of time functions.

Module 2 SPECTRAL ANALYSIS OF COMMUNICATION SIGNAL.

Wireless and Mobile Computing Transmission Fundamentals Lecture 2.

Basics of Neural Networks Neural Network Topologies.

School of Electrical & Computer Engineering Image Denoising Using Steerable Pyramids Alex Cunningham Ben Clarke Dy narath Eang ECE November 2008.

ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska

Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio

CCN COMPLEX COMPUTING NETWORKS1 This research has been supported in part by European Commission FP6 IYTE-Wireless Project (Contract No: )

Fourier Series Fourier Transform Discrete Fourier Transform ISAT 300 Instrumentation and Measurement Spring 2000.

1/20 System Overview Cyclic mo-cap data (walking, running..) Cyclic mo-cap data (walking, running..) Music / Sound (audio) Music / Sound (audio) Resulting.

A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER

Harmonic Series and Spectrograms BY JORDAN KEARNS (W&L ‘14) & JON ERICKSON (STILL HERE )

Sound and Its System. What is sound? basically a waveform of energy that is produced by some form of a mechanical vibration (ex: a tuning fork), and has.

MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.

Principal Component Analysis (PCA)

Chapter 8 Lossy Compression Algorithms. Fundamentals of Multimedia, Chapter Introduction Lossless compression algorithms do not deliver compression.

Fundamentals of Multimedia Chapter 6 Basics of Digital Audio Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.

Xiaoying Pang Indiana University March. 17 th, 2010 Independent Component Analysis for Beam Measurement.

ECE 8443 – Pattern Recognition ECE 3163 – Signals and Systems Objectives: Eigenfunctions Fourier Series of CT Signals Trigonometric Fourier Series Dirichlet.

Intro to Fourier Series BY JORDAN KEARNS (W&L ‘14) & JON ERICKSON (STILL HERE )

Spectral subtraction algorithm and optimize Wanfeng Zou 7/3/2014.

Lecture 7: Basis Functions & Fourier Series

A 2 veto for Continuous Wave Searches

Discrete Fourier Transform (DFT)

Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband.

Chapter 3 Sampling.

Application of Independent Component Analysis (ICA) to Beam Diagnosis

Outline Linear Shift-invariant system Linear filters

Intro to Fourier Series

Outline Linear Shift-invariant system Linear filters

Lecture 14 PCA, pPCA, ICA.

4. DIGITAL IMAGE TRANSFORMS 4.1. Introduction

Notes Assignments Tutorial problems

Signals and Systems Networks and Communication Department Chapter (1)

The Sound of the Original Sentences

SVD, PCA, AND THE NFL By: Andrew Zachary.

X.1 Principal component analysis

Image Compression via SVD

CEN352, Dr. Ghulam Muhammad King Saud University

Principal Component Analysis

NOISE FILTER AND PC FILTERING

Presentation transcript:

Analysis of Audio Using PCA SAMEER MESHRAM (M.S.) 1725972 VARUN RUPCHANDANI (M.S.) 1728011 VIDUR VIJ (M.S.) 1727990 June 4, 2018

CONTENTS Sound Waves - Basics Problem Statement / Proposed Algorithm Overview VMD and PCA Algorithm Implementation Results Conclusion i

SOUND WAVES - Basics A Sound waves are disturbances in space caused by vibration from a source. Usually represented in the form of a sinusoid: 𝑠 𝑡 =𝐴∗ 𝑠𝑖𝑛 2𝜋𝑓𝑡 + ∅ =𝐴∗𝑠𝑖𝑛 2𝜋 𝑇 𝑡+ ∅ Since analog signal A/D conversion is required for storage and processing Audible range for humans: 20 Hz to 20 kHz Fig : (A) Frequency and wavelength of a typical sound wave (F. Camastra et al 2015) 1

Algorithm for the proposed method PROBLEM STATEMENT Separation of two distinct audio sources from a single mixed audio signal Applications: Bio-medical signal processing, speech recognition, etc. A Fig : (A) Representation of individual and mixed sound signals Algorithm for the proposed method 1. Input observed signal x(t) 2. Apply VMD on x(t) to get n modes : M 1 , M 2 , - up to M n 3. Apply PCA on M 1 to M n to select m principal components : P 1 , P 2 ,- up to P m 4. Extracted sources : P 1 , P 2 ,- up to P m 2

VARIATIONAL MODE DECOMPOSITION (VMD) VMD decomposes a signal into a number of modes Each mode is band-limited about its center frequency It decomposes a real valued signal into a finite number of modes Principal components of these modes are selected to reconstruct the original signal. Fig : (A) VMD decomposition of noisy tri-harmonic. (a) The noisy input signal. (b)-(d) The three modes extracted by de-noising VMD, 3

PRINCIPAL COMPONENT ANALYSIS (PCA) Principal Component Analysis transforms the data into a new domain The new basis are perpendicular to each other The new basis of the data is along the direction of max variance (PC1) Data can be decomposed using eigenvalue decomposition of covariance matrix Or SVD of data matrix PC1 Fig : (A) Representation of the concept of PCA 4

ALGORITHM IMPLEMENTATION - I Signal Input: 𝑥 𝑡 = 𝑥 1 𝑡 + 𝑥 2 𝑡 Case 1 Case 2 𝑥 1 𝑡 =sin(3t) 𝑥 1 𝑡 ="Cymbals audio“ [3 - 5 kHz] 𝑥 𝟐 𝑡 =sin(7t) 𝑥 𝟐 𝑡 ="Bass Drum audio" [60 - 100 Hz] Matlab function(s): audioread() A B Fig : (A) Unmixed and mixed audio signal for case 1, (B) Unmixed and mixed audio signal for case 2 5

ALGORITHM IMPLEMENTATION - II Variational Mode Decomposition on : 𝑥 𝑡 = 𝑥 1 𝑡 + 𝑥 2 𝑡 Input 𝒙 𝑡 =sin 3t +sin(7t) 𝒙 𝑡 ="Cymbals + Bass Drums" Output Modes: M 1 , M 2 , - M n Matlab function(s): VMD() Principal Component Analysis on VMD Modes : M 1 , M 2 , - M n Input Modes of 𝒙 𝑡 Output Extracted Principal Components: P 1 , P 2 ,- P m Matlab function(s): pca() Signal Reconstruction: 𝐏𝐂𝐀 𝐫𝐞𝐜𝐨𝐧𝐬𝐭𝐫𝐮𝐜𝐭𝐢𝐨𝐧=𝐏𝐂 𝐬𝐜𝐨𝐫𝐞𝐬 × 𝐄𝐢𝐠𝐞𝐧𝐯𝐞𝐜𝐭𝐨𝐫𝐬 𝐓 +𝐌𝐞𝐚𝐧 6

RESULTS - Case 1 (Sinusoids) B Fig : (A) Unmixed audio signal 1 for case 1, (B) Unmixed audio signal 2 for case 1 7

RESULTS - Case 1 (Sinusoids) B Fig : (A) Spectrogram of unmixed audio signal 1 for case 1, (B) Spectrogram of unmixed audio signal 2 for case 1 8

RESULTS - Case 1 (Sinusoids) Comparison between original and separated signal For 𝒙 𝟏 𝒕 =𝐬𝐢𝐧(𝟑𝐭) Correlation coefficient 0.9872 Percentage error 6.3593 Root mean square error 0.0187 For 𝒙 𝟐 𝒕 =𝐬𝐢𝐧(𝟕𝐭) Correlation coefficient 0.9987 Percentage error 4.5866 Root mean square error 0.0053 9

RESULTS - Case 2 (Cymbals & Bass Drums) Fig : (A) Unmixed audio signal for cymbal, (B) Unmixed audio signal for bass drum 10

RESULTS - Case 2 (Cymbals & Bass Drums) Fig : (A) Spectrogram of unmixed audio signal for cymbal, (B) Spectrogram of unmixed audio signal for bass drum 11

RESULTS - Case 2 (Cymbals & Bass Drums) Comparison between original and separated signal For Cymbal Correlation coefficient 0.6794 Percentage error 43.9026 Root mean square error 0.1601 For Bass Drum Correlation coefficient 0.9988 Percentage error 16.4570 Root mean square error 0.0489 12

CONCLUSION Algorithm based on VMD & PCA was implemented for audio separation Mixture of pure sinusoids is separated very efficiently Instruments cymbal and bass drum are separated satisfactorily The algorithm has scope for further improvement 13

THE END Thank You! Questions?