Spectral Analysis & Spectrogram

Slides:



Advertisements
Similar presentations
Types of Image Enhancements in spatial domain
Advertisements

DCSP-13 Jianfeng Feng
DSPRevision I
Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Acoustic/Prosodic Features
David Hansen and James Michelussi
Department of Kinesiology and Applied Physiology Spectrum Estimation W. Rose
Chapter 11 Signal Processing with Wavelets. Objectives Define and illustrate the difference between a stationary and non-stationary signal. Describe the.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Amplitude modulation DSB-LC (full AM)
Digital Image Processing
Let’s go back to this problem: We take N samples of a sinusoid (or a complex exponential) and we want to estimate its amplitude and frequency by the FFT.
Resident Physics Lectures
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Spatial and Temporal Data Mining
Image Enhancement To process an image so that the result is more suitable than the original image for a specific application. Spatial domain methods and.
Speech & Audio Processing
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Time and Frequency Representations Accompanying presentation Kenan Gençol presented in the course Signal Transformations instructed by Prof.Dr. Ömer Nezih.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Y x x. y = x y y x x. y = - x y y x x. y = 0 Two vectors x and y are orthogonal when their scalar product is zero x. y = 0and xy = 1= Two vectors x and.
Wavelet-based Coding And its application in JPEG2000 Monia Ghobadi CSC561 project
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
64-QAM Communications System Design and Characterization Project #1 EE283
Representing Acoustic Information
Introduction to Spectral Estimation
LE 460 L Acoustics and Experimental Phonetics L-13
Ni.com Data Analysis: Time and Frequency Domain. ni.com Typical Data Acquisition System.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Understanding ADC Specifications September Definition of Terms 000 Analogue Input Voltage Digital Output Code FS1/2.
Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
CS654: Digital Image Analysis Lecture 17: Image Enhancement.
Preprocessing Ch2, v.5a1 Chapter 2 : Preprocessing of audio signals in time and frequency domain  Time framing  Frequency model  Fourier transform 
By Sarita Jondhale1 Signal Processing And Analysis Methods For Speech Recognition.
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
EE104: Lecture 5 Outline Review of Last Lecture Introduction to Fourier Transforms Fourier Transform from Fourier Series Fourier Transform Pair and Signal.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
Pre-Class Music Paul Lansky Six Fantasies on a Poem by Thomas Campion.
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
1“Principles & Applications of SAR” Instructor: Franz Meyer © 2009, University of Alaska ALL RIGHTS RESERVED Dr. Franz J Meyer Earth & Planetary Remote.
Lecture#10 Spectrum Estimation
STORM WP7 Software library for nonlinear analysis of fluctuations in space plasma time series P. Kovács, A. Koppán C. Munteanu, M. Echim, STORM Annual.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
EE 495 Modern Navigation Systems Noise & Random Processes Mon, March 02 EE 495 Modern Navigation Systems Slide 1 of 19.
Antenna and Microwave Laboratory Babol Noshirvani University of Technology Antenna and Microwave Laboratory Babol Noshirvani University.
My_seis_GUI : User-Interface grabs a waveform object defined by ‘Station’, ‘Channel’, ‘Network’, ‘Port’, ‘Host’ Start and Stop times are in form [YYYY.
The Discrete Fourier Transform
The Story of Wavelets Theory and Engineering Applications
Time Compression/Expansion Independent of Pitch. Listening Dies Irae from Requiem, by Michel Chion (1973)
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 4 – Audio and Digital Image Representation Klara Nahrstedt Spring 2010.
Acoustic Phonetics 3/14/00.
Correlation and Power Spectra Application 5. Zero-Mean Gaussian Noise.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
Lecture 19 Spectrogram: Spectral Analysis via DFT & DTFT
Fourier series With coefficients:.
distance, m (log scale) -25o 0o +25o C50, dB left right L-shaped
National Mathematics Day
Ch. 2 : Preprocessing of audio signals in time and frequency domain
Applications of Multirate Signal Processing
EEE422 Signals and Systems Laboratory
Speech Signal Processing
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
64-QAM Communications System Design and Characterization
Fourier Transform.
LECTURE 18: FAST FOURIER TRANSFORM
LECTURE 18: FAST FOURIER TRANSFORM
Two models of the stimulus–response relationship for auditory neurons.
Presentation transcript:

Spectral Analysis & Spectrogram SASPL Cheolwoo Jo

Contents Spectral Analysis Spectrogram Analysis of Spectrogram Narrow Band Wide Band Analysis of Spectrogram AR, MA, ARMA model of Speech

Spectral Analysis

Spectrogram in matlab [B,F,T] = SPECGRAM(A,NFFT,Fs,WINDOW,NOVERLAP) A: Speech Signal NFFT: Number of samples Fs: Sampling Frequency Window: Type of Window NOVERLAP: Number of Overlapping signal

Spectrogram in auditory toolbox function [array,raw] = spectrogram(wave,segsize,nlap,ntrans); %function array = spectrogram(wave,segsize,nlap,ntrans); % defaults spectrogram(wave,128,8,4) % nlap is number of hamming windows overlapping a point; % ntrans is factor by which transform is bigger than segment; % returns a spectrogram 'array' with fourth root of power, % filter smoothed and formatted for display. % Added option to return raw spectrogram.... Malcolm 5/26/95 % Added code so that input could be any direction ... Malcolm 5/26/95 % (c) 1998 Interval Research Corporation if nargin < 4; ntrans=4; end if nargin < 3; nlap=8; end if nargin < 2; segsize=128; end [r c] = size(wave); if (r < c) wave = filter([1 -0.95],[1],wave'); else wave = filter([1 -0.95],[1],wave); end

s = length(wave); nsegs = floor(s/(segsize/nlap))-nlap+1; array = zeros(ntrans/2*segsize,nsegs); window = 0.54-0.46*cos(2*pi/(segsize+1)*(1:segsize)'); for i = 1:nsegs seg = zeros(ntrans*segsize,1); % leave half full of zeroes seg(1:segsize) = ... window.*wave(((i-1)*segsize/nlap+1):((i+nlap-1)*segsize/nlap)); seg = abs(fft(seg)); % reverse for image display array(:,i) = seg(((ntrans/2*segsize)+1):(ntrans*segsize)); end if nargout > 1 raw = array;

array = array .* array; % back into power domain for smoothing for i=1:nsegs % smooth the spectral slices array(:,i) = filter([.2 1 .2],[1],array(:,i)); end for i=1:ntrans/2*segsize % smooth the channels array(i,:) = filter([.2 1 .2],[1],array(i,:)); % compress with square root of amplitude (fourth root of power) off = 0.0001*max(max(array)); % low end stabilization offset, array = (off+array).^0.25-off^0.25; % better than a threshold hack! array = 255/max(max(array))*array;

Spectrogram level scaling Log|x[k]|2 Gray Intensity Black Log Energy(dB) White 60dB

Specgram SPECGRAM(A,F,Fs,WINDOW) Matlab A: Speech F: Vector of Frequencies Fs: Sampling Frequency Window: Type of Window Function