INTRODUCTION TO FUNDAMENTALS OF SIGNAL PROCESSING

Slides:



Advertisements
Similar presentations
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Response to a Sinusoidal Input Frequency Analysis of an RC Circuit.
Advertisements

Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Chapter 8: The Discrete Fourier Transform
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
Sampling, Reconstruction, and Elementary Digital Filters R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2002.
Lecture #7 FREQUENCY RESPONSE OF LSI SYSTEMS Department of Electrical and Computer Engineering Carnegie Mellon University Pittsburgh, Pennsylvania.
DSP for Engineering Aplications DSP for Engineering Aplications ECI Semester /2010 Department of Engineering and Design London South Bank University.
Lecture #18 FAST FOURIER TRANSFORM INVERSES AND ALTERNATE IMPLEMENTATIONS Department of Electrical and Computer Engineering Carnegie Mellon University.
INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING
About this Course Subject: Textbook Reference book Course website
Digital signal Processing Digital signal Processing ECI Semester /2004 Telecommunication and Internet Engineering, School of Engineering, South.
Digital Signal Processing
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
… Representation of a CT Signal Using Impulse Functions
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
1 Prof. Nizamettin AYDIN Digital Signal Processing.
April 12, 2005Week 13 1 EE521 Analog and Digital Communications James K. Beard, Ph. D. Tuesday, March 29, 2005
CEN352 Digital Signal Processing Lecture No. 1 Department of Computer Engineering, College of Computer and Information Sciences, King Saud University,
EEE 503 Digital Signal Processing Lecture #1 : Introduction Dr. Panuthat Boonpramuk Department of Control System & Instrumentation Engineering KMUTT.
G Practical MRI 1 – 29 th January 2015 G Practical MRI 1 Introduction to the course Mathematical fundamentals.
Zhongguo Liu_Biomedical Engineering_Shandong Univ. Chapter 8 The Discrete Fourier Transform Zhongguo Liu Biomedical Engineering School of Control.
Fourier Analysis of Discrete-Time Systems
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
Signals & Systems B-Tech (Hons). Signals & Systems Lecture # 1 Instructor Engr. Kashif Shahzad 2015.
Fourier and Wavelet Transformations Michael J. Watts
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo January 15, 2015 Department of Electrical and Computer.
Learning from the Past, Looking to the Future James R. (Jim) Beaty, PhD - NASA Langley Research Center Vehicle Analysis Branch, Systems Analysis & Concepts.
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
بسم الله الرحمن الرحيم Lecture (1) Introduction to DSP Dr. Iman Abuel Maaly University of Khartoum Department of Electrical and Electronic Engineering.
DSP First, 2/e Lecture 16 DTFT Properties. June 2016 © , JH McClellan & RW Schafer 2 License Info for DSPFirst Slides  This work released under.
Signal & Linear Systems (EELE 3310)
Instructor: Mian Shahzad Iqbal
PERFORMANCE OF A WAVELET-BASED RECEIVER FOR BPSK AND QPSK SIGNALS IN ADDITIVE WHITE GAUSSIAN NOISE CHANNELS Dr. Robert Barsanti, Timothy Smith, Robert.
Linear Constant-Coefficient Difference Equations
ECE 533 Digital Image Processing
CS 591 S1 – Computational Audio
Digital Communications Chapter 13. Source Coding
LECTURE 11: FOURIER TRANSFORM PROPERTIES
Course Details Course Name: Digital Signal Processing
Sampling rate conversion by a rational factor
Sampling and Reconstruction
Fourier and Wavelet Transformations
Outline Linear Shift-invariant system Linear filters
Lecture #17 INTRODUCTION TO THE FAST FOURIER TRANSFORM ALGORITHM
Outline Linear Shift-invariant system Linear filters
Lecture 15 DTFT: Discrete-Time Fourier Transform
Signal & Linear Systems (EELE 3310)
CT-321 Digital Signal Processing
HKN ECE 210 Exam 3 Review Session
Richard M. Stern demo January 12, 2009
COSC 3451: Signals and Systems
LECTURE 18: FOURIER ANALYSIS OF CT SYSTEMS
Linear Prediction.
INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING
Lecture #6 INTRODUCTION TO THE Z-TRANSFORM
Lecture #8 (Second half) FREQUENCY RESPONSE OF LSI SYSTEMS
Chapter 9 Advanced Topics in DSP
Lecture #18 FAST FOURIER TRANSFORM ALTERNATE IMPLEMENTATIONS
INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)
Speech Processing Final Project
Lecture 7: Signal Processing
Lecture #17 INTRODUCTION TO THE FAST FOURIER TRANSFORM ALGORITHM
Electrical Communications Systems ECE Spring 2019
INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING
LECTURE 11: FOURIER TRANSFORM PROPERTIES
Robust Speech Recognition in the 21st Century
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

INTRODUCTION TO 18-491 FUNDAMENTALS OF SIGNAL PROCESSING Richard M. Stern 18-491 lecture January 14, 2019 Department of Electrical and Computer Engineering Carnegie Mellon University Pittsburgh, Pennsylvania 15213

Welcome to 18-491 Fundamentals of Signal Processing (DSP)! Today will Review mechanics of course Review course content Preview material in 18-491 (DSP)

Important people (for this course at least) Instructor: Richard Stern PH B24, 8-2535, rms@cs.cmu.edu Course management assistant: Michelle Mahouski HH 1112, 8-4951, mmahousk@andrew.cmu.edu

More important people Teaching interns: Tyler Supradeep Vuong Rangarajan

Some course details Meeting time and place: Lectures here and now Recitations Friday 10:30 – 12:20, 12:30 – 2:20, SH 214 Pre-requisites (you really need these!): Signals and Systems 18-290 Some MATLAB or background (presumably from 18-290)

Does our work get graded? Yes! Grades based on: Homework (including MATLAB problems) (33%) Three exams (67%) Two midterms (March 6 and April 3), and final exam Plan on attending the exams!

Textbooks Major text: Oppenheim, Schafer, Yoder, and Padgett: Discrete-Time Signal Processing Plan on purchasing a hard copy new or used Material to be supplemented by class notes at end of course Some other texts listed in syllabus

Other support sources Office hours: Course home page: Two hours per week for instructor and each TA, times TBA You can schedule additional times with me as needed Course home page: http://www.ece.cmu.edu/~ece491 Canvas to be used for grades (but probably not much else) Piazza to be used for class discussions

Academic stress and sources of help This is a hard course Take good care of yourself If you are having trouble, seek help Teaching staff CMU Counseling and Psychological Services (CaPS) We are here to help!

Academic integrity (i.e. cheating and plagiarism) CMU’s take on academic integrity: http://www.cmu.edu/policies/documents/Cheating.html ECE’s take on academic integrity: http://www.ece.cmu.edu/programs-admissions/masters/academic-integrity.html Most important rule: Don’t cheat! But what do we mean by that? Discussing general strategies on homework with other students is OK Solving homework together is NOT OK Accessing material from previous years is NOT OK “Collaborating” on exams is REALLY REALLY NOT OK!

18-491: major topic areas Signal processing in the time domain: convolution Frequency-domain processing: The DTFT and the Z-transform Complementary signal representations Sampling and change of sampling rate The DFT and the FFT Digital filter implementation Digital filter design Selected applications Orange headings refer to deterministic topics

Complementary signal representations Unit sample response Discrete-time Fourier transforms Z-transforms Difference equations Poles and zeros of an LSI system

Some application areas (we may not get to all of these) Linear prediction and lattice filters Adaptive filtering Optimal Wiener filtering Two-dimensional DSP (image processing) Short-time Fourier analysis Speech processing Orange headings refer to deterministic topics

Signal representation: why perform signal processing? A speech waveform in time: “Welcome to DSP I”

A time-frequency representation of “welcome” is much more informative

Downsampling the waveform Downsampling the waveform by factor of 2:

Consequences of downsampling by 2 Original: Downsampled:

Upsampling the waveform Upsampling by a factor of 2:

Consequences of upsampling by 2 Original: Upsampled:

Linear filtering the waveform x[n] y[n] Filter 1: y[n] = 3.6y[n–1]+5.0y[n–2]–3.2y[n–3]+.82y[n–4] +.013x[n]–.032x[n–1]+.044x[n–2]–.033x[n–3]+.013x[n–4] Filter 2: y[n] = 2.7y[n–1]–3.3y[n–2]+2.0y[n–3]–.57y[n–4] +.35x[n]–1.3x[n–1]+2.0x[n–2]–1.3x[n–3]+.35x[n–4]

Filter 1 in the time domain

Output of Filter 1 in the frequency domain Original: Lowpass:

Filter 2 in the time domain

Output of Filter 2 in the frequency domain Original: Highpass:

Let’s look at the lowpass filter from different points of view … x[n] y[n] Difference equation for Lowpass Filter 1: y[n] = 3.6y[n–1]+5.0y[n–2]–3.2y[n–3]+.82y[n–4] +.013x[n]–.032x[n–1]+.044x[n–2]–.033x[n–3]+.013x[n–4]

Lowpass filtering in the time domain: the unit sample response

Lowpass filtering in the frequency domain: magnitude and phase of the DTFT

Another type of modeling: the source-filter model of speech A useful model for representing the generation of speech sounds: Pitch Pulse train source Noise source Vocal tract model Amplitude p[n]

The poles and zeros of the lowpass filter

Signal modeling: let’s consider the “uh” in “welcome:”

The raw spectrum

All-pole modeling: the LPC spectrum

An application of LPC modeling: separating the vocal tract excitation and and filter Original speech: Speech with 75-Hz excitation: Speech with 150 Hz excitation: Speech with noise excitation: Comment: this is a major techniques used in speech coding Welcome16 Welcome 75 Welcome 150 Welcome 0

Classical signal enhancement: compensation of speech for noise and filtering Approach of Acero, Liu, Moreno, et al. (1990-1997)… Compensation achieved by estimating parameters of noise and filter and applying inverse operations “Clean” speech Degraded speech x[m] h[m] z[m] Linear filtering n[m] Additive noise

“Classical” combined compensation improves accuracy in stationary environments Threshold shifts by ~7 dB Accuracy still poor for low SNRs Complete retraining –7 dB 13 dB Clean VTS (1997) Original CDCN (1990) “Recovered” CMN (baseline) out_pre0_norm out_new_pre20 out out_post0_norm out_new_post20

Another type of signal enhancement: adaptive noise cancellation Speech + noise enters primary channel, correlated noise enters reference channel Adaptive filter attempts to convert noise in secondary channel to best resemble noise in primary channel and subtracts Performance degrades when speech leaks into reference channel and in reverberation Push-to-talk will make life MUCH easier!!

Simulation of noise cancellation for a PDA using two mics in “endfire” configuration Speech in cafeteria noise, no noise cancellation Speech with noise cancellation But …. simulation assumed no reverb ANC_base ANC_cancel

Signal separation: speech is quite intelligible, even when presented only in fragments Procedure: Determine which time-frequency time-frequency components appear to be dominated by the desired signal Reconstruct signal based on “good” components A Monaural example: Mixed signals - Separated signals - 5_spk 1st_spk 2nd_spk 3rd_spk 4th_spk 5th_spk

Practical signal separation: Audio samples using selective reconstruction based on ITD RT60 (ms) 0 300 No Proc Delay-sum ZCAE-bin ZCAE-cont Brian-Ba-R0I0 Brian-Ba-R3I0 Brian-DS-R0I0 Brian-DS-R3I0 Brian-ZB-R0I0 Brian-ZB-R3I0 Brian-ZC-R0I0 Brian-ZC-R3I0

Phase vocoding: changing time scale and pitch Changing the time scale: Original speech Faster by 4:3 Slower by 1:2 Transposing pitch: Original music After phase vocoding Transposing up by a major third Transposing down by a major third Comment: this is one of several techniques used to perform autotuning Welcome16 Welcome 75 Welcome 150 Welcome 0

Summary Lots of interesting topics that teach us how to understand signals and design filters An emphasis on developing a solid understanding of fundamentals Will introduce selected applications to demonstrate utility of techniques I hope that you have as much fun in signal processing as I have had!