INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

Slides:

Advertisements

Similar presentations

Figures for Chapter 7 Advanced signal processing Dillon (2001) Hearing Aids.

Advertisements

Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 181 Lecture 18 DSP-Based Analog Circuit Testing  Definitions  Unit Test Period (UTP)  Correlation.

Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.

Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.

CEN352, Dr. Ghulam Muhammad King Saud University

Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec

SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Effects in frequency domain Stefania Serafin Music Informatics Fall 2004.

Lecture #18 FAST FOURIER TRANSFORM INVERSES AND ALTERNATE IMPLEMENTATIONS Department of Electrical and Computer Engineering Carnegie Mellon University.

INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

So far: Historical overview of speech technology  basic components/goals for systems Quick review of DSP fundamentals Quick overview of pattern recognition.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Robust Automatic Speech Recognition In the 21 st Century Richard Stern (with Alex Acero, Yu-Hsiang Chiu, Evandro Gouvêa, Mark Harvilla, Chanwoo Kim, Kshitiz.

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.

1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.

Microphone Integration – Can Improve ARS Accuracy? Tom Houy

Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.

Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.

Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.

Performance analysis of channel estimation and adaptive equalization in slow fading channel Chen Zhifeng Electrical and Computer Engineering University.

Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.

Chapter 12 The Principles of Computer Music Contents Digital Audio Processing Noise Reduction Audio Compression Digital Rights Management (DRM)

ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska

VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.

[Advanced] Speech & Audio Signal Processing ES 157/257: Speech and Audio Processing Prof. Patrick Wolfe, Harvard DEAS 02 February 2006.

Signals & Systems B-Tech (Hons). Signals & Systems Lecture # 1 Instructor Engr. Kashif Shahzad 2015.

Robust Feature Extraction for Automatic Speech Recognition based on Data-driven and Physiologically-motivated Approaches Mark J. Harvilla1, Chanwoo Kim2.

Dr. Galal Nadim.  The root-MUltiple SIgnal Classification (root- MUSIC) super resolution algorithm is used for indoor channel characterization (estimate.

SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo January 15, 2015 Department of Electrical and Computer.

1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.

UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.

PERFORMANCE OF A WAVELET-BASED RECEIVER FOR BPSK AND QPSK SIGNALS IN ADDITIVE WHITE GAUSSIAN NOISE CHANNELS Dr. Robert Barsanti, Timothy Smith, Robert.

High Quality Voice Morphing

Voice Manipulator Department of Electrical & Computer Engineering

Techniques to control noise and fading

Digital Communications Chapter 13. Source Coding

Adaptive Filters Common filter design methods assume that the characteristics of the signal remain constant in time. However, when the signal characteristics.

III Digital Audio III.9 (Wed Oct 25) Phase vocoder for tempo and pitch changes.

Advanced Wireless Networks

Lecture 1.30 Structure of the optimal receiver deterministic signals.

III. Analysis of Modulation Metrics IV. Modifications

1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.

Microcomputer Systems 1

Microcomputer Systems 1

Voice Removal from Music

Lecture #17 INTRODUCTION TO THE FAST FOURIER TRANSFORM ALGORITHM

Linear Predictive Coding Methods

Outline Linear Shift-invariant system Linear filters

III Digital Audio III.9 (Wed Oct 24) Phase vocoder for tempo and pitch changes.

Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing

Chen Zhifeng Electrical and Computer Engineering University of Florida

Richard M. Stern demo January 12, 2009

Linear Prediction.

INTRODUCTION TO FUNDAMENTALS OF SIGNAL PROCESSING

INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

Govt. Polytechnic Dhangar(Fatehabad)

CEN352, Dr. Ghulam Muhammad King Saud University

EXAMPLES OF POLYAURAL PROCESSING

Lecture #18 FAST FOURIER TRANSFORM ALTERNATE IMPLEMENTATIONS

INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)

Lecture #17 INTRODUCTION TO THE FAST FOURIER TRANSFORM ALGORITHM

Presenter: Shih-Hsiang(士翔)

Robust Speech Recognition in the 21st Century

Combination of Feature and Channel Compensation (1/2)

Presentation transcript:

INTRODUCTION TO 18-792 ADVANCED DIGITAL SIGNAL PROCESSING Richard M. Stern 18-491 talk April 15, 2019 Department of Electrical and Computer Engineering Carnegie Mellon University Pittsburgh, Pennsylvania 15213

What is 18-792 Advanced DSP? One of several courses that extend and apply the topics discussed in 18-491 Focus is on one-dimensional signals, primarily speech and music Much of the course will discuss optimal solutions based on probabilistic/stochastic signal representations

Why take 18-792? ADSP is THE most interesting ECE grad course this fall ADSP is great fun (at least most of the time) You will be implementing algorithms that are fundamental to signal processing today

Advanced digital signal processing: major application issues Signal representation Signal modeling Signal enhancement Signal separation

18-792: major topic areas Multi-rate DSP Short-time Fourier analysis Overview of important properties of stochastic processes Traditional and modern spectral analysis Linear prediction Adaptive filtering Adaptive array processing Additional topics and applications Orange headings refer to deterministic topics

The source-filter model of speech A useful model for representing the generation of speech sounds: Pitch Pulse train source Noise source Vocal tract model Amplitude p[n]

Some examples of homework projects: separating vocal tract excitation and and filter Original speech: Speech with 75-Hz excitation: Speech with 150 Hz excitation: Speech with noise excitation: Comment:: this is a major technique used in speech coding Welcome16 Welcome 75 Welcome 150 Welcome 0

Classical signal enhancement: compensation of speech for noise and filtering Approach of Acero, Liu, Moreno, et al. (1990-1997)… Compensation achieved by estimating parameters of noise and filter and applying inverse operations “Clean” speech Degraded speech x[m] h[m] z[m] Linear filtering n[m] Additive noise

Compensating for the combined effects of additive noise and linear filtering in ASR Threshold shifts by ~7 dB Accuracy still poor for low SNRs Complete retraining –7 dB 13 dB Clean VTS (1997) Original CDCN (1990) “Recovered” CMN (baseline) out_pre0_norm out_new_pre20 out out_post0_norm out_new_post20

Signal separation: speech is quite intelligible, even when presented only in fragments Procedure: Determine which time-frequency time-frequency components appear to be dominated by the desired signal Reconstruct signal based on “good” components A Monaural example: Mixed signals - Separated signals - 5_spk 1st_spk 2nd_spk 3rd_spk 4th_spk 5th_spk

Practical signal separation: Audio samples using selective reconstruction based on ITD RT60 (ms) 0 300 No Proc Delay-sum ZCAE-bin ZCAE-cont Brian-Ba-R0I0 Brian-Ba-R3I0 Brian-DS-R0I0 Brian-DS-R3I0 Brian-ZB-R0I0 Brian-ZB-R3I0 Brian-ZC-R0I0 Brian-ZC-R3I0

Phase vocoding: changing time scale and pitch Changing the time scale: Original speech Faster by 4:3 Slower by 1:2 Transposing pitch: Original music After phase vocoding Transposing up by a major third Transposing down by a major third Comment: this is one of the techniques used to perform autotuning Comment: this is how autotuning is done Welcome16 Welcome 75 Welcome 150 Welcome 0

Another type of signal enhancement: adaptive noise cancellation SupP Speech + noise enters primary channel, correlated noise enters reference channel Adaptive filter attempts to convert noise in secondary channel to best resemble noise in primary channel and subtracts Performance degrades when speech leaks into reference channel and in reverberation Original: Processed: Push-to-talk will make life MUCH easier!!

Noise cancellation for a PDA using two mics in “endfire” configuration Speech in cafeteria noise, no noise cancellation Speech with noise cancellation But …. simulation assumed no reverb ANC_base ANC_cancel

Summary Lots of interesting topics that extend core material from DSP Greater emphasis on implementation and applications Greater emphasis on statistically-optimal signal processing I hope that you have as much fun with this material as I have had!