SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern 18-491 demo January 15, 2015 Department of Electrical and Computer.

Slides:

Advertisements

Similar presentations

Design of Digital IIR Filter

Advertisements

Figures for Chapter 7 Advanced signal processing Dillon (2001) Hearing Aids.

ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Response to a Sinusoidal Input Frequency Analysis of an RC Circuit.

Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.

ROBUST SIGNAL REPRESENTATIONS FOR AUTOMATIC SPEECH RECOGNITION

Pitch Shifting and Dynamic Filtering Rossum (1992a) Digital sampling instrument for digital audio data; Rossum (1992b) Dynamic digital IIR audio filter.

Sampling and quantization Seminary 2. Problem 2.1 Typical errors in reconstruction: Leaking and aliasing We have a transmission system with f s =8 kHz.

CEN352, Dr. Ghulam Muhammad King Saud University

Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.

Applying Models of Auditory Processing to Automatic Speech Recognition: Promise and Progress Richard Stern (with Chanwoo Kim and Yu-Hsiang Chiu) Department.

SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.

Introduction to Speech Synthesis ● Key terms and definitions ● Key processes in sythetic speech production ● Text-To-Phones ● Phones to Synthesizer parameters.

Pole Zero Speech Models Speech is nonstationary. It can approximately be considered stationary over short intervals (20-40 ms). Over thisinterval the source.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,

1 Manipulating Digital Audio. 2 Digital Manipulation  Extremely powerful manipulation techniques  Cut and paste  Filtering  Frequency domain manipulation.

1 USING CLASS WEIGHTING IN INTER-CLASS MLLR Sam-Joo Doh and Richard M. Stern Department of Electrical and Computer Engineering and School of Computer Science.

INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Robust Automatic Speech Recognition In the 21 st Century Richard Stern (with Alex Acero, Yu-Hsiang Chiu, Evandro Gouvêa, Mark Harvilla, Chanwoo Kim, Kshitiz.

Filtering Separating what you want from what you have.

Digital Signal Processing

Digital to Analogue Conversion Natural signals tend to be analogue Need to convert to digital.

Over-Sampling and Multi-Rate DSP Systems

Chapter 4: Sampling of Continuous-Time Signals

… Representation of a CT Signal Using Impulse Functions

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.

AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.

1 ELEN 6820 Speech and Audio Processing Prof. D. Ellis Columbia University Midterm Presentation High Quality Music Metacompression Using Repeated- Segment.

Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.

CEN352 Digital Signal Processing Lecture No. 1 Department of Computer Engineering, College of Computer and Information Sciences, King Saud University,

EE Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

Basics of Neural Networks Neural Network Topologies.

Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.

Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.

Independent Component Analysis Algorithm for Adaptive Noise Cancelling 적응 잡음 제거를 위한 독립 성분 분석 알고리즘 Hyung-Min Park, Sang-Hoon Oh, and Soo-Young Lee Brain.

Hearing Research Center

Chapter 12 The Principles of Computer Music Contents Digital Audio Processing Noise Reduction Audio Compression Digital Rights Management (DRM)

ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska

VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.

Quiz 1 Review. Analog Synthesis Overview Sound is created by controlling electrical current within synthesizer, and amplifying result. Basic components:

Robust Feature Extraction for Automatic Speech Recognition based on Data-driven and Physiologically-motivated Approaches Mark J. Harvilla1, Chanwoo Kim2.

1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.

Subband Coding Jennie Abraham 07/23/2009. Overview Previously, different compression schemes were looked into – (i)Vector Quantization Scheme (ii)Differential.

Fourier and Wavelet Transformations Michael J. Watts

APPLICATION OF A WAVELET-BASED RECEIVER FOR THE COHERENT DETECTION OF FSK SIGNALS Dr. Robert Barsanti, Charles Lehman SSST March 2008, University of New.

Hossein Sameti Department of Computer Engineering Sharif University of Technology.

OTHER RESEARCH IN SIGNAL PROCESSING AND COMMUNICATIONS IN ECE Richard Stern Carnegie Mellon University (with Dave Casasent, Tsuhan Chen, Vijaya Kumar,

Sub-Band Coding Multimedia Systems and Standards S2 IF Telkom University.

WAVELET NOISE REMOVAL FROM BASEBAND DIGITAL SIGNALS IN BANDLIMITED CHANNELS Dr. Robert Barsanti SSST March 2010, University of Texas At Tyler.

1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.

UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.

بسم الله الرحمن الرحيم Lecture (1) Introduction to DSP Dr. Iman Abuel Maaly University of Khartoum Department of Electrical and Electronic Engineering.

PERFORMANCE OF A WAVELET-BASED RECEIVER FOR BPSK AND QPSK SIGNALS IN ADDITIVE WHITE GAUSSIAN NOISE CHANNELS Dr. Robert Barsanti, Timothy Smith, Robert.

Sampling rate conversion by a rational factor

Fourier and Wavelet Transformations

Voice Removal from Music

Kocaeli University Introduction to Engineering Applications

Richard M. Stern demo January 12, 2009

INTRODUCTION TO FUNDAMENTALS OF SIGNAL PROCESSING

INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

CEN352, Dr. Ghulam Muhammad King Saud University

INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)

INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING

Robust Speech Recognition in the 21st Century

Combination of Feature and Channel Compensation (1/2)

Presentation transcript:

SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo January 15, 2015 Department of Electrical and Computer Engineering and School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania 15213

Carnegie Mellon Slide Digital Signal Processing I The original sound and its spectrogram

Carnegie Mellon Slide Digital Signal Processing I Downsampling the waveform Downsampling the waveform by factor of 2:

Carnegie Mellon Slide Digital Signal Processing I Consequences of downsampling Original: Downsample Downsampled:

Carnegie Mellon Slide Digital Signal Processing I Upsampling the waveform Upsampling by a factor of 2:

Carnegie Mellon Slide Digital Signal Processing I Consequences of upsampling Original: Upsampled:

Carnegie Mellon Slide Digital Signal Processing I Linear filtering the waveform x[n] y[n] Filter 1: y[n] = 3.6y[n–1]+5.0y[n–2]–3.2y[n–3]+.82y[n–4] +.013x[n]–.032x[n–1]+.044x[n–2]–.033x[n–3]+.013x[n–4] Filter 2: y[n] = 2.7y[n–1]–3.3y[n–2]+2.0y[n–3–.57y[n–4] +.35x[n]–1.3x[n–1]+2.0x[n–2]–1.3x[n–3]+.35x[n–4]

Carnegie Mellon Slide Digital Signal Processing I Filter 1 in the time domain

Carnegie Mellon Slide Digital Signal Processing I Output of Filter 1 in the frequency domain Original: Lowpass:

Carnegie Mellon Slide Digital Signal Processing I Filter 2 in the time domain

Carnegie Mellon Slide Digital Signal Processing I Output of Filter 2 in the frequency domain Original: Highpass:

Carnegie Mellon Slide Digital Signal Processing I The source-filter model of speech A useful model for representing the generation of speech sounds: Pitch Pulse train source Noise source Vocal tract model Amplitude p[n]

Carnegie Mellon Slide Digital Signal Processing I Original speech: Speech with 75-Hz excitation: Speech with 150-Hz excitation: Speech with noise excitation: Separating the vocal-tract excitation from the filter

Carnegie Mellon Slide Digital Signal Processing I Some Research Foci in ECE Processing, analysis, and compression of static and moving video Optical signal processing techniques Signal processing for digital data storage Speech recognition and understanding Multimedia fusion of video and audio information Architecture and protocols of telecommunications and computer networks

Carnegie Mellon Slide Digital Signal Processing I Approach of Acero, Liu, Moreno, et al. ( )… Compensation achieved by estimating parameters of noise and filter and applying inverse operations “Clean” speech x[m] h[m] n[m] z[m] Linear filtering Degraded speech Additive noise Classical signal enhancement: compensation of speech for noise and filtering

Carnegie Mellon Slide Digital Signal Processing I “Classical” combined compensation improves accuracy in stationary environments Threshold shifts by ~7 dB Accuracy still poor for low SNRs CMN (baseline) Complete retraining VTS (1997) CDCN (1990) –7 dB 13 dB Clean Original “Recovered”

Carnegie Mellon Slide Digital Signal Processing I Another type of signal enhancement: adaptive noise cancellation Speech + noise enters primary channel, correlated noise enters reference channel Adaptive filter attempts to convert noise in secondary channel to best resemble noise in primary channel and subtracts Performance degrades when speech leaks into reference channel and in reverberation

Carnegie Mellon Slide Digital Signal Processing I Simulation of noise cancellation for a PDA using two mics in “endfire” configuration Speech in cafeteria noise, no noise cancellation Speech with noise cancellation But …. simulation assumed no reverb

Carnegie Mellon Slide Digital Signal Processing I Signal separation: speech is quite intelligible, even when presented only in fragments Procedure: –Determine which time-frequency time- frequency components appear to be dominated by the desired signal –Reconstruct signal based on “good” components A Monaural example: –Mixed signals - –Separated signals -

Carnegie Mellon Slide Digital Signal Processing I Practical signal separation: Audio samples using selective reconstruction based on ITD RT60 (ms) No Proc Delay-sum ZCAE-bin ZCAE-cont

Carnegie Mellon Slide Digital Signal Processing I Summary Lots of interesting topics that extend core material from DSP Greater emphasis on implementation and applications Greater emphasis on statistically-optimal signal processing I hope that you have as much fun with this material as I have had!

Carnegie Mellon Slide Digital Signal Processing I Academic integrity (i.e. cheating and plagiarism) CMU’s take on academic integrity: – Most important rule: Don’t cheat! But what do we mean by that? –Discussing general strategies on homework with other students is OK –Solving homework together is NOT OK –Accessing material from previous years is NOT OK –“Collaborating” on exams is REALLY REALLY NOT OK!