DSP II: Final presentation Vocoder - making music talk Van Damme Wim Hemeryck Martijn.

Slides:



Advertisements
Similar presentations
An Approach in Reproducing the Auto-Tune Effect Mentees: Dong-San Choi & Tejas Rawal Mentor: David Jun.
Advertisements

Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Liner Predictive Pitch Synchronization Voiced speech detection, analysis and synthesis Jim Bryan Florida Institute of Technology ECE5525 Final Project.
Easily extensible unix software for spectral analysis, display modification, and synthesis of musical sounds James W. Beauchamp School of Music Dept.
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Speaker Identification Using Pitch Engineering Expo Banquet /08/09.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
CENTER FOR SPOKEN LANGUAGE UNDERSTANDING 1 PREDICTION AND SYNTHESIS OF PROSODIC EFFECTS ON SPECTRAL BALANCE OF VOWELS Jan P.H. van Santen and Xiaochuan.
Speaker Recognition Sharat.S.Chikkerur Center for Unified Biometrics and Sensors
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
Automatic Lip- Synchronization Using Linear Prediction of Speech Christopher Kohnert SK Semwal University of Colorado, Colorado Springs.
Analysis and Synthesis of Shouted Speech Tuomo Raitio Jouni Pohjalainen Manu Airaksinen Paavo Alku Antti Suni Martti Vainio.
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Special Topic: Sound. Overview What is sound? How is it made? How can I make sound with the Wunderboard?
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
2001/05/24Chin-Kai Wu, CS, NTHU1 Improved frame erasure concealment for CELP-based coders Juan Carlos De Martin, Takahiro Unno, Vishu Viswanathan DSPS.
System Microphone Keyboard Output. Cross Synthesis: Two Implementations.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
Effects in frequency domain Stefania Serafin Music Informatics Fall 2004.
Analysis & Synthesis The Vocoder and its related technology.
1 Chapter 1 Introduction. 2 Outline 1.1 A Very Abstract Summary 1.2 History 1.3 Model of the Signaling System 1.4 Information Source 1.5 Encoding a Source.
Text-To-Speech Synthesis An Overview. What is a TTS System  Goal A system that can read any text Automatic production of new sentences Not just audio.
Electronics Design Laboratory Lecture #11, Fall 2014
WHAT IS COMMUNICATION? The root word of Communication is the Latin word Communicare. which means “ to make common to many, share” Therefore, Communication.
LE 460 L Acoustics and Experimental Phonetics L-13
Voice Over IP Developing IPHONE Jeremy Stanley CS 460 section 1.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
NOISE DETECTION AND CLASSIFICATION IN SPEECH SIGNALS WITH BOOSTING Nobuyuki Miyake, Tetsuya Takiguchi and Yasuo Ariki Department of Computer and System.
COPYRIGHT © All rights reserved by Sound acoustics Germany The averaged quality measures over all test cases indicate the real influence of a test object.
Martin Hewitson Overview of DC work. GEO DC workshop June DC work Noise characterisation Noise projections, noise sources, noise couplings Calibration.
Pitch Determination by Wavelet Transformation Santhosh Bellikoth ECE Speech Processing Instructor: Dr Kepuska.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
Dynamic Captioning: Video Accessibility Enhancement for Hearing Impairment Richang Hong, Meng Wang, Mengdi Xuy Shuicheng Yany and Tat-Seng Chua School.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
SOUND By Logan, Emma, Genevieve, and Bella. Headphones  Sometimes at music concerts or airports musicians or workers wear earplugs to not hear all the.
(Extremely) Simplified Model of Speech Production
Sound Waveforms Neil E. Cotter Associate Professor (Lecturer) ECE Department University of Utah CONCEPT U AL TOOLS.
1/20 System Overview Cyclic mo-cap data (walking, running..) Cyclic mo-cap data (walking, running..) Music / Sound (audio) Music / Sound (audio) Resulting.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
Performance Comparison of Speaker and Emotion Recognition
SPEECH CODING Maryam Zebarjad Alessandro Chiumento Supervisor : Sylwester Szczpaniak.
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
Sound Quality.
Auto-chromatic Musical Instrument Tuner Craig Janus and Robert Schmanski Advisor: Dr. James Irwin.
Music Transcription through Statistical Analysis Group 3 Austin Assavavallop, William Feater, Greg Heim, Philipp Pfieffenberger, Wamba Yves Design Phase.
Speaker Verification System Middle Term Presentation Performed by: Barak Benita & Daniel Adler Instructor: Erez Sabag.
Speech Recognition with Matlab ® Neil E. Cotter ECE Department UNIVERSITY OF UTAH
Decoder Chapter 12 Subject: Digital System Year: 2009.
Speech emotion detection General architecture of a speech emotion detection system: What features?
Motivation ● The (Ham) world needs an open source, patent free speech codec at bit rates of less than 5000 bit/s ● I know how to build one!
Introduction to Audio Watermarking Schemes N. Lazic and P
Spectrum Analysis and Processing
Automatic Speech Processing Project
Vocoders.
III Digital Audio III.9 (Wed Oct 25) Phase vocoder for tempo and pitch changes.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Elements of Music.
Making coils more musical
III Digital Audio III.9 (Wed Oct 24) Phase vocoder for tempo and pitch changes.
The Vocoder and its related technology
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Presentation transcript:

DSP II: Final presentation Vocoder - making music talk Van Damme Wim Hemeryck Martijn

Overview Summary LPC vocoder theory LPC vocoder demonstration Channel vocoder theory Channel vocoder demonstration

Summary LPC vocoder theory Encoder Decoder

Segmentation Use of a window function: hanning Without window overlap: not very smooth sound, many glitches With window overlap: slightly smoother, but sometimes messy pitch and intonation

V/U/S detector Goal was the lpc-10e standard: weighted voicing detection using several indicators → too little information, well hidden Tremain paper In the end: a fairly simple speaker independent (normalized!) voicing detector using MSF, ZCR and pitch information

Demonstration LPC No window overlap – High pitch (female) – Low pitch (male) Window overlap – High pitch (female) – Low pitch (male) – Own voice → more noise

LPC vocoder and music Basic idea: replace pulse train for voiced frames with periodic elementary note shapes of a music instrument, at a frequency corresponding to the pitch. No access to such libraries Therefore: channel vocoder…

Channel Vocoder theory

Channel vocoder demonstration Example ‘Around the world’ Real-Time