Synthetic Audio A Brief Historical Introduction Generating sounds Synthesis can be “additive” or “subtractive” Additive means combining components (e.g.,

Slides:



Advertisements
Similar presentations
Basic Tools for Understanding Synthesis. Synthesizer A musical instrument that produces waveforms, typically in the audio range of about 20 to 20,000.
Advertisements

Acoustic Characteristics of Consonants
Analysis and Digital Implementation of the Talk Box Effect Yuan Chen Advisor: Professor Paul Cuff.
Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Speech Synthesis December 4, 2014 Gentle Reminders Final exam: Friday, December 12 th, 3:30 – 5:30 pm In this room! Final exam review: Wednesday, December.
PHONETICS AND PHONOLOGY
Chapter 7 Principles of Analog Synthesis and Voltage Control Contents Understanding Musical Sound Electronic Sound Generation Voltage Control Fundamentals.
The Human Voice. I. Speech production 1. The vocal organs
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
Introduction to Acoustics Words contain sequences of sounds Each sound (phone) is produced by sending signals from the brain to the vocal articulators.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.
PHYS 103 lecture 29 voice acoustics. Vocal anatomy Air flow through vocal folds produces “buzzing” (like lips) Frequency is determined by thickness (mass)
Phonetics (Part 1) Dr. Ansa Hameed.
1 Frequency Domain Analysis/Synthesis Concerned with the reproduction of the frequency spectrum within the speech waveform Less concern with amplitude.
EE 225D, Section I: Broad background Synthesis/vocoding history (chaps 2&3) Recognition history (chap 4) Machine recognition basics (chap 5) Human recognition.
Introduction to Speech Synthesis ● Key terms and definitions ● Key processes in sythetic speech production ● Text-To-Phones ● Phones to Synthesizer parameters.
Dr. Tim Collins EE2F2 – Music Technology Dr. Tim Collins
Introduction to Speech Production Lecture 1. Phonetics and Phonology Phonetics: The physical manifestation of language in sound waves. –How sounds are.
Acoustics of Instruments Music Theory Class Gettysburg College.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
Analysis & Synthesis The Vocoder and its related technology.
7/3/2015Hillenbrand: Source Filter Theory1 Source-Filter Theory Source-filter theory can be used to describe and explain any kind of speech – vowels,
1 Speech synthesis 2 What is the task? –Generating natural sounding speech on the fly, usually from text What are the main difficulties? –What to say.
Voice Transformations Challenges: Signal processing techniques have advanced faster than our understanding of the physics Examples: – Rate of articulation.
 All objects have a NATURAL FREQUENCY at which they tend to vibrate. This frequency depends on the material the object is made of, the shape, and many.
Physics 1251 The Science and Technology of Musical Sound Unit 3 Session 31 MWF The Fundamentals of the Human Voice Unit 3 Session 31 MWF The Fundamentals.
A Brief Exploration of Electronic Music and its Theory By: Zac Changnon.
Speech Communications (Chapter 7) Prepared by: Ahmed M. El-Sherbeeny, PhD 1.
Speech synthesis Recording and sampling Speech recognition Apr. 5
Thursday 13 th September. Periods Music Technology A2.
Computer Sound Synthesis 2
Speech Synthesis April 14, 2009 Some Reminders Final Exam is next Monday: In this room (I am looking into changing the start time to 9 am.) I have a.
Phonetics: the generation of speech Phonemes “The shortest segment of speech that, if changed, would change the meaning of a word.” hog fog log *Phonemes.
Synthesis advanced techniques. Other modules Synthesis would be fairly dull if we were limited to mixing together and filtering a few standard waveforms.
Chapter 7 SPEECH COMMUNICATIONS
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
Physics 1251 The Science and Technology of Musical Sound Unit 4 Session 37 MWF Synthesizers Unit 4 Session 37 MWF Synthesizers.
Applied Speech Sciences 4/11/00. Speech Science Application Speech production via computers Forensics- criminal investigations; voice prints Assessing.
The end of vowels + The beginning of fricatives November 19, 2012.
Subtractive Sound Synthesis. Subtractive Synthesis Involves subtracting frequency components from a complex tone to produce a desired sound Why is it.
Speech Synthesis April 12, 2013 Speech Synthesis: A Basic Overview Speech synthesis is the generation of speech by machine. The reasons for studying.
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
Perception + Synthesis April 6, 2010 CP Results.
(Extremely) Simplified Model of Speech Production
Sound Waveforms Neil E. Cotter Associate Professor (Lecturer) ECE Department University of Utah CONCEPT U AL TOOLS.
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Introduction to Digital Speech Processing Presented by Dr. Allam Mousa 1 An Najah National University SP_1_intro.
Matakuliah: G0922/Introduction to Linguistics Tahun: 2008 Session 3 Phonetics: Consonants.
Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal.
THE SOUNDS OF LANGUAGE the study of inventory and structure of the sounds of language.  Human can produce any number of sounds including those we never.
Synthesisers. A Synthesizer is an electronic sound generating device, that creates sounds using oscillators and filters. An Oscillator provides the sound.
IIT Bombay ISTE, IITB, Mumbai, 28 March, SPEECH SYNTHESIS PC Pandey EE Dept IIT Bombay March ‘03.
Speech Generation and Perception
P105 Lecture #27 visuals 20 March 2013.
Speech Recognition with Matlab ® Neil E. Cotter ECE Department UNIVERSITY OF UTAH
Types of Synthesizers and How They Work
B. Harpsichord Strings are plucked
Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)
The Human Voice. 1. The vocal organs
an Introduction to English
CS 591 S1 – Computational Audio -- Spring, 2017
Structure of Spoken Language
The Human Voice. 1. The vocal organs
Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)
What is a sound? Sound is a pressure wave in air or any other material medium. The human ear and brain working together are very good at detecting and.
Presentation transcript:

Synthetic Audio A Brief Historical Introduction

Generating sounds Synthesis can be “additive” or “subtractive” Additive means combining components (e.g., sinusoids) Subtractive means filtered Analogy to physical mechanisms The human speech example …

Von Kempelen’s chess-playing “automaton”

Wheatstone’s Speaking Machine (from von Kempelen) Vibrating reed to simulate vocal cords Nasal passage Bellows for producing pressure Leather “vocal tract” Whistles for “s” and “sh” sounds

The Voice Operation Demonstrator (Voder) Shown at New York World’s Fair, 1939; also San Francisco Exhibition, 1939 Apparently the first electronic synthesizer Required a human operator, long training Related to human voice production, but not a physical model (no tongue analogy, etc.) Used filters to model the effect of varying vocal tract shape

(Extremely) Simplified Model of Speech Production Periodic source Noise source Filters voiced unvoiced Coupling Speech What does the spectrum of a periodic source look like?

The Voder at the 1939 World’s Fair in New York

Daisy VODER EXAMPLES Extended demo

Excitations for speech sounds Periodic: vowels, glides, liquids Noise: voiceless fricatives (f,s) Both: voiced fricatives (v,z) Burst-like sounds: p-b,k-g,t-d

Later Speech Synthesis methods Phonemic Synth by rule: 1961 Cascaded resonances (Fant, 1953) Parallel resonances (Holmes, 1973) (synth followed by natural) KlattTalk -> DECTalk (1970’s) Speak & Spell (1979) Concatenation (unit selection) Much more on synthesis later in course

Music Machines Barrel organs (water or spring powered) –Like music boxes, pins pluck or depress keys Melography (18th century) - writable medium –Punched cards, as in Jacquard loom Player pianos -> modern digital versions (Bosendorfer) Telharmonium - additive sinusoidal synthesis –Lots of 1900 generators = one huge machine

Electronic music Theramin - player varies capacitances, alters frequency and amplitude for sinusoid Analog synthesizers - oscillators, mixers, etc. - Moog, later FM synthesis Digital synthesizers - the modern way

Professor Theremin gives a lesson …