Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

Introduction to Digital Audio
Sound can make multimedia presentations dynamic and interesting.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Part A Multimedia Production Rico Yu. Part A Multimedia Production Ch.1 Text Ch.2 Graphics Ch.3 Sound Ch.4 Animations Ch.5 Video.
Chapter 4: Representation of data in computer systems: Sound OCR Computing for GCSE © Hodder Education 2011.
Physical modeling of speech XV Pacific Voice Conference PVSF-PIXAR Brad Story Dept. of Speech, Language and Hearing Sciences University of Arizona.
SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.
Intro. to Audio Signals Jyh-Shing Roger Jang ( 張智星 ) MIR Lab, CSIE Dept National Taiwan Univ., Taiwan.
Digital Audio.
Image and Sound Editing Raed S. Rasheed Digital Sound Digital sound types – Monophonic sound – Stereophonic sound – Quadraphonic sound – Surround.
Digital audio recording Kimmo Tukiainen. My background playing music since I was five first time in a studio at fourteen recording on my own for six months.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
Introduction to Acoustics Words contain sequences of sounds Each sound (phone) is produced by sending signals from the brain to the vocal articulators.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
SCA Introduction to Multimedia
1 Lab Preparation Initial focus on Speaker Verification –Tools –Expertise –Good example “Biometric technologies are automated methods of verifying or recognising.
Introduction to Sound Sounds are vibrations that travel though the air or some other medium A sound wave is an audible vibration that travels through.
 Continuous sequence of vibrations of air  (Why no sound in space? Contrary to Star Wars etc.)  Abstraction of an audio wave:  Ear translates vibrations.
CSC 8610 & 5930 Multimedia Technology Lecture 4 Digital Audio Representation.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Introduction to Interactive Media 10: Audio in Interactive Digital Media.
Lecture # 22 Audition, Audacity & Sound Editing Sound Representation.
DTC 354 Digital Storytelling Rebecca Goodrich. Wave made up of changes in air pressure by an object vibrating in a medium—water or air.
Sound and audio. Table of Content 1.Introduction 2.Properties of sound 3.Characteristics of digital sound 4.Calculate audio data size 5.Benefits of using.
COSC 1P02 Introduction to Computer Science 4.1 Cosc 1P02 Week 4 Lecture slides “Programs are meant to be read by humans and only incidentally for computers.
Modication by tuti 1 LECTURE 7 THE USES OF DIGITAL AUDIO IN MULTIMEDIA.
Music Tech.  What is the definition of sound?  What is a wave?
Digital Sound Actual representation of sound Stored in form of thousands of individual numbers (called samples) Not device dependent Stored in bits.
Day 6 Exam I is on Thursday. Be sure to attend lab this week.
Chapter 15 Sounds.
Demos Vernier microphone, logger pro, physics with computers, voice program measure voice, chilandi plates, tuning forks, anyone with perfect pitch, meter.
Audio / Sound INTRODUCTION TO MULTIMEDIA SYSTEMS Lect. No 3: AUDIO TECHNOLOGY.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
Sound Waveforms Neil E. Cotter Associate Professor (Lecturer) ECE Department University of Utah CONCEPT U AL TOOLS.
Sound in Multimedia Psychology of sound what do you use it for? what techniques for its communication exist? Science of sound why does it exist? how it.
Multimedia Sound. What is Sound? Sound, sound wave, acoustics Sound is a continuous wave that travels through a medium Sound wave: energy causes disturbance.
P105 Lecture #27 visuals 20 March 2013.
Speech Recognition with Matlab ® Neil E. Cotter ECE Department UNIVERSITY OF UTAH
Introduction to Digital Audio
DSP First, 2/e LECTURE #1 Sinusoids Aug © , JH McClellan & RW Schafer.
Chapter 4 Fundamentals of Digital Audio
B. Harpsichord Strings are plucked
The Human Voice. 1. The vocal organs
Discrete Fourier Transform (DFT)
Ch. 2 : Preprocessing of audio signals in time and frequency domain
Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)
Introduction to Digital Audio
The Human Voice. 1. The vocal organs
Department of Electronics and Commn. Engg.
Sound Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman
Higher Intensity (Volume)
Multimedia Fundamentals(continued)
Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)
Mobile Systems Workshop 1 Narrow band speech coding for mobile phones
Introduction to Digital Audio
Digital Media Lecture 12: Additional Audio Georgia Gwinnett College
Introduction to Digital Audio
Remember me? The number of times this happens in 1 second determines the frequency of the sound wave.
Introduction to Digital Audio
Elements of Acoustic Phonetics
The Production of Speech
Assist. Lecturer Safeen H. Rasool Collage of SCIENCE IT Dept.
Sound and Matlab® Neil E. Cotter ECE Department
Signal Processing and Data Analysis Simon Godsill Lent 2015
Introduction to Digital Audio
Sound and Matlab® Neil E. Cotter ECE Department
Recap In previous lessons we have looked at how numbers can be stored as binary. We have also seen how images are stored as binary. This lesson we are.
Duration & Pitch Modification via WSOLA
Presentation transcript:

Intro. to Audio Signals Jyh-Shing Roger Jang (張智星) http://mirlab.org/jang MIR Lab, CSIE Dept National Taiwan Univ., Taiwan

What Are Audio Signals? Audio signals are… Signals that are audible to human, such as speech and music The range of fundamental frequencies of audible signals is about 20 ~ 20000 Hz. The range is wider for the young people, narrower for the elderly. Quiz! Quiz!

Voice Generation & Reception Steps in voice generation & reception Vibration of voice source Resonance by surrounding objects Traveling through air (or other media) Reception of membranes and neurons at inner ears Recognition by brains Examples Singing Whistling Guitar Flute Pressure wave Sound waveform

Categorization of Audio Signals Number of sources Monophonic: example Polyphonic: example Waveform Quasi-periodic sound voiced sound of speech Aperiodic sound Unvoiced sound of speech Source types Sounds from animals (bioacoustics) Dog barking, cat meowing, frog croaking, duck quacking, cow mooing… Sounds from non-animals Car engines, thunders, music instruments

Parameters for Recording Quiz! Three major parameters for recording audio files Sample rate: no. of samples per second 8 kHz (phone quality) 16 KHz (for common speech recognition) 44.1 KHz (CD quality) Bit resolution: no. of bits for representing a sample 8-bit (uint8 with range: 0~255) 16-bit (int16 with range: -32768~32767) No of channels Mono: 1 channel Stereo: 2 channels Quiz! Hz = Hertz = samples/sec (Also used for fundamental frequency…)

Live Recording Three major parameters for recording audio files Sample rate, bit resolution, and no. of channels Demo of recording via Cooledit Compare of the waveforms of a tuning fork and human speech of vowels. What is the major difference? Why? Quiz!

Tools for General Audio Processing Tools for real-time recording and waveform display Audacity CoolEdit GoldWave MATLAB

S/U/V in Speech Speech signals can be divided into S, U, V S (silence): no speech activity U (unvoiced): speech activity without vibration from vocal chords V (voiced): speech activity with vibration How to detect S, U, V? By putting your hand on your throat to feel the vibration By waveform observation Quiz!

Speech Signal of “Sunday” Unvoiced vs. voiced frames Books/audioSignalProcessing/example/displaySunday.m

Silence, Unvoiced and Voiced Sounds Examples of S, U, V “Six” “資訊系” Quiz! s u v s u v

Storage for Audio Files Examples of storage requirement 1 min. of recording with fs=16000, nbits=16, #channel=1 60 (sec)*16 (KHz)*2 (bytes)*1 (channel) = 1920 KB = 1.92 MB 3-mins of CD music with fs=44.1KHz, nbits=16, #channel=2  180 (sec)*44.1 (KHz)*2 (bytes)*2 (channels) = 31752 KB = 32 MB Quiz! MP3 compression ratio is about 10!

Human Speech Production

Source-filter Model for Human Speech Production Speech is split into a rapidly varying excitation signal and a slowly varying filter. The envelope of the power spectra contains the vocal tract info. Two important characteristics of the model are fundamental frequency (f0) and formants (F1, F2, F3, …) Pharyngeal cavity Nasal cavity Oral cavity unvoiced voiced

The Vocal Tract

Glottal Volume Velocity & Resulting Sound Pressure (Voiced)

(c) Output Energy Spectrum Speech Production Glottal Pulses Vocal Tract Speech Signal = + + = (a) Source Spectrum (b) Filter Function (c) Output Energy Spectrum

Videos for Vocal Cords Movement Movement of vocal cords http://www.youtube.com/watch?v=mJedwz_r2Pc http://www.youtube.com/watch?v=v9Wdf-RwLcs

Other Interesting Phenomena Interesting phenomena about audio signals Don’t trust what you have heard! (Vision rules) Perceived speech is highly context dependent:

Hints for Exercises How to generate a sine wave signal: Math formula: MATLAB code: duration=3; f=440; fs=16000; time=(0:duration*fs-1)/fs; y=sin(2*pi*f*time); plot(time, y); sound(y, fs);