CS1315: Introduction to Media Computation Sound Encoding and Manipulation.

Slides:

Advertisements

Similar presentations

Introduction to Digital Audio

Advertisements

Georgia Institute of Technology Introduction to Processing Digital Sounds.

Properties of Sound Neil Freebern. Sound Sound is produced when something vibrates. Vibrations disturb the air, creating variations in air pressure. Variation.

Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.

From the air to the iPod. Minute disturbances in the air, caused by a vibrating object Air molecules bunch together, then spread out Changes in density.

SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.

IT-101 Section 001 Lecture #8 Introduction to Information Technology.

School of Computing Science Simon Fraser University

Chapter 2 : Business Information Business Data Communications, 4e.

Fundamentals of Digital Audio. The Central Problem n Waves in nature, including sound waves, are continuous: Between any two points on the curve, no matter.

Digital Audio Multimedia Systems (Module 1 Lesson 1)

 Principles of Digital Audio. Analog Audio  3 Characteristics of analog audio signals: 1. Continuous signal – single repetitive waveform 2. Infinite.

Computer Science 101 Introduction to Programming with Sounds.

Introduction to Computing and Programming in Python: A Multimedia Approach Chapter 6: Modifying Sounds Using Loops.

Introduction to Sound Sounds are vibrations that travel though the air or some other medium A sound wave is an audible vibration that travels through.

Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.

Computer Science 121 Scientific Computing Winter 2014 Chapter 13 Sounds and Signals.

School of Informatics CG087 Time-based Multimedia Assets Sampling & SequencingDr Paul Vickers1 Sampling & Sequencing Combining MIDI and audio.

Lab #8 Follow-Up: Sounds and Signals* * Figures from Kaplan, D. (2003) Introduction to Scientific Computation and Programming CLI Engineering.

Introduction to Interactive Media 10: Audio in Interactive Digital Media.

Lecture # 22 Audition, Audacity & Sound Editing Sound Representation.

DTC 354 Digital Storytelling Rebecca Goodrich. Wave made up of changes in air pressure by an object vibrating in a medium—water or air.

CompSci Today’s topics Sound Upcoming ä Intellectual property ä Network analysis ä Security Reading.

Computing with Digital Media: A Study of Humans and Technology Mark Guzdial, School of Interactive Computing.

CSC361/661 Digital Media Spring 2002

Digital Media Dr. Jim Rowan ITEC Monday, August 27.

Computer Some basic concepts. Binary number Why binary? Look at a decimal number: 3511 Look at a binary number: 1011 counting decimal binary

Announcements Chapter 11 for today No quiz this week Instructor got behind…. We'll be back in MGH389 on Friday.

CompSci Today’s topics Sound Upcoming ä Sound splicing ä Intellectual property ä Network analysis Reading Introduction to Computing & Programming.

Digital Recording Theory Using Peak. Listening James Tenney, Collage #1 (“Blue Suede”),  Available in Bracken Library, on James Tenney Selected.

Basics of Digital Audio Outline  Introduction  Digitization of Sound  MIDI: Musical Instrument Digital Interface.

COSC 1P02 Introduction to Computer Science 4.1 Cosc 1P02 Week 4 Lecture slides “Programs are meant to be read by humans and only incidentally for computers.

CIS-325: Data Communications1 CIS-325 Data Communications Dr. L. G. Williams, Instructor.

Introduction to SOUND.

Sound Waves Physic 11 · Chapter 9.

SOUND SOUND IS A FORM OF ENERGY SOUND IS A WAVE SOUND WAVES REQUIRE A MEDIUM SOUND HAS PROPERTIES: –AMPLITUDE –FREQUENCY SOUND VOLUME IS MEASURED IN DECIBELS.

Georgia Institute of Technology Introduction to Processing Digital Sounds part 1 Barb Ericson Georgia Institute of Technology Sept 2005.

The Physics Of Sound Why do we hear what we hear?

1 Introduction to Information Technology LECTURE 6 AUDIO AS INFORMATION IT 101 – Section 3 Spring, 2005.

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.

Encoding and Simple Manipulation

CS 102 Computers In Context (Multimedia)‏ 03 / 30 / 2009 Instructor: Michael Eckmann.

Introduction to Computing and Programming in Python: A Multimedia Approach Chapter 6: Modifying Sounds Using Loops.

Intro-Sound-part1 Introduction to Processing Digital Sounds part 1 Barb Ericson Georgia Institute of Technology Oct 2009.

CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.

The Physics Of Sound Why do we hear what we hear? (Turn on your speakers)

Recording Arts…Audio Sound Waves Fall What does this all mean to you in this class? You are always working with sound waves – it is important to.

Multimedia Sound. What is Sound? Sound, sound wave, acoustics Sound is a continuous wave that travels through a medium Sound wave: energy causes disturbance.

Session 18 The physics of sound and the manipulation of digital sounds.

Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.

Chapter 12 Preview Objectives The Production of Sound Waves

1 CS 177 Week 7 Recitation Slides Modifying Sounds using Loops + Discussion of some Exam Questions.

Physics Mrs. Dimler SOUND.  Every sound wave begins with a vibrating object, such as the vibrating prong of a tuning fork. Tuning fork and air molecules.

Fourier Analysis Patrice Koehl Department of Biological Sciences National University of Singapore

The Physics of Sound.

Multimedia Systems and Applications

Introduction to Digital Audio

"Digital Media Primer" Yue-Ling Wong, Copyright (c)2013 by Pearson Education, Inc. All rights reserved.

Chapter 6: Modifying Sounds Using Loops

Introduction to Digital Audio

Introduction to Digital Audio

How sound works: Acoustics, the physics of sound

Introduction to Digital Audio

Introduction to Processing Digital Sounds

Introduction to Digital Audio

Recap In previous lessons we have looked at how numbers can be stored as binary. We have also seen how images are stored as binary. This lesson we are.

CS1315: Introduction to Media Computation

Presentation transcript:

CS1315: Introduction to Media Computation Sound Encoding and Manipulation

Sound is made when something vibrates The vibration disturbs the air around it and makes changes in air pressure These changes in air pressure move through the air as sound waves The sound waves cause pressure changes against our ear drum sending nerve impulses to our brain

Sound waves are pressure waves Each air molecule oscillates back and forth, affecting their neighbors…air molecules themselves don’t travel from vibration source to your ear Creates areas of high and low pressure, called compressions and rarefactions

Sound waves are longitudinal Representation of sound by a sine wave is merely an attempt to illustrate the sinusoidal nature of the pressure-time fluctuations

Properties of sound waves Pressure fluctuation comes in cycles  frequency of wave is number of cycles per second (cps), or Hertz (Complex sounds have more than one frequency in them.)  amplitude is maximum height of the wave (aka maximum pressure fluctuation) Each repetition of a waveform is called a cycle

Volume and pitch: Psychoacoustics, the psychology of sound Our perception of pitch is related (logarithmically) to frequency  Higher frequencies are perceived as higher pitches  A above middle C is 440 Hz Our perception of volume is related (logarithmically) to changes in amplitude  If the amplitude doubles, it’s about a 3 decibel (dB) change

Humans and Pitch In general we can hear frequencies between 20 Hz and 20,000 Hz Hz = hertz = cycles per second 20,000 Hz = 20 kHz (kilohertz) The older we get, the less we can perceive really high frequencies – recall those commercials for ‘silent’ ring-tones for teens Many animals can make sounds and hear frequencies that are beyond what we can hear – hence the creation of dog whistles

“Logarithmically?” It’s strange, but our hearing works on ratios not differences, e.g., for pitch.  We hear the difference between 200 Hz and 400 Hz, as the same as 500 Hz and 1000 Hz  Similarly, 200 Hz to 600 Hz, and 1000 Hz to 3000 Hz Intensity (volume) is measured as watts per meter squared  A change from 0.1W/m 2 to 0.01 W/m 2, sounds the same to us as 0.001W/m 2 to W/m 2

Decibel is a logarithmic measure A decibel is a ratio between two intensities: 10 * log 10 (I 1 /I 2 )  As an absolute measure, it’s in comparison to threshold of audibility  0 dB can’t be heard.  Normal speech is 60 dB.  A shout is about 80 dB

Digitizing Sound: How do we get that into numbers? Waves are analog (continuous) Remember in calculus, estimating the curve by creating rectangles? We can do the same to estimate the sound curve  Analog-to-digital conversion (ADC) will give us the amplitude at an instant as a number: a sample  How many samples do we need? Remember…pictures are continuous. How do we represent them digitally?

Understanding how computers represent sound Consider how a film represents motion…  Movie is made by taking still photos in rapid sequence at a constant rate, usually twenty-four frames per second  When the photos are displayed in sequence at that same rate, it fools us into thinking we are seeing continuous motion, even though we are actually seeing twenty-four discrete images per second

A collection of still frames

How computers represent sound Digital recording of sound works on the same principle  We take many discrete samples of the sound wave's instantaneous amplitude, store that information, then later reproduce those amplitudes at the same rate to create the illusion of a continuous wave

How often should we take samples? Many many samples must be taken per second-- many more than are necessary for filming a visual image In fact, we need to take more than twice as many samples as the highest frequency we wish to record…enter Nyquist Theorem The number of samples taken per second is known as the sampling rate

Nyquist Theorem* (will be on exam) We need twice as many samples as the maximum frequency in order to represent (and recreate, later) the original sound. The number of samples recorded per second is the sampling rate  If we capture 8000 samples per second, the highest frequency we can capture is 4000 Hz That’s how phones work  CD quality is 44,100 samples per second …what is the max frequency a CD can represent?

Nyquist Theorem Think back to the example of a film camera, which shoots 24 frames per second  If we're shooting a movie of a car, and the car wheel spins at a rate greater than 12 revolutions per second, it's exceeding half the "sampling rate" of the camera… the wheel completes more than 1/2 revolution per frame.  If, for example it actually completes 18/24 of a revolution per frame, it will appear to be going backward at a rate of 6 revolutions per second  In other words, if we don't witness what happens between samples, a 270-degree revolution of the wheel is indistinguishable from a -90-degree revolution. The samples we obtain in the two cases are precisely the same

Nyquist Theorem For audio sampling, the phenomenon is practically identical Consider a graph of a 4,000 Hz cosine wave, being sampled at a rate of 22,050 Hz Sample is taken every milliseconds

Nyquist Theorem Consider same 4,000 Hz cosine wave sampled at 6,000 Hz Sample is taken every milliseconds Wave completes more than 1/2 cycle per sample, and the resulting samples are indistinguishable from those that would be obtained from a 2,000 Hz cosine wave

Nyquist Theorem The simple lesson to be learned from the Nyquist theorem: digital audio cannot accurately represent any frequency greater than half the sampling rate If we want to record frequencies as high as 20,000 Hz, how many times per second would we need to sample the sound?

Digitizing sound with the computer: bit depth Each sample is a numerical value representing the instantaneous amplitude of the signal at the moment it was sampled The range of possible numbers used by a computer depends on the number of binary digits (bits) used to store each number As the number of bits increases, the range of possible numbers they can express increases by a power of two How many bits did we use per color channel per pixel? How many values could each color channel take on?

Digitizing sound with the computer Each sample is stored as a number (two bytes=16 bits) What’s the range of available combinations?  16 bits, 2 16 = 65,536  But we want both positive and negative values To indicate compressions and rarefactions.  What if we use one bit to indicate positive (0) or negative (1)?  That leaves us with = 15 bits  15 bits, 2 15 = 32,768  One of those combinations will stand for zero We’ll use a “positive” one, so that’s one less pattern for positives

+/- 32K Each sample can be between -32,768 and 32,767 Compare this to for light intensity (i.e. 8 bits or 1 byte giving us 256 different values) Why such a bizarre number? Because 32, , = 2 16 i.e. 16 bits, or 2 bytes< 0 > 0 0

Sounds as arrays (called lists in jython) Samples are just stored one right after the other in the computer’s memory That’s called an array  It’s an especially efficient (quickly accessed) memory structure (Like pixels in a picture)

Working with sounds We’ll use pickAFile and makeSound.  We want.wav files (NOTE: not all.wav files will work) We’ll use getSamples to get all the sample objects out of a sound We can also get the value at any index with getSampleValueAt Can get a sound’s length (getLength(sound)) Can get a sound’s sampling rate (getSamplingRate(sound)) Can save sounds with writeSoundTo(sound,"file.wav")

Demonstrating Working with Sound in JES >>> filename = pickAFile() >>> print filename /Users/monica/Desktop/MediaSources/preamble.wav >>> sound = makeSound(filename) >>> print sound Sound of length >>> samples = getSamples(sound) >>> print samples Samples, length >>> print getSampleValueAt(sound, 1) 36 >>> print getSampleValueAt(sound, 2) 29

Demonstrating working with samples >>> print getLength(sound) >>> print getSamplingRate(sound) >>> print getSampleValueAt(sound, ) 68 >>> print getSampleValueAt(sound, ) # note this is too high I wasn't able to do what you wanted. The error java.lang.ArrayIndexOutOfBoundsException has occured Please check line 0 of >>> print getSampleValueAt(sound, 1) 36 >>> setSampleValueAt(sound,1, 12) >>> print getSampleValueAt(sound, 1) 12

Working with Samples We can get sample objects out of a sound with getSamples(sound) or getSampleObjectAt(sound, index) A sample object it still part of the sound object, so if you change the sample object, the sound changes. Sample objects understand getSampleValue(sample) and setSampleValue(sample, value)

Example: Manipulating Samples >>> soundfile=pickAFile() >>> sound=makeSound(soundfile) >>> sample=getSampleObjectAt(sound, 1) >>> print sample Sample at 1 value at 59 >>> print sound Sound of length >>> print getSound(sample) Sound of length >>> print getSampleValue(sample) 59 >>> setSampleValue(sample, 29) >>> print getSampleValue(sample) 29

“But there are thousands of these samples!” How do we do something to these samples to manipulate them, when there are thousands of them per second? We use a loop and get the computer to iterate in order to do something to each sample. An example loop that just gets and reassigns the same value: for sample in getSamples(sound): value = getSampleValue(sample) setSampleValue(sample, value)

N.B.: function name changes setSampleValue(sample, 200)  used to be called setSample(sample, 200) getSampleValue(sample)  used to be called getSample(sample) So if you are using an old version of JES (pre-3.1) you may have to use setSample and getSample.

Properties of sound Frequency Amplitude Wavelength

Properties of digital sound Sampling rate/frequency – not to be confused with the frequency of the sound we are trying to capture! Bit depth