Download presentation
Presentation is loading. Please wait.
Published byBritney Wilcox Modified over 9 years ago
1
R ESEARCH BY E LAINE C HEW AND C HING -H UA C HUAN U NIVERSITY OF S OUTHERN C ALIFORNIA P RESENTATION BY S EAN S WEENEY D IGI P EN I NSTITUTE OF T ECHNOLOGY CS 582 / A PRIL 17, 2011 D R. D IMITRI V OLPER Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm
2
Presentation Flow Musical Pitch and Key Human Perception of Pitch The Spiral Array Model Pitches Chords Keys The CEG Algorithm Algorithm Visualization
3
Musical Pitch and Key Pitch The perceived value of a tone, “Low” to “High” Psycho-acoustic (subjective) perception of Frequency Frequency (Hz) is a scientific measurement of period Key (Western music) Labels the “center” tone in a section of music Standard smallest interval: Semitone or “half-step” Standard pattern of semitones around “center” Ascending: 2,2,1,2,2,2,1
4
Human Perception of Pitch Limited range of perception Typically 20Hz – 20,000Hz Range tends to decrease with age Noticable Difference is coarser at low Hz Less distance (Hz) between lower sounds Around 1400 perceivable intervals Certain frequency distances sound relatively close Thirds, Fifths, Octaves
5
The Spiral Array Model
6
Helical Structure Toroidal across Octaves Distance in 3D model approximates perceived closeness between pitch Pitch, chord and key can all map to the same space
7
Chords in the Spiral Array Standard chords are based on three supporting tones Create Triangles in 3D relative to the model Triangles are effectively continuous, as pitch is Major and Minor chords’ centers thus form helixes
8
Key in the Spiral Array Simple keys are based on three supporting chords Creates triangles in 3D, based on supporting chords’ triangular centers Triangles are effectively continuous, as chords are Major and Minor keys’ centers thus form helixes
9
Center of Effect Center of Effect (CE) Relative location of a chord based on its supporting tones Notes of different strength change the CE location Complex chord CE’s will not line up exactly on the model
10
Center of Effect Generator (CEG) Key-Finding Center of Effect relates position of multiple pitches in model Spatially closest chord is most likely key Correlates input music to standard key structure
11
Helping Visualize the CEG Algorithm Keys exist as a triangle in 3-space Keys’ centers-of-effect make up two helixes in the 3D model In standard intonation, keys are discrete (12 minor, 12 major)
12
Helping Visualize the CEG Algorithm From a complex audio signal, weighted values are calculated for bins on each discrete tone The weighted values approximate the current key’s location on the model The spatially-closest key is the most likely match
13
CEG Key-Finding Algorithm Pitch detection Extract pitch class and strength from signal Key finding Nearest Neighbor Search in Spiral Array
14
Fast Fourier Transform Efficient algorithm to compute Discrete Fourier Transform O(n log n) vs O(n 2 ) Transforms function into its Frequency Domain representation Widely used across many fields Solving Partial Differential Equations Data Compression Polynomial Multiplication Spectral Analysis Frequency bands
15
Algorithm for Pitch Class/Strength from FFT For each frequency spectrum in a 0.37 second period: 1. For each frequency band find peak value 2. For each pitch-class, k, and its strength at time j: F jk, is the sum of all peak values for that frequency band (and others related by octaves) 3. Normalize 1. Divide all pitch-strength values by the largest: 2. Divide all pitch-strength values by their sum: (k = 0, 1, …, 11)
16
CEG Key-Finding Algorithm Pitch detection Extract pitch class and strength from signal Key finding Nearest Neighbor Search in Spiral Array
17
CEG Algorithm For pitch class and strength from each 0.37 seconds: 1. Assign pitch-names to pitch classes: 1. Generate CE for previous 5 seconds; and 2. Assign pitch-names to current pitch-classes by nearest neighbor search in Spiral Array Space 2. Determine Key based on pitch names: 1. Generate the cumulative CE from beginning to current 2. Perform nearest-neighbor search to find closest key
18
BIBLIOGRAPHY: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm Chuan, C. and Chew, E. IEEE International Conference on Multimedia & Expo 2005 Towards a Mathematical Model of Tonality Chew, E. Doctoral dissertation, MIT 2000 Questions?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.