Similarity Measurements in Chopin Mazurka Performances Craig Stuart Sapp C4DM Seminar, 11 July 2007 Queen Mary, University of London.

Slides:



Advertisements
Similar presentations
Digital Signal Processing
Advertisements

Recording-based performance analysis: Feature extraction in Chopin mazurkas Craig Sapp (Royal Holloway, Univ. of London) Andrew Earis (Royal College of.
Sonic Visualiser Tour CHARM Symposium 30 June 2006 Craig Stuart Sapp.
Introducing the Mazurka Project Nick Cook and Craig Stuart Sapp Music Informatics research group seminar School of Informatics, City University, London.
DFT/FFT and Wavelets ● Additive Synthesis demonstration (wave addition) ● Standard Definitions ● Computing the DFT and FFT ● Sine and cosine wave multiplication.
1 Two waves passing through the same region will superimpose - e.g. the displacements simply add Two pulses travelling in opposite directions will pass.
Chopin Analysis.
The frequency spectrum
1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In.
Mazurka Project Update Craig Stuart Sapp CHARM Symposium Kings College, University of London 26 January 2006.
SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.
IT-101 Section 001 Lecture #8 Introduction to Information Technology.
Reverse Conducting Slides Mazurka Project Presentation Materials Version: 14 November 2005.
Pitch Recognition with Wavelets Final Presentation by Stephen Geiger.
A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
1 Summary Statistics Excel Tutorial Using Excel to calculate summary statistics Prepared for SSAC by *David McAvity – The Evergreen State College* © The.
Key Detection In Musical Signals Philip Brown, ’07 Advisor: Dr. Shane Cotter.
1 Two waves passing through the same region will superimpose - e.g. the displacements simply add Two pulses travelling in opposite directions will pass.
Describing Data with Tables and Graphs.  A frequency distribution is a collection of observations produced by sorting observations into classes and showing.
CHARM Advisory Board Meeting Institute for Historical Research, School of Advanced Study, UL Senate House, London 12 Dec 2006 Craig Stuart Sapp Centre.
Harmonics, Timbre & The Frequency Domain
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.
Beat-level comparative performance analysis Craig Stuart Sapp CHARM Friday the 13 th of April 2007 CHARM Symposium 4: Methods for analysing recordings.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
NSW Curriculum and Learning Innovation Centre Tinker with Tinker Plots Elaine Watkins, Senior Curriculum Officer, Numeracy.
Comparative Analysis of Multiple Musical Performances Craig Stuart Sapp ISMIR; Vienna, Austria 27 September 2007.
Graphs of Frequency Distribution Introduction to Statistics Chapter 2 Jan 21, 2010 Class #2.
Standard #1: Write an Algebraic Expression from a word problem. Text Section: 1.1.
Performance Authenticity: A Case Study of the Concert Artist Label Association of Recorded Sound Collections Conference Stanford University 28 March 2008.
The Mazurka Project Craig Stuart Sapp Centre for the History and Analysis of Recorded Music Royal Holloway, Univ. of London Universiteit van Amsterdam.
Applications Statistical Graphical Models in Music Informatics Yushen Han Feb I548 Presentation.
Covariance and correlation
Paper by Craig Stuart Sapp 2007 & 2008 Presented by Salehe Erfanian Ebadi QMUL ELE021/ELED021/ELEM021 5 March 2012.
15.1 Properties of Sound  If you could see atoms, the difference between high and low pressure is not as great.  The image below is exaggerated to show.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
CS 6825: Binary Image Processing – binary blob metrics
Dynamic Range and Dynamic Range Processors
Digital Image Processing CCS331 Relationships of Pixel 1.
Unit 9: Probability, Statistics and Percents Section 1: Relative Frequency and Probability The frequency of something is how often it happens Relative.
CHARM Symposium Kings College London 6 January 2006 The Mazurka Project.
Extracting Melody Lines from Complex Audio Jana Eggink Supervisor: Guy J. Brown University of Sheffield {j.eggink
1 Introduction to Information Technology LECTURE 6 AUDIO AS INFORMATION IT 101 – Section 3 Spring, 2005.
Mazurka Project 29 June 2006 CHARM Symposium Craig Stuart Sapp.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
CHAPTER 2 SECTION 2-1 PATTERNS AND ITERATIONS SEQUENCE An arrangement of numbers in a particular order. The numbers are called terms and the pattern.
The Mazurka Project Science and Music Seminar University of Cambridge 28 Nov 2006 Craig Stuart Sapp Centre for the History and Analysis of Recorded Music.
Things to Consider When Writing Melodies Vital Elements  Two most vital elements - rhythm and melody.  Harmonic structure of your composition will.
Computational Performance Analysis Craig Stuart Sapp CHARM 25 April 2007 CCRMA Colloquium.
Track Color Coding Horizontal color bars on channel strips in the mix window Vertical color bars to the left of each track in the edit window Helpful in.
int [] scores = new int [10];
BASIC INSTRUMENTS - oscilloscopes
Info Read SEGY Wavelet estimation New Project Correlate near offset far offset Display Well Tie Elog Strata Geoview Hampson-Russell References Create New.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
CLASSIFICATION OF ECG SIGNAL USING WAVELET ANALYSIS
BNAD 276: Statistical Inference in Management Winter, Green sheet Seating Chart.
Screen Stage Lecturer’s desk Gallagher Theater Row A Row A Row A Row B
Electromagnetism lab project
Travelling to School.
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Sampling rate conversion by a rational factor
Mark Sayles, Ian M. Winter  Neuron 
Patterns – Learning Outcomes
Common Core Math I Unit 2: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Trajectory Encoding in the Hippocampus and Entorhinal Cortex
Measuring the Similarity of Rhythmic Patterns
Presentation transcript:

Similarity Measurements in Chopin Mazurka Performances Craig Stuart Sapp C4DM Seminar, 11 July 2007 Queen Mary, University of London

Mazurka Project 2,210 recordings of 49 mazurkas 105 performers on 160 CDs, 98 hours of music Earliest is 1907 Pachmann performance of 50/2 = 45 performances/mazurka on average least: 31 performances of 41/3 most: 64 performances of 63/3 number of mazurka performances in each decade Number of performance by decade Performers of mazurka 63/3:

Expressive Audio Features (Piano) Note Attack Note Loudness Note Release Not much else a pianist can control String instruments have more control variables Voice has even more…

Expressive Performance Features Note Attack Note Loudness Note Release beat durations offbeat timings tempo individual notes articulation hand synchrony dynamics articulation (accents) meter phrasing articulation (staccato, legato) pedaling meter ornamentation

Data Extraction (1) Extracting beat times from audio files Using Sonic Visualiser for data entry processing

Data Extraction (2) Step 1: Listen to music and tap to beats (; key) Notice taps do not fall on the audio attacks: ms hardware granularity built into program Human: ~30 ms SD for constant tempo; ~80 ms SD for mazurkas

Data Extraction (3) Step 2: Add onset detection function to the display Mz SpectralReflux plugin for Sonic Visualiser: (Linux & Windows)

Data Extraction (4) Step 3: Estimate onset times from function peaks Send taps (green) and onsets (orange) to external program: (no interlayer processing plugins for Sonic Visualiser yet…)

Data Extraction (5) Step 4: Load snaptap results into SV (purple): Mz SpectralReflux currently sentitive to noise (old recordings) so snaptap only works on clean recordings.

Data Extraction (6) Step 5: Correct errors Hardest part of data entry. Down to ~30 min/mazurka for clean recordings. 278 performances (of ~6 mazurkas) at this stage.

Well-Behaved Mz HarmonicSpectrogram poor low freq resolution… LHRH LH (Mohovich 1999) pickup longer than 16 th LH before RH triplets played same speed as sextuplets first sextuplet note longest

Misbehaved (Risler 1920) cc:url;cyc LHRH lots of false positives lots of noise/clicks pickup longer than 16 th LH before RH triplets played same speed as sextuplets first sextuplet note longest LHRH

Extracted Feature Data Beat times durations or tempos Audio amplitude at beat locations (seconds) (BPM) (~dB) Further feature extraction by Andrew Earis (note timings/dynamics)

Beat-Tempo Graphs average

Beat-Dynamics Graphs average

Pearson Correlation Example: x = (1,3,4,2,7,4,2,3,1) y = (4,4,3,5,5,6,4,3,2) r = x = 3 y = 4 (average of y) (average of x)

Shape Matching

Correlation & Fourier Analysis spectrumsignalsinusoids = spectrum, indexed by k (frequency) = signal, indexed by n (time) = set of k complex sinusoids indexed by n correlation: multiply & sum

Correlation & Fourier Analysis (2) Let k n Then the DFT can then be written as: 44.1 kHz srate with N = Hz 129 Hz 86 Hz 43 Hz 172 Hz 215 Hz 301 Hz 258 Hz …

Performance Tempo Correlations Biret Brailowsky Chiu Friere Indjic Luisada Rubinstein 1938 Rubinstein 1966 Smith Uninsky BiLuBrChFlInR8R6SmUn Highest correlation to Biret Lowest correlation to Biret

Correlation Maps – Nearest Neighbor Draw one line connecting each performance to its closest correlation match Correlating to the entire performance length. Mazurka 63/3

Absolute Correlation Map Average

Scape Plotting Domain 1-D data sequences chopped up to form a 2-D plot Example of a composition with 6 beats at tempos A, B, C, D, E, and F: original sequence:  time axis  start end  analysis window size 

Scape Plotting Example Averaging in each cell with base sequence (7,8,2,5,8,4): ( ) 5 = 25 5 = 5 average of all 6 numbers at bottom average of last 5 numbers (8+4) 2 = 12 2 =6

Average-Tempo Timescapes average tempo of entire performance average for performance slower faster phrases Mazurka in F maj. Op. 68, No. 3 Chiu 1999

Arch Correlation Scapes 2-bar arches8-bar arches like wavelet spectrogram 24-bar arch 8-bar arches

Composite Features low frequencies high frequencies Phrasing Accentuation ritards, accelerandos mazurka meter multiple performance features embedded in data Using exponential smoothing: apply twice: forwards & backwards to remove group delay effects

Binary Correlation Scapes window size (in measures) Yaroshinsky 2005 Ashkenazy 1982 Yaroshinsky 2005 Jonas 1947 white = high correlationblack = low correlation mazurka 30/2 8

Scapes Are 3D Plots 2D plotting domain + 1D plotting range

Scapes of Multiple Performances Yaroshinsky/Freneczy Yaroshinsky/RangellYaroshinsky/Rubinstein Yaroshinsky/ Milkina Yaroshinsky/ Friedman 1.correlate one performance to all others – assign unique color to each comparison. 2. superimpose all analyses. 3. look down from above at highest peaks

Perormance Correlation Scapes Who is most similar to a particular performer at any given region in the music? mazurka.org.uk/ana/pcor mazurka.org.uk/ana/pcor-gbdyn

Maps and Scapes Correlation maps give gross detail, like a real map: Correlation scapes give local details, like a photograph: map points are tops of scapes

Map and Photos of the Forest Correlation network Mazurka in A minor, 17/4 scenic photographs at each performance location in forest map = tops of triangles

Boring Timescape Pictures The same performance by Magaloff on two different CD re-releases: Philips Philips /29-2 Structures at bottoms due to errors in beat extraction, measuring limits in beat extraction, and correlation graininess. mazurka 17/4 in A minor Occasionally we get over-exposed photographs back from the store, and we usually have to throw them in the waste bin.

Boring Timescape Pictures? mazurka 17/4 in A minor Calliope 3321 Concert Artist Two difference performances from two different performers on two different record labels from two different countries. Hatto 1997 Indjic 1988 see:

Beat-Event Timing Differences Hatto beat location times: 0.853, 1.475, 2.049, 2.647, 3.278, etc. Indjic beat location times: 0.588, 1.208, 1.788, 2.408, 3.018, etc. beat number Hatto / Indjic beat time deviations deviation [seconds] remove 0.7% timeshift difference plot

googol1 in Timing Difference Probability beat number Hatto / Indjic beat time deviations deviation [seconds] 1 : : beats probability that same performer can produce a second performance so closely is equivalent to one atom out of an entire star.

Same Performer Over Time Mazurka in B minor 30/2 Tsong 1993 Tsong 2005

Including the Average Mazurka in B minor 30/2

Mutual Best Matches Clidat 1994 Tsong 2005 Shebanova 2002 Average Mazurka in B minor 30/2

Significance of Similarity Ts’ong 2005 performance matches best to Shebanova 2002 in 3 phrases when comparing 36 performances of mazurka 30/2. Is this a coincidence or not? Could ask the pianist (but might be problem in suggesting an answer beforehand). Also they might not remember or be totally conscious of the borrowing (such as accents in language). Or there could be a third performer between them. Ideally a model would be used to calculate a probability of significance. Mazurka in B minor 30/2

Phrenology

Significant or Not? Example of non-significant matches (or at least very weak) (Mazurka 30/2) phrase level fragmented boundaries inverted triangles foreground / background (bottom / top) don’t match – no continuity in scope. Example of possibly significant matches repetition of performer vertical structure uniform top with roots at bottom level Match to same performer always significant

Purely Random Matching Plot has to show some match at all points… Many performances are equidistant to Uninsky performance (mazurka 30/2) two white-noise sequences

Including Average Performance helps but does not solve significance question (mazurka 30/2)

What Is Significant? No direct link found on web between Brailowsky and Luisada (such as teacher/student). Strong match in Brailowsky to Luisada probably due to large amount of mazurka metric pattern: Phrasing shapes very different, so match is both significant (high frequencies match) and not significant (low frequencies don’t match). Luisada Brailowsky

Best Matching Not Mutual Looking at both performers pictures helps with “significance” ?

Performance Map Schematic Luisada Biret Brailowsky Average Luisada: Brailowsky: Brailowsky has the strongest mazurka meter pattern Luisada has the second strongest mazurka meter pattern another performer PCA

Falvay 1989 Tsujii 2005 Nezu 2005 Poblocka 1999 Shebanova 2002 Strong Interpretive Influences Mazurka in C Major 24/2 note parallel colors

Performance Ranking Given a reference performance, -- which other performance matches best -- which other performance matches second best … -- which other performance matches worst

0 th -Order Similarity Rank Use large-scale correlation to order performances similarity to a target performance rankcorrelation target mazurka 30/2 best match worst match } synthetic noise Performance maps used rank #1 data:

1 st -Order Scape Rank Area represented by best choice in scape plot. Hatto effect causes problems: Who is #2 for Indjic? (mazurka 68/3)

Scape Layers Hatto takes up nearly 100% of Indjic’s scape So remove Hatto layer…

Peeling the Layers remove Hatto remove Average remove Gierzod remove Fliere remove Uninsky remove Friedman remove Osinksa remove Czerny-Stefanska

2 nd -Order Scape Rank Rank score = area represented by best choice in scape plot, but peel away previous best choices. performance0-rank 1-rank2-rank Still slightly sensitive to the Hatto effect (mazurka 30/2)

3 rd -Order Scape Rank Start with the 2 nd -order rankings, selecting a cutoff point in the rankings to define a background noise level in the scape. Then one-by-one add each of the non-noise performances into the scape along with the background noise performances. Measure the area covered by the non-noise performance (only one non-noise performance at a time).

3 rd -Order Rankings target same performer 50% noise floor (2-rankings below “noise floor”)

Proportional Noise 0.72 (= 3-rank noise floor) Gradually add noise to target 50% noise moved target to noise floor

Real Data (1) Ash81 Ash82 Bir Ble Bra Chi Cli Cor Fer Fio Fli Fra Hat Ind Jon Lui Lus Mag Mic Mil Moh Ole Pob Ran Rub39 Rub66 She Smi Sof Tso93 Tso05 Uni Yar Avg Mazurka 30/2 Target: Rubinstein rank score (correlation) 3-rank score Avg = 8% noise equivalent same performer ~12% noise equivalent

Real Data (2) Ash81 Ash82 Bir Ble Bra Chi Cli Cor Fer Fio Fli Fra Hat Ind Jon Lui Lus Mag Mic Mil Moh Ole Pob Ran Rub39 Rub66 She Smi Sof Tso93 Tso05 Uni Yar Avg Mazurka 30/2 Target: Rubinstein rank score (correlation) 3-rank score

Distance to the Noise Floor Metric for measuring individuality of interpretation?, 55% noise 38% noise, 3-Rank scores more absolute than correlation values 3-Rank scores less sensitive to local extrema noise floor is always about a 3-rank score of 10%

Cortot Performance Ranking Master class recording contains 48 out of 64 measures (75%) Con. Artists Rankings 0-Rank: 1.Average 2.Rangell 01 3.Milkina 70 4.Mohovich 99 5.Shebanova 02 … 32. Masterclass Masterclass Rankings 0-Rank: 1.Poblocka 99 2.Average 3.Rubinstein 52 4.Tsong 93 5.Tsong 05 … 33. Con. Artist 3-Rank 1.Average 2.Rubinstein 52 3.Luisada 90 4.Poblocka 99 5.Hatto 94 … 35. Con. Artist 3-Rank 1.Average 2.Rangell 01 3.Mohovich 99 4.Rubinstein 39 5.Milkina 70 … 31. Masterclass (comparing 35 performances + average) Match to other Cortot near bottom of rankings. Match to other Cortot near bottom of rankings.