Search-Effectiveness Measures for Symbolic Music Queries in Very Large Databases Craig Stuart Sapp Yi-Wen Liu

Slides:

Advertisements

Similar presentations

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.

Advertisements

G. Alonso, D. Kossmann Systems Group

Gapped Blast and PSI BLAST Basic Local Alignment Search Tool ~Sean Boyle Basic Local Alignment Search Tool ~Sean Boyle.

Melodic Organization Chapter 6. Motive Short melodic and/or rhythmic pattern Usually only a few beats Recurs throughout a piece or section Unifying element.

Measures of Information Hartley defined the first information measure: –H = n log s –n is the length of the message and s is the number of possible values.

Evaluating Search Engine

Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol

Heuristic alignment algorithms and cost matrices

Topics: Inferential Statistics

Simple Linear Regression

Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.

Sample size computations Petter Mostad

Chapter 13 Introduction to Linear Regression and Correlation Analysis

ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?

Intro to Statistics for the Behavioral Sciences PSYC 1900

Similar Sequence Similar Function Charles Yan Spring 2006.

PSY 307 – Statistics for the Behavioral Sciences Chapter 8 – The Normal Curve, Sample vs Population, and Probability.

PSY 307 – Statistics for the Behavioral Sciences Chapter 8 – The Normal Curve, Sample vs Population, and Probability.

Data Acquisition Chapter 2. Data Acquisition 1 st step: get data 1 st step: get data – Usually data gathered by some geophysical device – Most surveys.

Chemometrics Method comparison

Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.

Entropy and some applications in image processing Neucimar J. Leite Institute of Computing

© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.

Polyphonic Queries A Review of Recent Research by Cory Mckay.

One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.

Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.

Paper by Craig Stuart Sapp 2007 & 2008 Presented by Salehe Erfanian Ebadi QMUL ELE021/ELED021/ELEM021 5 March 2012.

Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.

HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.

Inference for a Single Population Proportion (p).

Unit 4: Modeling Topic 6: Least Squares Method April 1, 2003.

Confidence Interval Estimation

PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.

Measurement Uncertainties and Inconsistencies Dr. Richard Young Optronic Laboratories, Inc.

Efficient Minimal Perfect Hash Language Models David Guthrie, Mark Hepple, Wei Liu University of Sheffield.

Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.

Physics 114: Exam 2 Review Lectures 11-16

Analysis of Algorithms

Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.

Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.

Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.

Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.

Melodic Organization Motive Rhythmic Motive Melodic Motive

The Semivariogram in Remote Sensing: An Introduction P. J. Curran, Remote Sensing of Environment 24: (1988). Presented by Dahl Winters Geog 577,

Melodic Search: Strategies and Formats CS 275B/Music 254.

11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.

CHAPTER 5 SIGNAL SPACE ANALYSIS

Melodic Similarity Presenter: Greg Eustace. Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation.

Chapter 8: Simple Linear Regression Yang Zhenlin.

Financial Risk Management of Insurance Enterprises

OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.

1 What happens to the location estimator if we minimize with a power other that 2? Robert J. Blodgett Statistic Seminar - March 13, 2008.

Computational Biology, Part 3 Representing and Finding Sequence Features using Frequency Matrices Robert F. Murphy Copyright  All rights reserved.

Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.

Maximum likelihood estimators Example: Random data X i drawn from a Poisson distribution with unknown  We want to determine  For any assumed value of.

Quantitative methods and R – (2) LING115 December 2, 2009.

Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.

Musical Similarity: More perspectives and compound techniques

Financial Risk Management of Insurance Enterprises

Statistical Methods For Engineers

Memory and Melodic Density : A Model for Melody Segmentation

Lesson Comparing Two Means.

Basic Training for Statistical Process Control

Basic Training for Statistical Process Control

Feifei Li, Ching Chang, George Kollios, Azer Bestavros

Quantitative Reasoning

Craig Stuart Sapp Yi-Wen Liu Eleanor Selfridge-Field ISMIR 2004

8.3 Estimating a Population Mean

Presentation transcript:

Search-Effectiveness Measures for Symbolic Music Queries in Very Large Databases Craig Stuart Sapp Yi-Wen Liu Eleanor Selfridge-Field ISMIR October 2004 Barcelona, Spain Universitat Pompeu Fabra

Introduction Match-Count Profiles

pitch name 12-tone pitch 12-tone interval musical interval scale degree pitch refined contour pitch gross contour 35 states 12 70/octave 24/octave Musical Features We examined search characteristics of 14 musical features: 7 Rhythm features: (3 duration & 4 metric) 1. duration 2. duration gross contour 3. duration refined contour 7 Pitch features ( examples below ) 4. beat level 5. metric level 6. metric gross contour 7. metric refined contour + (3) (5) (3) (5) (2) (10~14)(37~74) How do all these different features affect searching in a database?

Anchored vs. Unanchored Searches Two types of search methods, Examples: search only from the start of a database entries search starting at any position in database entries

Example Feature Searches FeatureQueryAnchored Matches Unanchored Matches pchF A C4641,710 12p ,710 mi+M3 +m31,9246,882 12i+4 +31,9256,894 sd1 3 52,0097,024 prcU 4,67717,712 pgcU 19,78776,865 pitch name 12-tone pitch musical interval 12-tone interval scale degree pitch refined contour pitch gross contour (in Themefinder) Searching a database of 100,000 melodic incipits/themes

Raw Data Extraction Now plot measurements as a “match-count profile” 10,585 matches 1,351 matches 464 matches 161 matches 83 matches 21 matches 12 matches target incipit: generated database queries: (anchored) x-axis: query length y-axis: match count (log scale) Query length=1 length=2 length=3 length=4 length=5 length=7 length=6

Individual Match-Count Profile Query length 12-tone interval features: Anchored and Unanchored searches merge at length = 8 target incipit : Unique match found at length = 10

Interesting Query Lengths Query length TTU = length of query yielding unique match target incipit: TTS = length giving matches under limit size e.g., first search-length which gives fewer than 10 matches How long query length must be to generate a sufficiently small set of matches

Average Match-Count Profiles Average all target profiles over entire database:

Average Match-Count Profiles Average all target profiles over entire database:

Anchored/Unanchored Profile Slopes Synthetic DatabaseReal Database Anchored Searching: O(log N)Unanchored Searching: O(N 2 ) Anchored/Unanchored slopes not much different. Anchored searching is much faster.

Match-Count Profiles for Pitch Features (All dataset) Log 2 matches Query length 18.5 notes/theme avg. Steeper initial slope = more descriptive feature Twelve-tone pitch and full pitch spelling features are very identical (orange curve) Absolute twelve tone pitch and relative twelve-tone interval are close. 7-symbol scale degree features close to 5-symbol refined pitch contour. 3-symbol pitch gross contour more descriptive than 3-symbol duration gross contour.

51 notes/song avg. Rhythm feature curves more crooked. TTS for rhythm twice as long than pitch TTS. TTS for gross metric descriptions 5 times as long as pitch TTS values. Match-Count Profiles for All Features Phrase/meter effects?

Four Applications of Profiles: Entropy & Entropy Rate Match Count Predictions Joint Feature Analysis Synthetic Database Analysis

Entropy entropy definition: Entropy measures basic information content of a musical feature also called “Shannon Entropy” Entropy (bits/symbol)Normalized probability distribution 3.4 bits/note is the minimum symbol storage size needed to store sequences of 12-tone intervals (Folksong data set). Example calculation: bits/pitch or “First-order Entropy”

Entropy Rate Real music features are related to surrounding musical context. Entropy rate (bits/symbol) N=Sequence length entropy-rate definition: also called “Average Entropy” “Nth-order” entropy Entropy is a contextless (memoryless) measure. Average entropy (entropy-rate) is more informative: bits/symbol Entropy & entropy rate for various repertories: (12p features) Note:

Entropy-Rate Estimation from TTS Entropy characterizes the minimum possible average TTS. Entropy-rate characterizes the actual average TTS. M = database size

Applications of Profiles Entropy & Entropy Rate Match Count Predictions Joint Feature Analysis Synthetic Database Analysis

Joint Feature Analysis How independent/dependent are pitch and rhythm features? What is the effect of searching pitch and rhythm features in parallel? Pitch + Rhythm Analyze as a combined feature

Mutual Information H(a)H(a) H(b)H(b) joint entropy H(a,b) conditional entropy H(a|b) = H(a,b) – H(b) H(b|a) = H(a,b) – H(a) mutual information I(a;b) = H(a) + H(b) – H(a,b) e.g., pitch e.g., rhythm Measurement of the correlation of two types of features

Combining Pitch and Rhythm Searches H(pgc) = H(rgc) = H(pgc, rgc) = I(pgc; rgc) = H(pgc) + H(rgc) – H(pgc,rgc) = Pitch and Rhythm are very independent features. Therefore, combining independent search features should be effective. less than two orders of magnitude interaction Individual Entropies: Joint Entropy: Mutual Information: (at least for pgc+rgc averaged over entire database)

Joint Feature Profiles pgc and rgc are generic features less prone to query errors. 3*3 states work as well as 88 twelve-tone interval states. (All dataset) Log 2 matches Query length for pgc/rgc vs. twelve-tone interval searching

(All dataset) Log 2 matches Query length Joint Feature Match Count Profiles Single Feature Match Count Profiles Log 2 matches Joint Feature Search Effectiveness

(All dataset) Log 2 matches Joint Feature Match Count Profiles Single Feature Match Count Profiles Log 2 matches Joint Feature Search Effectiveness TTS: TTS: 4 - 6

Applications of Profiles Entropy & Entropy Rate Match Count Predictions Joint Feature Analysis Synthetic Database Analysis

Expectation Function Example: Consider a database of “best 3 out of 5” Heads/Tails coin flips: H H T H T T H T T H H T T H H T T T T H H H H H H Likelyhood starting sequence is “H”: 50% Likelyhood starting sequence is “H T”: 25% Likelyhood starting sequence is “H H ”: 25% database size Expected match counts for an n-length query (H = measured entropy rate) Entropy Rate can be used to predict the number of matches: Entropy Rate = Entropy = log 2 2 = 1 bit/symbol E(1) = M/2 1 = M/2 Therefore R = 2 log2 = 2 1 = 2 E(2) = M/2 2 = M/4

Match-Count Profile Constraint The match-count profile queries are constructed from database entries. large nsmall n dominates the curve at large n slope is the entropy rate of a search feature (at small n) dominates the curve at small n Therefore at least one match is always expected. Steal this guaranteed match from M, and add as a constant to the expectation function: How to get rid of curvature caused by constant +1 term?

Match-Count and Derivative Profile Comparison Match-Count Profile expectation function: Derivative Match-Count Profile initial slope of both profiles is the entropy rate derivative profile maintains entropy-rate slope for larger query lengths To measure the entropy rate of small databases, you would need to use the derivative plot since the +1 term would be two powerful What about ?

“Match-Count Profile” Expectation Plot Functions Using unanchored rhythmic features for Essen-full “Derivative Profile” Removes +1 curvature, but sensitive to duplicate entries in the database Removes +1 curvature and not sensitive to duplicate entries in the database. Best method for measuring entropy-rate “Target-Exclusion Profile” E(n) - E(n+1) E(n)E(n) E(n) - 1

Applications of Profiles Entropy & Entropy Rate Match Count Predictions Joint Feature Analysis Synthetic Database Analysis

Synthetic vs. Real Database Profiles (Entropy) E(n) E(n) – E(n+1) “database similarity” no change in slope due to duplicate entries in database synthetic databases will not curve Uniform random data Weighted Random Markov process generated data Real data Legend: Based on real data probability distribution.

Effects of Duplicate Entries on Profiles E(n) and E(n)-1 profiles can be used to measure amount of duplication in database Duplicate entries in the database do not have a significant effect on entropy-rate measurements: E(n) – E(n+1) E(n) E(n) - 1 very sensitive to detecting duplication E(n) – E(n+1) removes effect of duplicate entries entirely.

Effect of Incipit Length on Profiles Derivative Profile shorter incipits cause quantization noise in low match-count region slope at long query lengths is artificially increased when incipits are too short. How short is is too short for a musical incipit? An incipit a short initial excerpt from a full composition

Search-Effectiveness Measures for Symbolic Music Queries in Very Large Databases Craig Stuart Sapp Yi-Wen Liu Eleanor Selfridge-Field ISMIR October 2004 Barcelona, Spain Universitat Pompeu Fabra

Extra Slides:

Summary Interesting metrics for analyzing the effectiveness of search features: Match-Count Profiles: Examines match characteristics of a musical feature for longer and longer queries. Entropy Rate: Characterizes match count profiles well with a single number. Useful for predicting the expected average number of matches for a given length query. TTS: The number of symbols in query necessary to generate a sufficiently small number of matches (average). TTU not as useful due to noise.

Proof for Derivative Plots (algebra manipulation) plotting on a log scale, so take the log of both sides: Let:and so the equation becomes: (expectation function for Match-Count Profiles) since Let: which is a line with a slope proportional to the entropy (rate) slope (subtract n and n+1 values of E( ) to cancel +1 term)

Derivative Plots for 12i features Vocal music tending to lower entropy rates Luxembourg set has most predictable interval sequences. Latin Motets (vocal) have highest entropy-rate for twelve-tone intervals.

Themefinder Website

Themefinder Collections Data set Count Web Interface Classical10,718 themefinder.org Folksong8,473 themefinder.org Renaissance18,946 latinmotet.themefinder.org US RISM A/II55,490 Polish6,060 Luxembourg612 lux.themefinder.org 100,299 total:

Matches on First Seven Notes x4 x2 A. C. D. E. F. G. H. B.

Entropy and Entropy Rate for various repertories in the Themefinder database (12p features) bits/symbol Entropy rate less than or equal to the Entropy

Search Failure Rates Average note count/incipit: 16 Database size: 100,299 Plot measures how often a search produces too many matches for query sequences as long as the database entry.

Time To Uniqueness Query length TTU = the number of query symbols needed to find the exact match in the database. Turns out to not be very useful since it is more susceptible to noise in the data. target match:

Effect of Incipit Length on Profiles Derivative Curve shorter incipits cause quantization noise in low match-count region slope at long query lengths is artificially increased when incipits are too short.

3.4 bits/note is the lower symbol storage size limit needed to store sequences of 12-tone intervals (Folksong data set). Entropy can be used as a basic estimate for how many notes are necessary to find a unique/sufficient match in the database, but... normalized probability Probability Distributions

Expectation Function = database size = average expected match counts for an n-length query where H is the entropy rate of the feature being searched for (Entropy rate is assumed to be constant) In general: For example, consider sequences created with a uniform random distribution of three states (the next symbol in the sequence is equally likely to be any of the three states). Then, the entropy of the sequence is: which makes and the formula for the expected match counts becomes: then 1/3 of the database entries should be matched with a one- length query on the average: and a length-two query should return 1/9 of the database on the average:

Joint Pitch/Rhythm Effects on TTS Classical datasetChinese Folksongs dataset TTS Adding rgc to pitch features usually reduces the search length by 2 notes. Combining rgc and pgc reduces search length by 4 notes.