Melodic Similarity Presenter: Greg Eustace. Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation.

Slides:



Advertisements
Similar presentations
Longest Common Subsequence
Advertisements

Melodic Organization Chapter 6. Motive Short melodic and/or rhythmic pattern Usually only a few beats Recurs throughout a piece or section Unifying element.
Music Perception. Why music perception? 1. Found in all cultures - listening to music is a universal activity. 2. Interesting from a developmental point.
Computational Linguistic Techniques Applied to Drugname Matching Bonnie J. Dorr, University of Maryland Greg Kondrak, University of Alberta June 26, 2003.
Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.
1 Suffix Trees and Suffix Arrays Modern Information Retrieval by R. Baeza-Yates and B. Ribeiro-Neto Addison-Wesley, (Chapter 8)
A.Diederich – International University Bremen – USC – MMM – Spring 2005 Rhythm and timing  Clarke, E.F. Rhythm and timing in music. In Deutsch, D. Chapter.
Information Retrieval in Practice
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 12: Sequence Analysis Martin Russell.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez.
Whole Genome Alignment using Multithreaded Parallel Implementation Hyma S Murthy CMSC 838 Presentation.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez June 23, 2005.
Data Quality Class 7. Agenda Record Linkage Data Cleansing.
Distance Functions for Sequence Data and Time Series
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez June 23, 2004.
T.Sharon 1 Internet Resources Discovery (IRD) Music IR.
Based on Slides by D. Gunopulos (UCR)
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez May 20, 2003.
1Ellen L. Walker Matching Find a smaller image in a larger image Applications Find object / pattern of interest in a larger picture Identify moving objects.
INTRODUCTION TO DATA STRUCTURE. Topics To Be Discussed………………………. Meaning of Data Structure Classification of Data Structure Data Structure Operations.
Modern Information Retrieval Chapter 4 Query Languages.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez May 10, 2005.
Shape Classification Using the Inner-Distance Haibin Ling David W. Jacobs IEEE TRANSACTION ON PATTERN ANAYSIS AND MACHINE INTELLIGENCE FEBRUARY 2007.
Music Indexing and Retrieval for Multimedia Digital Libraries King Fahd University of Petroleum and Minerals Information and Computer Science Department.
1 Theory I Algorithm Design and Analysis (11 - Edit distance and approximate string matching) Prof. Dr. Th. Ottmann.
Overview of Search Engines
Pitch Pitch can be described as being how high or low the sound is heard. Pitch is determined by the speed or frequency of the vibration which is causing.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
The Nuts & Bolts of Music
Developing Pairwise Sequence Alignment Algorithms
Polyphonic Queries A Review of Recent Research by Cory Mckay.
Space-Efficient Sequence Alignment Space-Efficient Sequence Alignment Bioinformatics 202 University of California, San Diego Lecture Notes No. 7 Dr. Pavel.
A Time Based Approach to Musical Pattern Discovery in Polyphonic Music Tamar Berman Graduate School of Library and Information Science University of Illinois.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
1 Music Classification Using Significant Repeating Patterns Chang-Rong Lin, Ning-Han Liu, Yi-Hung Wu, Arbee L.P. Chen, Proc. of 9th International Conference,
Audio Scene Analysis and Music Cognitive Elements of Music Listening
1 Melodic Similarity MUMT 611, March 2005 Assignment 4 Paul Kolesnik.
7 -1 Chapter 7 Dynamic Programming Fibonacci sequence Fibonacci sequence: 0, 1, 1, 2, 3, 5, 8, 13, 21, … F i = i if i  1 F i = F i-1 + F i-2 if.
Sequence Analysis CSC 487/687 Introduction to computing for Bioinformatics.
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
Melodic Search: Strategies and Formats CS 275B/Music 254.
A survey of different shape analysis techniques 1 A Survey of Different Shape Analysis Techniques -- Huang Nan.
AURAL SKILLS ASSESSMENT TASK 2 Question 2 THE CONCEPTS OF MUSIC General Knowledge.
The 4 Parameters of Sound PITCH = the frequency of vibration (heard as “high” vs. “low”) DURATION = the length of time a sound lasts (heard as aspects.
Ravello, Settembre 2003Indexing Structures for Approximate String Matching Alessandra Gabriele Filippo Mignosi Antonio Restivo Marinella Sciortino.
Essential Music Vocabulary Review. Directions: Identify the music vocabulary word that matches the given definition. Students will be called at random.
Using Transportation Distances for Measuring Melodic Similarity Pichaya Tappayuthpijarn Qiang Wang.
Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16.
Fundamentals, Design, and Implementation, 9/e Appendix B The Semantic Object Model.
The Elements of Music “Student Selected Piece of Music”
 6 th Musical Literacy 1.1 All students will be able to use a steady tone when performing.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
Audio Scene Analysis and Music Cognitive Elements of Music Listening Kevin D. Donohue Databeam Professor Electrical and Computer Engineering University.
Tree structured and combined methods for comparing metered polyphonic music Kjell Lëmstrom David Rizo Valero José Manuel Iñesta CMMR’08 May 21, 2008.
Information Retrieval in Practice
Carmine Casciato MUMT 611 Thursday, March 13, 2005
Music Matching Speaker : 黃茂政 指導教授 : 陳嘉琳 博士.
CS 430: Information Discovery
Distance Functions for Sequence Data and Time Series
Relational Algebra Chapter 4, Part A
Carmine Casciato MUMT 611 Thursday, March 13, 2005
Sequence Alignment 11/24/2018.
Computer Vision Lecture 16: Texture II
Relational Algebra Chapter 4, Sections 4.1 – 4.2
Integrating Segmentation and Similarity in Melodic Analysis
Scale-Space Representation for Matching of 3D Models
MUMT 611, March 2005 Assignment 4 Paul Kolesnik
Chapter 17 Designing Databases
Presentation transcript:

Melodic Similarity Presenter: Greg Eustace

Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation Weighting Pre-filtering Algorithms for measuring melodic similarity –Sequential representations –2-D and geometric representations

Defining melody Melody: a monophonic succession of tones characterized by information pertaining to pitch and rhythm. Such information contributes to a melody’s perceived shape. In polyphonic music the melody represents the dominant tune. A melody is an efficient and robust construct. Levitin describes a melody as “an auditory object that maintains its identity under certain transformations... along the six dimensions of pitch tempo, timbre, loudness, spatial location and reverberant environment, sometimes with changes in rhythm but rarely with changes in contour.” Melodic similarity is subjective.

Melodic similarity & applications Smith, McNab and Witten defined similarity as “the complexity of the transformation process involved in mapping one object onto the other”. For algorithms which measure melodic similarity, typically a short query melody is compared with a larger melody (perhaps one of many in a database). The general problem requires algorithms that can provide a measure of this complexity, where the minimum value represents the best match. Applications –Academic research, compositional exploration, memory aid, etc. –Copyright checks –Measures of melodic similarity are integral to so called query by humming algorithms.

Choosing the level of representation Absolute pitch –MIDI note numbers –Hewlett’s base-40 system, which represents enharmonic spellings of notes (40 per octave). Relative pitch (i.e. intervals) –Intervallic representation are transposition invariant –Related representation: modulo interval technique Melodic contour –A note instance is characterized by it’s relationship to the proceeding note. –Three common representations include: up, down and same. –Other representations group ascending and descending intervals into two or more categories. –Consistent with perception

Choosing the level of representation Other representations are based on I/R (implication/realization) structures are based on Narmour's theory of perception and cognition of melodies. Some studies consider scalar vs. non-scalar tones Rhythmic information is less often considered. Relevant aspects include note durations, rest positions and time signatures, marked by stress (e.g. dynamic articulation).

Weighting & pre-filtering Weighting: –Notes may be weighted according to perceptual importance. –Note duration is the most common measure of importance (e.g. smaller notes receive less weighting). –Another example is weighting by stress. Pre-filtering: –Melody extraction techniques are required in order to separate the melody from all other notes in the piece. –Standardization may be required to eliminate performance specific characteristics.

Measuring melodic similarity: Sequence Matching Sequence matching compares sequences of numbers representing melodies Used in speech recognition, biological sequence analysis and text information retrieval. Common problems: Computational inefficiency and perceptual inconsistency.

Sequence Matching: Edit Distance The edit distance (or Levenshtein distance) between two sequences is defined as “the minimum total cost of transforming one sequence (the source sequence) into the other (the target) sequence given a set of allow edit operations and cost function that defines the cost of each edit operation.” Typical edit operations: –Insertion: involves adding an element to the target sequence. –Deletion: involve removing an element from the source sequence. –Replacement: involves replacing an element in the source sequence with an element from the target sequence.

Sequence Matching: Edit Distance Take character strings as a simple example: cats - mat Deleting one letter gives: cat – mat Replacing one letter gives: mat – mat Other (less common) operations: –Fragmentation: replacing a sequence of notes in the target sequence by a single note in the source sequence. –Consolidation: replacing a single note in the target sequence by a sequence of notes in the source sequence.

Edit distance: algorithms 1.Longest common substring: Melodic similarity is ranked according to the length of the longest continuous sequence that is identical to a sequence in the query 2.Longest common subsequence: matching notes may be separated by any number or size of gaps. 3.N-grams: An n-gram is defined as a sub-sequence of n elements from a given sequence. A measure of similarity may be determined by counting the number of matching subsequences of a given length. For example, for a query consisting of the sequence of notes: CDEFGA There are 3 possible n-grams of length 4 to be identified in the target. CDEF, DEFG and EFGA

Edit distance: algorithms 4. Dynamic programming: determines the best match between two sequences on a local basis. Given two melodies a matrix can be constructed from the following formula, where c represents the matrix; q and p represents the query and target melody sequences respectively; the index i ranges from 0 to the query length and j from 0 to the target length, d is the cost of an insert or a delete; e is the cost of an exact match and m is the cost of mismatch (where d = 1, e = 0 and m = 1 in this case): From chai, 2001

Edit distance: algorithms For example, using this approach to match the query: [ ] against the target: [ ] we obtain: From chai, 2001 Tracing the local minima results in the following match: [* ] [* ]

2-D and Geometric representations 2-D representations may reduce a melody to a set of onset-pitch pairs. Earth Mover’s Distance function (EMD): “The EMD measures a minimum flow for transforming one weighted point set into another for the purpose of measuring melodic similarity.” Proportional Transportation Distance (PTD): A variant of the EMD function is calculated by first dividing each point’s weight by the point set’s total weight (for both point sets) and then calculating the EMD of the result.

2-D and Geometric representations Geometric measures of similarity model two melodies as monotonic pitch-duration functions of time, where the difference between melody a (Ma) and melody b (Mb) is readily calculated as the difference in area between the two functions. The query melody could be moved relative to the target in order to minimize the area between the two.

2-D and Geometric representations Other algorithms superimpose these functions on cylinders, where time is measured in terms of the angle theta and pitch is measured in terms of z.