Carmine Casciato MUMT 611 Thursday, March 13, 2005

Slides:



Advertisements
Similar presentations
A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger
Advertisements

Local Discriminative Distance Metrics and Their Real World Applications Local Discriminative Distance Metrics and Their Real World Applications Yang Mu,
Creating Music Videos using Automated Media Analysis Authored by Jonathan Foote, Matthew Cooper, and Andreas Girgensohn Presented by Sukhyung Shin, Ninad.
Computational Rhythm and Beat Analysis Nick Berkner.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
LAM: Musical Audio Similarity Michael Casey Centre for Cognition, Computation and Culture Department of Computing Goldsmiths College, University of London.
Energy Characterization and Optimization of Embedded Data Mining Algorithms: A Case Study of the DTW-kNN Framework Huazhong University of Science & Technology,
Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
“Random Projections on Smooth Manifolds” -A short summary
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Principal Component Analysis
An Exploration of timbre: its perception, analysis and representation Dr. Deirdre Bolger CNRS-LMS,Paris Invited lecture, Institut für Musikwissenschaft,
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Sound Applications Advanced Multimedia Tamara Berg.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Educational Software using Audio to Score Alignment Antoine Gomas supervised by Dr. Tim Collins & Pr. Corinne Mailhes 7 th of September, 2007.
Centre for Computational Creativity Semantic Audio Studio Tools and Techniques using MPEG-7 Dr. Michael Casey Centre for Computational Creativity Department.
1 TEMPLATE MATCHING  The Goal: Given a set of reference patterns known as TEMPLATES, find to which one an unknown pattern matches best. That is, each.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Student: Mike Jiang Advisor: Dr. Ras, Zbigniew W. Music Information Retrieval.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
Fundamentals of Music Processing
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Melodic Similarity Presenter: Greg Eustace. Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation.
QBSH Corpus The QBSH corpus provided by Roger Jang [1] consists of recordings of children’s songs from students taking the course “Audio Signal Processing.
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Chapter 15: Classification of Time- Embedded EEG Using Short-Time Principal Component Analysis by Nguyen Duc Thang 5/2009.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Harmonic Models CS 275B/Music 254.
1/16 Dynamic Programming Carmine Casciato MUMT 611 Thursday March 31 st 2005.
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
Audio Processing Mitch Parry. Similar to Image Processing? For images a pixel is the smallest unit The color is a distribution of the spectrum of visible.
Automatic Transcription of Polyphonic Music
Naifan Zhuang, Jun Ye, Kien A. Hua
Rhythmic Transcription of MIDI Signals
Audio Segmentation, Classification, and Retrieval
Calypso Goes to School Additional resource required: Audio Sound file ‘Calypso Goes to School’
A review of audio fingerprinting (Cano et al. 2005)
Artist Identification Based on Song Analysis
Recognition with Expression Variations
Supervised Time Series Pattern Discovery through Local Importance
Sharat.S.Chikkerur S.Anand Mantravadi Rajeev.K.Srinivasan
A Time Series Representation Framework Based on Learned Patterns
Dynamical Statistical Shape Priors for Level Set Based Tracking
Carmine Casciato MUMT 611 Thursday, March 13, 2005
A Similarity Retrieval System for Multimodal Functional Brain Images
Creating Data Representations
EE513 Audio Signals and Systems
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Presentation on Timbre Similarity
Time Series Filtering Time Series
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Measuring the Similarity of Rhythmic Patterns
Presentation transcript:

Carmine Casciato MUMT 611 Thursday, March 13, 2005 Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005

Overview Music Tech research into rhythmic similarity Representations Problem of segmentation

Usage

Usage Computer accompaniment systems Ethno-musicological research Database management Queries by humming Genre classification Speech processing

Paulus and Klapuri (2002) Detects rhythmic similarity between musical samples containing arbitrary drum/percussive sounds Removes sinusoidal components from sound to detect noisy percussive sounds, from this produces beat and measure estimations Extracts loudness and brightness (mean square energy and spectral centroid, respectively) per measure Dynamic Time Warping (DTW) finds optimal path between feature vectors No training of system required

Foote, Cooper, and Nam (2002) Derivation of ‘beat spectrum’ Feature used is power of FFT bins embedded into similarity matrix Euclidean distance is used as similarity metric, tested against cosine angles between feature vectors, and FFT co-efficients

Ellis and Arroyo (2004) Principal Component Analysis, a dimension reduction tool Requires robust segmentation Poor classification results

EigenRhythms (Ellis and Arroyo 2004) Useful for generating variations?

Toussaint (2002) Geometric representations of rhythms Require extensive segmentation Tests various distance metrics Minimum Spanning Trees offer a framework for rhythmic development analysis

Toussaint Representations (2002)

Music Technology References Ellis, D., and J. Arroyo. 2004. Eigenrhythms: Drum pattern basis sets for classification and generation. In Proceedings of the International Conference on Music Information Retrieval. Foote, J., M. Cooper, and U. Nam. 2002. Audio retrieval by rhythmic similarity. In Proceedings of the International Conference on Music Information Retrieval. Paulus, J., and A. Klapuri. 2002. Measuring the similarity of rhythmic patterns. In Proceedings of the International Conference on Music Information Retrieval. Toussaint, G. 2002. A mathematical analysis of African, Brazilian, and Cuban clave rhythms. In Proceedings of BRIDGES: Mathematical Connections in Art, Music, and Science.

Other Disciplines Gabrielsson, A. 1973. Similarity ratings and dimension analyses of auditory rhythm patterns. Scandinavian Journal of Psychology 14: 13860. Lerdahl F., and R. Jackendoff. 1983. A generative theory of tonal music. Cambridge: The MIT Press. Powel, D., and P. Essens. 1985. Perception of temporal patterns. Music Perception 2(4): 41140.