A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger

Slides:



Advertisements
Similar presentations
Presentation at Society of The Query conference, Amsterdam November 13-14, 2009 (original title: Learning from Google: software design as a methodology.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
A PowerPoint Presentation
4.1Different Audio Attributes 4.2Common Audio File Formats 4.3Balancing between File Size and Audio Quality 4.4Making Audio Elements Fit Our Needs.
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Multimedia Database Systems
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
SUBJECTIVE ATTRIBUTES OF SOUND Acoustics of Concert Halls and Rooms Science of Sound, Chapters 5,6,7 Loudness, Timbre.
Berenzweig - Music Recommendation1 Music Recommendation Systems: A Progress Report Adam Berenzweig April 19, 2002.
Copyright Nov. 2002, George Tzanetakis Digital Music & Music Processing George Tzanetakis PostDoctoral Fellow Computer Science Department Carnegie Mellon.
Object-based Image Representation Dr. B.S. Manjunath Sitaram Bhagavathy Shawn Newsam Baris Sumengen Vision Research Lab University of California, Santa.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Web Mining Research: A Survey
Chapter 14 Recording and Editing Sound. Getting Started FAQs: − How does audio capability enhance my PC? − How does your PC record, store, and play digital.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Introduction to MIR Course Overview 1.
Advanced Multimedia Music Information Retrieval Tamara Berg.
Sound Applications Advanced Multimedia Tamara Berg.
MACHINE LEARNING TECHNIQUES FOR MUSIC PREDICTION S. Grant Lowe Advisor: Prof. Nick Webb.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Multimedia Databases (MMDB)
Survey Of Music Information Needs, Uses, and Seeking Behaviors Jin Ha Lee J. Stephen Downie Graduate School of Library and Information Science University.
Media Representations - Audio
August 12, 2004IAML - IASA 2004 Congress, Olso1 Music Information Retrieval, or how to search for (and maybe find) music and do away with incipits Michael.
Student: Mike Jiang Advisor: Dr. Ras, Zbigniew W. Music Information Retrieval.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
Multimedia Elements: Sound, Animation, and Video.
Aspects of Music Information Retrieval Will Meurer School of Information University of Texas.
Chapter 15 Recording and Editing Sound. 2Practical PC 5 th Edition Chapter 15 Getting Started In this Chapter, you will learn: − How sound capability.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Extracting meaningful labels for WEBSOM text archives Advisor.
Preprocessing for Data Mining Vikram Pudi IIIT Hyderabad.
Computer Science 1 Week 11. This Week... QBasic While Loops QBasic While Loops Audio Basics Audio Basics.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16.
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
Query by Singing and Humming System
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
1 CS 430: Information Discovery Lecture 23 Non-Textual Materials.
Electronic Document Management By Portford Solutions Group, Inc.
Audio Fingerprinting Wes Hatch MUMT-614 Mar.13, 2003.
Chapter 15 Recording and Editing Sound
Introduction Multimedia initial focus
A review of audio fingerprinting (Cano et al. 2005)
Introduction to Music Information Retrieval (MIR)
Information Retrieval
Aspects of Music Information Retrieval
Musical Style Classification
Data Warehousing and Data Mining
Data Mining Chapter 6 Search Engines
CSE 635 Multimedia Information Retrieval
Presentation on Timbre Similarity
Govt. Polytechnic Dhangar(Fatehabad)
COMS 161 Introduction to Computing
Web Mining Research: A Survey
Institute of New Media Development and Research
Measuring the Similarity of Rhythmic Patterns
Presentation transcript:

A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger

Outline Motivation/Problem Overview Background Types of Music Digital Representations Psychoacoustics Query (Content vs. Meta-Data) Categorization & Clustering Finding More Conclusion

Motivation More music is being stored digitally: PressPlay offers 300,000 tracks for download As collections grow – organizing and searching manually become hard; How to find the “right” music in a sea of possibilities? How to find new artists given current preferences? How to find a song you heard on the radio?

Problem Overview Music is a highly dimension time series: 5 CD quality > 13M samples! It seems logical to apply data mining and IR techniques to this form of information. Query, Clustering, Prediction, etc. Application isn’t straightforward for reasons we’ll discuss shortly.

Background: Types of Music Monophonic: one note sounds at a time. Homophonic: multiple note sound – all starting (and ending) at the same instant. Polyphonic: no constraints on concurrency. Most general – and difficult to handle.

Background: Digital Representations Structured (Symbolic): MIDI – stores note duration & intensity, instructions for a synthesizer Unstructured (Sampled): PCM – stores quantized periodic samples Leverages Nyquist/Shannon’s sampling thm. to faithfully capture the signal. MP3/Vorbis/AAC – discards “useless” information – reduces storage and fidelity Use psychoacoustics Some work at rediscovering musical structure.

Background: Psychoacoustics Two main relevant results: Limited, freq. dependant resolution Auditory masking We hear different frequencies differently: sound spectrum broken into “critical bands” We “miss” signals due to spectral &/or temporal “collision.” Loud sounds mask softer ones, Two sounds of similar frequency get blended

Query – Content is King Current systems use textual meta-data to facilitate query: Song/Album Title, Artist, Genre* The goal is to query by the musical content: Similarity ‘find songs “like” the current one’ ‘find songs “with” this musical phrase’

Result: Query By Humming A handful of research systems have been built that locate songs in a collection based on the user humming or singing a melodic portion of the song. Typically search over a collection of monophonic MIDI files.

Content Based Query Recall: music is a time series with high dimensionality. Need robust dimensionality reduction. Not all parts of music are equally important. Feature extraction – remember the important features. Which features are important?

Similarity/Feature Extraction The current “hard problem” – there are ad- hoc solutions, but little supporting theory. Tempo (bpm), volume, spectral qualities, transitions, etc. Sound source: is it a piano? a trumpet? Singer recognition: who’s the vocalist? Collectively: “Machine Listening” These are hard problems with some positive results.

Compression Complexity Different compression schemes (MP3/Vorbis/AAC) use psychoacoustics differently. Different implementations of a scheme may also! Feature extraction needs to be robust to these variations. Seems to be an open problem.

Categorization/Clustering Genre (rock/r&B/pop/jazz/blues/etc.) is manually assigned – and subjective. Work is being done on automatic classification and clustering. Relies on (and sometimes reinvents) the similarity metric work described previously.

Browsing & Visualization: LOUD: physical exploration Islands of Music: uses self organizing maps to visualize clusters of similar songs.

Current Efforts Amazon/iTunes/etc. use collaborative filtering. If the population is myopic and predictable, it works well, otherwise not. Hit Song Science – clusters a provided set of songs against a database of top 30 hits to predict success. Claims to have predicted the success of Nora Jones. Relatable – musical “fingerprint” technology – involved with “Napster 2”

Finding More Conferences: Int. Symposium on Music IR (ISMIR) Int. Conference on Music and AI (ICMAI) Joint Conference on Digital Libraries Journals: ACM/IEEE Multimedia Groups: MIT Media Lab: Machine Listening Group

Conclusion Slow steady progress is being made. “Music Appreciation” is fuzzy we can’t define it but we know it when we hear it. References, and more detail, are in my survey paper, available shortly on the web.

Fini Questions?