Copyright Nov. 2002, George Tzanetakis Digital Music & Music Processing George Tzanetakis PostDoctoral Fellow Computer Science Department Carnegie Mellon.

Slides:



Advertisements
Similar presentations
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
Advertisements

A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Basic Computer 101 and Basic Digital Audio basic is a relative term.
1 Copyright 2011 G.Tzanetakis Music Information Retrieval George Tzanetakis Associate Professor, IEEE Senior Member.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Department of Computer Science University of California, San Diego
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Music Analysis and Retrieval for Audio Signals George Tzanetakis PostDoctoral Fellow Computer Science Department Carnegie Mellon University
Digital audio and computer music COS 116: 2/26/2008.
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
Multimedia Database Systems Retrieval by Content Department of Informatics Aristotle University of Thessaloniki Fall-Winter 2008.
Presented by Zeehasham Rasheed
Analysis & Synthesis The Vocoder and its related technology.
Digital audio and computer music COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Introduction to MIR Course Overview 1.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Audio Retrieval David Kauchak cs458 Fall Administrative Assignment 4 Two parts Midterm Average:52.8 Median:52 High:57 In-class “quiz”: 11/13.
Content-Based Video Retrieval System Presented by: Edmund Liang CSE 8337: Information Retrieval.
Advanced Multimedia Music Information Retrieval Tamara Berg.
Sound Applications Advanced Multimedia Tamara Berg.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
MACHINE LEARNING TECHNIQUES FOR MUSIC PREDICTION S. Grant Lowe Advisor: Prof. Nick Webb.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
August 12, 2004IAML - IASA 2004 Congress, Olso1 Music Information Retrieval, or how to search for (and maybe find) music and do away with incipits Michael.
Student: Mike Jiang Advisor: Dr. Ras, Zbigniew W. Music Information Retrieval.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
Aspects of Music Information Retrieval Will Meurer School of Information University of Texas.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Music-Driven Motion Editing Marc Cardle Rainbow Group Computer Laboratory University of Cambridge Marc Cardle Rainbow Group Computer.
Physics 1251 The Science and Technology of Musical Sound Unit 4 Session 42 MWF Computer Music Unit 4 Session 42 MWF Computer Music.
Digital audio and computer music COS 116, Spring 2010 Adam Finkelstein Slides and demo thanks to Rebecca Fiebrink.
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
[The Band SIG] MPEG7 - Audio 손우람 2007 년 12 월 1 일.
It sure is smart but can it swing? (Digital audio and computer music)
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
(Extremely) Simplified Model of Speech Production
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16.
1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.
Extraction of Individual Tracks from Polyphonic Music Nick Starr.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
Audio Processing Mitch Parry. Similar to Image Processing? For images a pixel is the smallest unit The color is a distribution of the spectrum of visible.
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
Audio Fingerprinting Wes Hatch MUMT-614 Mar.13, 2003.
1 Tempo Induction and Beat Tracking for Audio Signals MUMT 611, February 2005 Assignment 3 Paul Kolesnik.
The Greek Audio Dataset
Audio Segmentation, Classification, and Retrieval
Aspects of Music Information Retrieval
Multimedia Information Retrieval
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Presentation on Timbre Similarity
Review and Importance CS 111.
Music Signal Processing
Presentation transcript:

Copyright Nov. 2002, George Tzanetakis Digital Music & Music Processing George Tzanetakis PostDoctoral Fellow Computer Science Department Carnegie Mellon University

Copyright Nov. 2002, George Tzanetakis Overview Music Information Retrieval (MIR) and Computer Audition Motivation Techniques Applications Computer Music and Sound Synthesis Examples, demos

Copyright Nov. 2002, George Tzanetakis MIR Music History 9000 B.C

Copyright Nov. 2002, George Tzanetakis Music 4 million recorded CD tracks 4000 CDs / month Mp3 bandwidth % Global Pervasive Persistent Why ?

Copyright Nov. 2002, George Tzanetakis The future of MIR Library of all recorded music Tasks: organize, search, retrieve, classify, recommend, browse, listen, annotate Examples:

Copyright Nov. 2002, George Tzanetakis Audio MIR Pipeline Signal ProcessingMachine LearningHuman Computer Interaction Hearing Representation Understanding Analysis Reacting Interaction

Copyright Nov. 2002, George Tzanetakis Traditional Music Representations

Copyright Nov. 2002, George Tzanetakis Time domain waveform time pressure Decompose into building blocks time frequency

Copyright Nov. 2002, George Tzanetakis MIDI Musical Instrument Digital Interface Hardware interface File format Note events Duration, discrete pitch, “instrument” Extensions General MIDI Notation, OMR, continuous pitch

Copyright Nov. 2002, George Tzanetakis Symbolic vs Audio MIR Audio Polyphonic Transcription Symbolic Representation (MIDI) MIR Audio Computer Audition MIR Machine Learning Models

Copyright Nov. 2002, George Tzanetakis Feature extraction

Copyright Nov. 2002, George Tzanetakis Timbral Texture Timbre = differentiate sounds of same loudness, pitch Timbral Texture = differentiate mixtures of sounds Global, statistical and fuzzy properties

Copyright Nov. 2002, George Tzanetakis Spectrum t t+1 M M

Copyright Nov. 2002, George Tzanetakis Fourier Transform P=1/f

Copyright Nov. 2002, George Tzanetakis Short Time Fourier Transform STFT Filterbank interpretation FiltersOscillators Amplitude Frequency output

Copyright Nov. 2002, George Tzanetakis Short Time Fourier Transform II t t+1 M M

Copyright Nov. 2002, George Tzanetakis Formants From “Real Time Synthesis for Interactive Applications” P.Cook, A.K Peters Press, used by permission

Copyright Nov. 2002, George Tzanetakis Linear Prediction Coefficients f0 White Noise SourceFilterSpeech Lossless tubes

Copyright Nov. 2002, George Tzanetakis MPEG Audio Coding (mp3) Analysis Filterbank Psychoacoustic Model Available bits 32 linearly spaced bands Encoder: Slower, Complicated Decoder: Faster, Simpler Perceptual Audio Coding

Copyright Nov. 2002, George Tzanetakis Spectral Shape t M Centroid Rolloff Flux RMS Moments …

Copyright Nov. 2002, George Tzanetakis Summary of Timbral Texture Features Time-Frequency analysis Signal Processing (STFT, DWT) Source-filter (LPC) Perceptual (MP3) Spectral Shape to feature vector

Copyright Nov. 2002, George Tzanetakis Pitch Content Harmony-melody = pitch concepts Music theory score = music Bridge to symbolic MIR Automatic music transcription Non-transcriptive arguments

Copyright Nov. 2002, George Tzanetakis Automatic Pitch Detection P=1/f Time-domain Frequency-domain Perceptual Zerocrossings Autocorrelation analysis = peaks of function correspond to dominant pitches

Copyright Nov. 2002, George Tzanetakis Pitch Histograms JazzIrish Chroma - folded Height - unfolded

Copyright Nov. 2002, George Tzanetakis Automatic Music Transcription OriginalTranscribed Mixture signalNoise suppresion Predominant Pitch Estimation Remove detected sound Estimate # of voices

Copyright Nov. 2002, George Tzanetakis Rhythm Movement in time Origins in Poetry (iambic, trochaic) Foot tapping definition Hierarchical semi-periodic structure at multiple levels of detail Links to motion, dance Running vs global

Copyright Nov. 2002, George Tzanetakis Self similarity DWTAutocorrelation Peak Picking Beat Histograms Envelope Extraction

Copyright Nov. 2002, George Tzanetakis Beat Histograms

Copyright Nov. 2002, George Tzanetakis Analysis Classification Segmentation Similarity Retrieval Clustering Thumbnailing Fingerprinting

Copyright Nov. 2002, George Tzanetakis Analysis Overview Musical PieceTrajectoryPoint

Copyright Nov. 2002, George Tzanetakis Query-by-example Content-based Retrieval Ranked list of k nearest neighbors

Copyright Nov. 2002, George Tzanetakis QBE examples Rock: Beatles Jazz: Bobby Hutserson Funk: Mano negra World: Tibetan singer Computer Music: Paul Lansky Query Match

Copyright Nov. 2002, George Tzanetakis Automatic Musical Genre Classification Categorical music descriptions created by humans Fuzzy boundaries Statistical properties Timbral texture, rhythmic structure, harmonic content Automatic musical genre classification Evaluate musical content features Structure audio collections

Copyright Nov. 2002, George Tzanetakis Genregram demo Dynamic real-time visualization for classification of radio signals

Copyright Nov. 2002, George Tzanetakis Audio segmentation Detect changes of audio texture

Copyright Nov. 2002, George Tzanetakis Multifeature automatic segmenation methodology Time series of feature vector v(t) Detect abrupt changes in trajectory

Copyright Nov. 2002, George Tzanetakis Context & Content Aware User Interfaces Automatic results not perfect Music listening is personal and subjective Browsing vs retrieval “Overview, zoom and filter, details on demand”, Shneiderman mantra Adapt UI to music content and context Computer Audition Visualization

Copyright Nov. 2002, George Tzanetakis Content and Context Content ~ file Genre, male voice, saxophone Content ~ file, collection Similarity Slow-fast Multiple visualizations Same content Different context

Copyright Nov. 2002, George Tzanetakis Timbregrams Content & Context Similarity + Time Structure Principal Component Analysis Map feature vectors to color

Copyright Nov. 2002, George Tzanetakis Timbrespaces

Copyright Nov. 2002, George Tzanetakis Islands of Music

Copyright Nov. 2002, George Tzanetakis Auditory Scene Analysis