Ichiro Fujinaga McGill University

Slides:



Advertisements
Similar presentations
Intro to Band and Other Instruments SMS Band Home of the Stars Ms. Kinion.
Advertisements

Music Appreciation Musical Instruments.
Y Fernandez- CMS Instruments 1. Families of Instruments String Instruments Violin Viola Cello Double Bass Violin Viola Cello Double Bass.
Music JEOPARDY Family of Instruments By A. Vilcins
Instruments of the Orchestra
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Elements of Music. Harmony Two or more notes together Two or more notes together Chord – three or more notes at one time Chord – three or more notes at.
Tetsuro Kitahara* Masataka Goto** Hiroshi G. Okuno*
How to find concert pitch. C instruments – flute, oboe, trombone, bc baritone, tuba & percussion: No change: Concert C = C B ♭ instruments – clarinet,
Chocolate-Chip Cookie Recipe 1 cup flour ½ tsp baking soda ½ tsp salt ½ cup butter 3/8 cup sugar 3/8 cup brown sugar ½ tsp vanilla 1 egg 1 cup chocolate.
Recent Research in Musical Timbre Perception James W. Beauchamp University of Illinois at Urbana-Champaign Andrew B. Horner Hong University of Science.
A brief reference and explanation Instrument Transposition.
Presented By: Miss Cross.  There are four main types of instruments: ◦ Winds ◦ Brass ◦ Strings ◦ Percussion  Which instruments belong to which categories?
LEARN THE STAFF NOTE NAMES! The clefs have been left off the staves so that all students can use this same PowerPoint. The treble and bass answers will.
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
ORGANIZING – 2010 Chinese New Year Festival Symphony Orchestra.
8F. TRANSPOSITION All Woodwinds. Question One Give the sounding pitch for this written clarinet part: Clarinet in B-flat.
Music Core Content.
Which instruments are members of the percussion family? A.) violin, viola, cello B.) xylophone, snare drum bass drum C.) oboe, bassoon, clarinet,
Syllabus Quiz  1. T or FA student should not have an understanding of composers and compositions at the end of the term.  2. T or F 90% of your grade.
Monday, November 15, 2010 Write and answer: There are four families of instruments. – String – Woodwind – Brass – Percussion List these instrument families.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Spring Creek Middle School Beginning Band Introduction to Band Instruments Mr. Hansen Revised: 18 June 2004.
Musical Instruments.
Instruments and Sound Technique. Instrumentation The main orchestral instruments are: FluteOboeClarinetBassoon.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Musical Instrument Families. Why are there different families Each instrument family has their own unique TIMBRE or sound How is sound produced –By vibrations!
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
Wind Instruments DHC 161 February 29, families woodwinds brass flutes reeds recorder transverse flute piccolo whistles clarinet oboe bassoon saxophone.
Spectral centroid PianoFlute Piano Flute decayed not decayed F0-dependent mean function which captures the pitch dependency (i.e. the position of distributions.
Welcome to the Boise Philharmonic’s Children’s Concert
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Realtime Recognition of Orchestral Instruments Ichiro Fujinaga McGill University.
SPELLING PRACTISE. ORCHESTRA 1. Write the word. Spell it, Study it, Remember it.
Instruments. Strings Commonly made of wood and strings (usually four). Played by drawing a bow across the strings or plucking the strings. Examples: violin,
Music By: Raegan Light.
ZIN! ZIN! ZIN! A VIOLIN BY LLOYD MOSS INSTRUMENT MATCHING GAME
Recognition of Isolated Instrument Tones by Conservatory Students Asha Srinivasan, David Sullivan, and Ichiro Fujinaga Peabody Conservatory of Music Johns.
Orchestral Seating Chart ► The symphony orchestra is composed of three groups: Strings, Winds and Percussion ► The String section consists of basses, cellos,
Each corner of the room is a different answer (A, B, C, or D)
Instruments of the H R O S C A E T R.
An Appreciation © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Chapter 2—Performing Media: Voices and Instruments Range: based.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
{ Band Instruments Where do they belong???.  Band instruments have evolved in the last 500 years for better advancement of musical performance.  Band.
Performance media INSTRUMENTS = mechanism other than a voice that produces musical sounds. (single note or multi) CLASSES OF INSTRUMENTS 1 WOODWIND 2 BRASS.
Types of Musical Groups and Ensembles
Arkansas All-State Results
Introduction to Music Musical Instruments
Musical Instruments and Ensembles
Instruments and Clefs Families Ranges Clefs.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Chapter 2—Performing Media: Voices and Instruments
THIS IS Jeopardy. THIS IS Jeopardy With Your Host... MS. NORDAN.
Basics of Music Notation
Voicing Chords in Multiple Parts: Instrumentation
Instrumentation and Transposition
School of Computer Science & Engineering
Instance Based Learning
Introduction to Instruments & Their Families
Mr. McVay 3rd Grade Beginning Music
Instance Based Learning (Adapted from various sources)
Tuba 2 Tubas.
Instruments of the Orchestra
INSTRUMENTS OF THE ORCHESTRA AND CONCERT BAND
Inputting Instrument Information into Destiny
Ichiro Fujinaga Peabody Conservatory of Music Johns Hopkins University
Realtime Recognition of Orchestral Instruments
Realtime Recognition of Orchestral Instruments
Ichiro Fujinaga McGill University
Tuba.
Presentation transcript:

Ichiro Fujinaga McGill University Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University

Overview Introduction Exemplar-based learning k-NN classifier Genetic algorithm Machine recognition experiments Comparison with human performance Conclusions

Introduction “We tend to think of what we ‘really’ know as what we can talk about, and disparage knowledge that we can’t verbalize.” (Dowling 1989) Western civilization’s emphasis on logic, verbalization, and generalization as signs of intelligence Limitation of rule-based learning used in traditional Artificial Intelligence (AI) research The lazy learning model is proposed here as an alternative approach to modeling many aspects of music cognition

Traditional AI Research “… in AI generally, and in AI and Music in particular, the acquisition of non-verbal, implicit knowledge is difficult, and no proven methodology exists.” (Laske 1992) Rule-based approach in traditional AI research Exemplar-based learning systems Neural networks (greedy) k-NN classifiers (lazy) Adaptive system based on a k-NN classifier and a genetic algorithm

Exemplar-based learning The exemplar-based learning model is based on the idea that objects are categorized by their similarity to one or more stored examples There is much evidence from psychological studies to support exemplar-based categorization by humans This model differs both from rule-based or prototype-based (neural nets) models of concept formation in that it assumes no abstraction or generalizations of concepts This model can be implemented using k-nearest neighbor classifier and is further enhanced by application of a genetic algorithm

Applications of lazy learning model Optical music recognition (Fujinaga, Pennycook, and Alphonce 1989; MacMillan, Droettboom, and Fujinaga 2002) Vehicle identification (Lu, Hsu, and Maldague 1992) Pronunciation (Cost and Salzberg 1993) Cloud identification (Aha and Bankert 1994) Respiratory sounds classification (Sankur et al. 1994) Wine analysis and classification (Latorre et al. 1994) Natural language translation (Sato 1995)

Implementation of lazy learning The lazy learning model can be implemented by the k-nearest neighbor classifier (Cover and Hart 1967) A classification scheme to determine the class of a given sample by its feature vector The class represented by the majority of k-nearest neighbors (k-NN) is then assigned to the unclassified sample Besides its simplicity and intuitive appeal, the classifier can be easily modified, by continually adding new samples that it “encounters” into the database, to become an incremental learning system Criticisms: slow and high memory requirement

K-nearest neighbor classifier “The nearest neighbor algorithm is one of the simplest learning methods known, and yet no other algorithm has been shown to outperform it consistently.” (Cost and Salzberg 1993) Determine the class of a given sample by its feature vector: Distances between feature vectors of an unclassified sample and previously classified samples are calculated The class represented by the majority of k-nearest neighbors is then assigned to the unclassified sample

Example of k-NN classifier

Example of k-NN classifier Classifying Michael Jordan

Example of k-NN classifier Classifying David Wesley

Example of k-NN classifier Reshaping the Feature Space

Distance measures The distance in a N-dimensional feature space between two vectors X and Y can be defined as: A weighted distance can be defined as:

Genetic algorithms Optimization based on biological evolution Maintenance of population using selection, crossover, and mutation Chromosomes = weight vector Fitness function = recognition rate Leave-one-out cross validation

Genetic Algorithm Start Evaluate Population Terminate? Select Parents Produce Offspring Mutate Offspring Stop

Crossover in Genetic Algorithm Parent 1 Parent 2 1011010111101 + 1101010010100 101101 0010100 110101 0111101 Child 1 Child 2

Applications of Genetic Algorithm in Music Instrument design (Horner et al. 1992, Horner et al. 1993, Takala et al. 1993, Vuori and Välimäki 1993) Compositional aid (Horner and Goldberg 1991, Biles 1994, Johanson and Poli 1998, Wiggins 1998) Granular synthesis regulation (Fujinaga and Vantomme 1994) Optimal placement of microphones (Wang 1996)

Realtime Timbre Recognition Original source: McGill Master Samples Up to over 1300 notes from 39 different timbres (23 orchestral instruments) Spectrum analysis of first 232ms of attack (9 overlapping windows) Each analysis window (46 ms) consists of a list of amplitudes and frequencies in the spectra 2048 points / 44.1 / ms = 46ms

Features Static features (per window) Dynamic features pitch mass or the integral of the curve (zeroth-order moment) centroid (first-order moment) variance (second-order central moment) skewness (third-order central moment) amplitudes of the harmonic partials number of strong harmonic partials spectral irregularity tristimulus Dynamic features means and velocities of static features over time

Overall Architecture for Timbre Recognition Live mic Input Sound file Input Data Acquisition & Data Analysis (fiddle) Recognition K-NN Classifier Output Instrument Name Knowledge Base Feature Vectors Genetic Algorithm K-NN Classifier Best Weight Vector Off-line

Results Experiment I SHARC data static features Experiment II McGill samples Fiddle dynamic features Experiment III more features redefinition of attack point

Human vs Computer

Peabody experiment 88 subjects (undergrad, composition students and faculty) Source: McGill Master Samples 2-instruments (oboe, saxophones) 3-instruments (clarinet, trumpet, violin) 9-instruments (flute, oboe, clarinet, bassoon, saxophone, trombone, trumpet, violin, cello) 27-instruments: violin, viola, cello, bass piccolo, flute, alto flute, bass flute oboe, english horn, bassoon, contrabassoon Eb clarinet, Bb clarinet, bass clarinet, contrabass clarinet saxes: soprano, alto, tenor, baritone, bass trumpet, french horn, tuba trombones: alto, tenor, bass

Peabody vs other human groups

Peabody subjects vs Computer

The best Peabody subjects vs Computer

Future Research for Timbre Recognition Performer identification Speaker identification Tone-quality analysis Multi-instrument recognition Expert recognition of timbre

Conclusions Realtime adaptive timbre recognition by k-NN classifier enhanced with genetic algorithm A successful implementation of the exemplar-based learning system in a time-critical environment Recent human experiments poses new challenges for machine recognition of isolated tones

Recognition rate for different lengths of analysis window