NON-NEGATIVE MATRIX FACTORIZATION FOR REAL TIME MUSICAL ANALYSIS AND SIGHT-READING EVALUATION
Chih-Chieh Cheng, Diane J. Hu, and Lawrence K. Saul, UC San Diego CSE Department

We present an interactive, real-time system called the Sight Reading Tutor (SRT) that motivates beginning musicians to practice their sight-reading systematically. The system displays an animated score, "listens" to the player's instrument, and provides instant visual feedback that distinguishes correctly played notes from incorrectly played ones. The back end of the system operates in real time to determine which notes the user has played correctly. We use non-negative matrix factorization (NMF) to learn nonnegative basis templates for each note and to evaluate whether the sound from the user's musical instrument matches the notes on a given musical score.

1 WHAT IS SIGHT-READING?

Sight-reading is the ability to perform music from a score with no preparation and no previous acquaintance with that score. The goal of the sight-reader is to play the piece from beginning to end with reasonable accuracy while keeping tempo. This skill is often needed for activities like: collaborating with other musicians in an ensemble; accompanying choral ensembles; reading through a large amount of music quickly; gauging musical ability at music competitions; and performing at social settings on the fly.

2 HOW CAN COMPUTERS HELP?

Becoming a proficient sight-reader requires constant sight-reading practice. Unfortunately, the beginning musician struggles without a teacher who provides immediate feedback. We have designed a prototype software system that facilitates sight-reading practice following pedagogical guidelines:

1. The user runs the software on a laptop and places the laptop near or on top of a console piano. No digital hook-ups are needed.
2. The user begins by choosing a sight-reading exercise at the appropriate level and tempo.
3. The user hits "start" and the note to be played turns pink, indicating the current position in the score. Four count-in rests are given for the user to prepare.
4. When the user plays a note correctly, points are given and the note turns blue permanently. Otherwise, the note stays white.

3 MUSICAL ANALYSIS BY NMF

Given a nonnegative matrix Y, non-negative matrix factorization (NMF) derives a low-rank approximation Y ≈ Ŷ, where Ŷ = WX. The Kullback-Leibler (KL) divergence is used as the cost function. NMF can learn representations of musical notes based on their magnitude power spectra in the frequency domain. We have:

Y = power spectra of single notes
W = basis templates for magnitude power spectra of notes
X = coefficients for reconstructing observations in Y from learned templates

4 NOTE & CHORD VERIFICATION

During training, NMF is used to learn basis templates in W for 60 single notes spanning 5 octaves on the piano. These templates are then used to determine whether the user is playing the desired note(s) on the musical score by trying to reconstruct the power spectra of the played note(s). The decision-making process is shown at left. The optimal reconstruction is obtained by minimizing the generalized KL divergence (see Section 3). To obtain the overall cost-of-fit at time t, we normalize the reconstruction error by the total power in the window.

[Above] The decision-making process for note & chord verification.

Half-step errors:
[Top] The left histogram shows the costs-of-fit when middle C is reconstructed from (matched) templates for middle C; the right shows C# reconstructed from (mismatched) templates for middle C. The histograms are well separated; choosing an appropriate threshold can detect notes that are off by a half-step and reject them as errors.
[Bottom] Analogous histograms when the chords CEG and DbFAb are considered as potential matches to the desired notes CEG.

Octave errors:
[Top] Costs-of-fit when a note one octave above middle C is matched to templates for the same note (red) and for the note one octave below (blue).
[Bottom] Costs-of-fit when a triad one octave above middle C is matched to templates for the same notes (red) and for the triad one octave below (blue). The octave error is revealed by computing both costs-of-fit and comparing them directly.

5 ONLINE & OFFLINE EVALUATION

Offline Evaluation
A real-time environment is simulated by recording notes and chords with a laptop microphone placed on top of a console piano. Each test sample was 1.5 sec in duration, generating 60 frames to be analyzed by NMF.
(Top) Experimental results for single notes.
(Bottom) Experimental results for major triad chords.

Online Evaluation
To evaluate the accuracy of feedback in our real-time system, we randomly selected ten sight-reading exercises of varying difficulty and used music software to generate three audio files for each exercise, testing different conditions.
Conditions: exercise played correctly; exercise played in the wrong key; exercise played one octave higher.
Results: 98.2% of notes successfully rejected; 96% of notes recognized correctly.
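The template learning of Section 3 and the verification step of Section 4 can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation. It assumes the standard generalized KL divergence and the usual multiplicative updates for NMF, neither of which the poster spells out. The spectra are synthetic, and all function names, the template rank, and the threshold value are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against log(0) and division by zero


def generalized_kl(Y, Y_hat):
    """Standard generalized KL divergence (assumed form):
    D(Y || Y_hat) = sum_ij [ Y_ij * log(Y_ij / Y_hat_ij) - Y_ij + Y_hat_ij ]."""
    Y = np.asarray(Y, dtype=float)
    Y_hat = np.asarray(Y_hat, dtype=float)
    return float(np.sum(Y * np.log((Y + EPS) / (Y_hat + EPS)) - Y + Y_hat))


def learn_templates(Y, rank, n_iter=200, seed=0):
    """Training: factor Y ~ W @ X with multiplicative updates for the KL cost."""
    rng = np.random.default_rng(seed)
    W = rng.random((Y.shape[0], rank)) + 0.1
    X = rng.random((rank, Y.shape[1])) + 0.1
    for _ in range(n_iter):
        W *= ((Y / (W @ X + EPS)) @ X.T) / (X.sum(axis=1)[None, :] + EPS)
        X *= (W.T @ (Y / (W @ X + EPS))) / (W.sum(axis=0)[:, None] + EPS)
    return W, X


def fit_coefficients(Y, W, n_iter=200, seed=0):
    """Verification: keep the templates W fixed and fit only the coefficients X."""
    rng = np.random.default_rng(seed)
    X = rng.random((W.shape[1], Y.shape[1])) + 0.1
    col_sums = W.sum(axis=0)[:, None] + EPS
    for _ in range(n_iter):
        X *= (W.T @ (Y / (W @ X + EPS))) / col_sums
    return X


def cost_of_fit(Y, W):
    """Reconstruction error normalized by the total power in the window."""
    X = fit_coefficients(Y, W)
    return generalized_kl(Y, W @ X) / (Y.sum() + EPS)


def is_match(Y, W, threshold):
    """Accept the played sound if its cost-of-fit falls below the threshold."""
    return bool(cost_of_fit(Y, W) < threshold)


def synth_frames(peaks, n_frames, rng, n_bins=64):
    """Synthetic stand-in for power spectra: a noise floor plus decaying peaks."""
    Y = 0.01 * rng.random((n_bins, n_frames))
    for k, p in enumerate(peaks):
        Y[p, :] += rng.uniform(0.5, 1.0, n_frames) / (k + 1)
    return Y


rng = np.random.default_rng(1)
NOTE_A = [5, 10, 15, 20]  # hypothetical peak bins for the desired note
NOTE_B = [6, 12, 18, 24]  # hypothetical peak bins for a wrong note
THRESHOLD = 0.5           # hypothetical; in practice chosen from histograms

W_a, _ = learn_templates(synth_frames(NOTE_A, 40, rng), rank=2)
test_a = synth_frames(NOTE_A, 10, rng)
test_b = synth_frames(NOTE_B, 10, rng)

matched = cost_of_fit(test_a, W_a)
mismatched = cost_of_fit(test_b, W_a)
print(f"matched cost-of-fit:    {matched:.3f}")
print(f"mismatched cost-of-fit: {mismatched:.3f}")
```

Holding W fixed during verification is what makes the cost-of-fit informative: templates learned for one note cannot reconstruct the spectrum of a different note without a large error. The octave check described in the captions would call `cost_of_fit` twice, once with each candidate set of templates, and compare the two values directly instead of using a threshold.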