Audio to Score Alignment for Educational Software

Presentation transcript:

Audio to Score Alignment for Educational Software
Interim Presentation for the MSc Personal Project
Antoine Gomas, supervised by Dr. Tim Collins
22nd of June, 2007

Agenda
- Introduction
  - Objectives
  - Review & Innovation
- Work
  - Achievements so far
  - Planned work
- Conclusion

Audio to score alignment?
- Associate:
  - Notes in a score
  - Timing points in a recording
- Example
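To make the idea of associating notes with timing points concrete, here is a minimal sketch. It assumes the aligner has already labelled each analysis frame of the recording with the index of the score note it belongs to; the function name `note_onsets`, the per-frame label list, and the hop size are all hypothetical and do not come from the slides.

```python
def note_onsets(frame_to_note, hop_seconds):
    """Return {note_index: onset_time_in_seconds}.

    frame_to_note: one entry per audio frame, holding the index of the
                   score note sounding in that frame, or None for silence
                   (an assumed representation, not the author's).
    hop_seconds:   time between consecutive frames.
    """
    onsets = {}
    for frame, note in enumerate(frame_to_note):
        # The first frame in which a note appears is taken as its onset.
        if note is not None and note not in onsets:
            onsets[note] = frame * hop_seconds
    return onsets


if __name__ == "__main__":
    labels = [0, 0, 1, 1, 1, 2]
    print(note_onsets(labels, hop_seconds=0.5))
```

The resulting dictionary is one possible form of the alignment: each note in the score is paired with one timing point in the recording.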

Project objectives
- Implement a monophonic audio to score alignment algorithm
- Evaluate characteristics of the performance
- Design a learning interface to help music students improve their performance

Review (1): Previous work
- Algorithms already exist
- Similar to Spoken Language Processing
- Application: musicology
- Professional recordings

Review (2): Previous work (continued)
- Dynamic Time Warping
  - Few parameters
  - Heavy
  - Low flexibility
- Hidden Markov Models
  - Very flexible
  - Large number of parameters (training)
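For readers unfamiliar with Dynamic Time Warping, the following is a textbook sketch rather than the system described in these slides. It assumes the score is a list of expected pitches (MIDI numbers) and the recording is a list of per-frame pitch estimates; the function name `dtw_align` and the absolute-difference cost are illustrative assumptions. The full cost table it fills also hints at why the slide calls DTW "heavy": time and memory grow with the product of the two sequence lengths.

```python
def dtw_align(score, audio, dist=lambda a, b: abs(a - b)):
    """Align two sequences with classic DTW.

    Returns (total_cost, path), where path is a list of
    (score_index, audio_index) pairs from start to end.
    """
    n, m = len(score), len(audio)
    inf = float("inf")
    # acc[i][j]: minimal cumulative cost of aligning score[:i] with audio[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_previous = min(acc[i - 1][j - 1], acc[i - 1][j], acc[i][j - 1])
            acc[i][j] = dist(score[i - 1], audio[j - 1]) + best_previous

    # Backtrack from the end, always moving to the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (acc[i - 1][j - 1], i - 1, j - 1),
            (acc[i - 1][j], i - 1, j),
            (acc[i][j - 1], i, j - 1),
        )
    path.reverse()
    return acc[n][m], path


if __name__ == "__main__":
    cost, path = dtw_align([60, 62, 64], [60, 60, 62, 62, 62, 64])
    print(cost, path)
```

In the usage example, three score notes are stretched over six audio frames, and the path shows which frames were matched to which note.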

Review (3): Innovation
- Apply to educational software
- Requires modifications & new functionalities
  - Cope with errors
  - Detect errors

Work
- First system
- Results so far
- Work plan for the next two months

First system (1)

First system (2)
- First “working” version
- Attack, Sustain, Silence
- Uses Dynamic Time Warping
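The slide does not say how the Attack, Sustain and Silence labels are used. One plausible reading, shown here purely as an assumption, is that each score note is expanded into those three segments before alignment, so that the aligner matches audio frames against segments rather than whole notes. The function name `expand_score` and the tuple layout are hypothetical.

```python
def expand_score(pitches):
    """Expand a list of pitches into (note_index, segment, pitch) triples.

    Each note becomes attack -> sustain -> silence; the silence segment
    carries no pitch. This layout is an illustrative assumption.
    """
    segments = []
    for index, pitch in enumerate(pitches):
        segments.append((index, "attack", pitch))
        segments.append((index, "sustain", pitch))
        segments.append((index, "silence", None))
    return segments


if __name__ == "__main__":
    for segment in expand_score([60, 62]):
        print(segment)
```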

First Results
- Works for simple cases:
  - Short performances
  - Clean synthetic music
- Good at rhythm recovery
- Requires correct pitches

Planned work
- Switch to HMMs
  - Lower computing requirements
  - More flexible to recover from student’s errors
- Design learning interface
  - Thorough review of design standards
  - No implementation expected
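As a rough illustration of what HMM-based alignment could look like, the sketch below runs the Viterbi algorithm over a left-to-right model in which each state may either repeat or advance to the next one. It is a generic sketch, not the planned system: the function name `viterbi_left_to_right`, the shared stay/advance probabilities, and the precomputed emission table are all assumptions. Handling a student's errors would need extra transitions (for example, skipping or repeating a note), which are not shown.

```python
import math


def viterbi_left_to_right(log_emission, log_stay, log_advance):
    """Most likely state sequence for a left-to-right HMM.

    log_emission[t][s]: log-likelihood of audio frame t under state s.
    The path is forced to start in state 0 and end in the last state.
    """
    frames, states = len(log_emission), len(log_emission[0])
    if frames < states:
        raise ValueError("need at least as many frames as states")
    neg_inf = float("-inf")
    score = [[neg_inf] * states for _ in range(frames)]
    back = [[0] * states for _ in range(frames)]
    score[0][0] = log_emission[0][0]
    for t in range(1, frames):
        for s in range(states):
            stay = score[t - 1][s] + log_stay
            advance = score[t - 1][s - 1] + log_advance if s > 0 else neg_inf
            if advance > stay:
                score[t][s], back[t][s] = advance, s - 1
            else:
                score[t][s], back[t][s] = stay, s
            score[t][s] += log_emission[t][s]

    # Trace back from the last state at the last frame.
    path = [states - 1]
    for t in range(frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


if __name__ == "__main__":
    good, bad = math.log(0.9), math.log(0.05)
    truth = [0, 0, 1, 2, 2]
    emissions = [[good if s == k else bad for s in range(3)] for k in truth]
    print(viterbi_left_to_right(emissions, math.log(0.5), math.log(0.5)))
```

Each frame only looks at two predecessors, so the work per frame grows with the number of states rather than with the full product of sequence lengths.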

Conclusion
- Promising first results
- HMMs risky but interesting
- Challenging project

Thank you for listening.
Any questions?