Lab 3: Isolated Word Recognition

Slides:



Advertisements
Similar presentations
Software for Engineers EG1003: Introduction to Engineering and Design.
Advertisements

                      Digital Audio 1.
School of Engineering & Technology Computer Architecture Pipeline.
EG1003: Introduction to Engineering and Design Software for Engineers.
1 Location Based File Exchange Controlled By Speech (LBFE-S) Final Project (Master Thesis) By Mohammed Marouf Supervisors John A. Sørenson.
EG1003: Introduction to Engineering and Design Introduction to LabVIEW.
Hawkes Software You have two options here: ∙ Purchase the boxed software at the bookstore ∙ Download this software from the Hawkes website Downloading.
MBAC 611.  We have been using MS Access to query and modify our databases.  MS Access provides a GUI (Graphical User Interface) that hides much of the.
Sample for Tip #52 Sync audio with slides. This movie was created in Windows Movie Maker Slides created in PowerPoint.
Office 03 Option Margaret S. Britt.  You will learn to:  Package PowerPoint for a CD  Burn the PowerPoint folder to a CD  Check the CD for accuracy.
Speech Recognition Final Project Resources
Temple University Speech Recognition using Sphinx 4 (Ti Digits test) Jaykrishna shukla,Amir Harati,Mubin Amehed,& cara Santin Department of Electrical.
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Business Software What is database software? p. 145 Allows you to create, access, and manage data Add, change, delete, sort, and retrieve data Next.
1 Application Software What is application software?  Programs that perform specific tasks for users.
Training Presentation For e-Learning Uploading an assignment on Moodle.
Sound on the Web. Using Sound on a Web Site Conveying information  pronounce a word or describe a product Set a mood  music to match the web page scene.
Activity 1 Record and edit your voice using Audacity 1.Download Audacity (a free and open source audio editing software from
DSP homework 1 HMM Training and Testing
2013 Computer Architecture Project MIPS Programming : Merge Sort ChangHyun Yun Room 236, Engineering Building.
2011 Computer Architecture Project #1 MIPS Programming: Merge Sort Ki Sup Hong Room 236, Engineering Building.
Voice Recognition (Presentation 2) By: Priya Devi A. S/W Developer, Xsys technologies Bangalore.
UNIT 7 Describing how an item functions [2] (infinitive with or without ‘to’)
Construction Planning and Prerequisite
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
Glencoe Introduction to Multimedia Chapter 8 Audio 1 sound effect An artificially created or enhanced sound used to achieve an effect (without speech or.
Introduction to Video Game Programming (VGP) Mr. Shultz.
Use Flowchart modeling to design and create an interactive quiz to be run on Powerpoint. Start Question Click to start the quiz Correct Incorrect, try.
Basic structure of sphinx 4
Using Voice to Solve Ergonomic Problems Dr. William Lenharth, CHFP UNH – Project54.
You may have already done this… Download the class files to the desktop Expand those files into root of USB stick Change your USB stick to drive “Z”!
1 SEG3120 Analysis and Design for User Interfaces LAB1: Video tape evaluation.
2 Software.
Course File seminar NCAAA SSR Preparation (3rd dec 2015)
DISCRETE HIDDEN MARKOV MODEL IMPLEMENTATION DIGITAL SPEECH PROCESSING HOMEWORK #1 DISCRETE HIDDEN MARKOV MODEL IMPLEMENTATION Date: Oct, Revised.
Submission of projects in Oncourse Computer Science and Computer Information IUPUI.
Project Presentation Eoin Culhane Multi Channel Music Recognition for an Electric Guitar.
HW2-2 Speech Analysis TA: 林賢進
#SummitNow Yes, I'm able to index audio files within Alfresco 2013 Fernando González @fegorama.
Canvas For Non-Traditional Classes Media is the key.
SCAI.org – Image Review Image Review Title Your Name Affiliation Please include audio introducing yourself, your affiliation and position, followed by.
These materials are prepared only for the students enrolled in the course Distributed Software Development (DSD) at the Department of Computer.
Using Speech Recognition to Predict VoIP Quality
Laboratory 1: Software for Engineers
Automated Detection of Human Emotion
Yes, I'm able to index audio files within Alfresco
Speech recognition in mobile environment Robust ASR with dual Mic
Committee of Experts World Intellectual Property Organization
Microsoft Access 2003 Illustrated Complete
Ying shen School of software engineering tongji university
Lab 2: Isolated Word Recognition
CIS 273 Innovative Education-- snaptutorial.com
Developing an Android application for
Computer-Aided Design (CAD) Competition
Lecture 5 – Improved Monte Carlo methods in finance: lab
Creating Transcripts of Your Narrated PowerPoints Richard Oliver Department of Information Systems 2018 Quality in Online Education Conference.
Software for Engineers
Speech Capture, Transcription and Analysis App
Software for Engineers
Vinod Kulathumani West Virginia University
HOW TO CREATE A CLASS Steps:
Introduction into CAD.
Egyptian Language School
Introduction into CAD.
Activity 1 Record and edit your voice using Audacity
PROJ2: Building an ASR System
Engineering Pro Tips Recitation 5 Intro
Computer-Aided Design (CAD) Competition
Lab 2: Information Retrieval
Automated Detection of Human Emotion
Presentation transcript:

Lab 3: Isolated Word Recognition Ying shen School of software engineering tongji university

Architecture of an ASR system 3/13/2017 HUMAN COMPUTER INTERACTION

Isolated word recognition based on HMM Train HMM main.m Record a test speech speechRecord.m Run GUI g_interfaces.m 3/13/2017 HUMAN COMPUTER INTERACTION

HUMAN COMPUTER INTERACTION Assignment Run SpeechRecognition-hmm. Get familiar with the program. Train another HMM using your own speech/words and modify GUI accordingly. Add a new function: Record test audios using Stat/End button Save training/test audios in different subdirectories 3/13/2017 HUMAN COMPUTER INTERACTION

HUMAN COMPUTER INTERACTION Report Describe what you have done in the training and test progress. The report should answer the following questions: The words to recognize The number of training samples The modifications to GUI and the codes The accuracy of speech recognition and how to improve it, if possible? Analyze the impact of different number of hmm states on recognition accuracy. Submit your work (code and report) to TA Prepare a readme file to illustrate how to run your program Compress the code (including all your .wav files and the readme file) and the report into a zip file: ID_name_lab3.zip 1352830-1552682: lixiyuan00100@126.com 1552684-1556122: htxu@tongji.edu.cn 3/13/2017 HUMAN COMPUTER INTERACTION