Lab 2: Isolated Word Recognition

Slides:



Advertisements
Similar presentations
Software for Engineers EG1003: Introduction to Engineering and Design.
Advertisements

                      Digital Audio 1.
School of Engineering & Technology Computer Architecture Pipeline.
1 OR Tri-County Training 2014 USING PRACTICE LAB On line access to the Practice Lab version of TaxWise software.
Online Math Tutoring. You will find everything you need in this tutorial to maximize your online tutoring experience. The following slides provide… A)
EG1003: Introduction to Engineering and Design Software for Engineers.
System for Engine Location Of a Web Train Paul Wimmer and Adam Weintrop Dr. Irwin and Dr. Schertz 10/31/05.
Hawkes Software You have two options here: ∙ Purchase the boxed software at the bookstore ∙ Download this software from the Hawkes website Downloading.
CGS 1000 Introduction to Computers and Technology.
MBAC 611.  We have been using MS Access to query and modify our databases.  MS Access provides a GUI (Graphical User Interface) that hides much of the.
Sample for Tip #52 Sync audio with slides. This movie was created in Windows Movie Maker Slides created in PowerPoint.
Speech Recognition Final Project Resources
Chapter 3 Application Software.
Temple University Speech Recognition using Sphinx 4 (Ti Digits test) Jaykrishna shukla,Amir Harati,Mubin Amehed,& cara Santin Department of Electrical.
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Business Software What is database software? p. 145 Allows you to create, access, and manage data Add, change, delete, sort, and retrieve data Next.
1 Application Software What is application software?  Programs that perform specific tasks for users.
Training Presentation For e-Learning Uploading an assignment on Moodle.
Activity 1 Record and edit your voice using Audacity 1.Download Audacity (a free and open source audio editing software from
Introduction of Geoprocessing Topic 7a 4/10/2007.
1 Lab of COMP 319 Lab tutor : Yao Zhang, Shenghua ZHONG Lab 4: Nov 30, 2011 Final Project: Image Compression.
DSP homework 1 HMM Training and Testing
2013 Computer Architecture Project MIPS Programming : Merge Sort ChangHyun Yun Room 236, Engineering Building.
2011 Computer Architecture Project #1 MIPS Programming: Merge Sort Ki Sup Hong Room 236, Engineering Building.
ARM Aerial Facilities BAECC Kickoff Meeting Feb Aircraft Data Local Share Drive ARM IOPshare (at Oak Ridge) ARM IOPshare (at Oak Ridge) Corrections.
Voice Recognition (Presentation 2) By: Priya Devi A. S/W Developer, Xsys technologies Bangalore.
UNIT 7 Describing how an item functions [2] (infinitive with or without ‘to’)
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
Glencoe Introduction to Multimedia Chapter 8 Audio 1 sound effect An artificially created or enhanced sound used to achieve an effect (without speech or.
Introduction to Video Game Programming (VGP) Mr. Shultz.
Discovering Computers Fundamentals, Third Edition CGS 1000 Introduction to Computers and Technology Summer 2007.
Basic structure of sphinx 4
1 SEG3120 Analysis and Design for User Interfaces LAB1: Video tape evaluation.
2 Software.
DISCRETE HIDDEN MARKOV MODEL IMPLEMENTATION DIGITAL SPEECH PROCESSING HOMEWORK #1 DISCRETE HIDDEN MARKOV MODEL IMPLEMENTATION Date: Oct, Revised.
What do you know about technology?. Hardware Netbooks- 4 (1 needs repaired) Netbooks- 4 (1 needs repaired) –Battery charger- chord –Power chord –wireless.
Introduction of Geoprocessing Lecture 9 3/24/2008.
Submission of projects in Oncourse Computer Science and Computer Information IUPUI.
Creating a Zip File with a Password. 1.Right Click on the File or Folder you want to Zip. 2.Choose “Add to Zip”
PROJECT Instructions Should be able to test on SITE systems Connection to the server via SSH POSTGRESQL version Must be done in groups If.
HW2-2 Speech Analysis TA: 林賢進
#SummitNow Yes, I'm able to index audio files within Alfresco 2013 Fernando González @fegorama.
SCAI.org – Image Review Image Review Title Your Name Affiliation Please include audio introducing yourself, your affiliation and position, followed by.
These materials are prepared only for the students enrolled in the course Distributed Software Development (DSD) at the Department of Computer.
HCS 235 aid Learn/hcs235aid.com FOR MORE CLASSES VISIT
Using Speech Recognition to Predict VoIP Quality
Laboratory 1: Software for Engineers
Automated Detection of Human Emotion
Yes, I'm able to index audio files within Alfresco
Speech recognition in mobile environment Robust ASR with dual Mic
Submitted by: Ala Berawi Sujod Makhlof Samah Hanani Supervisor:
Microsoft Access 2003 Illustrated Complete
Ying shen School of software engineering tongji university
Developing an Android application for
Lecture 5 – Improved Monte Carlo methods in finance: lab
Creating Transcripts of Your Narrated PowerPoints Richard Oliver Department of Information Systems 2018 Quality in Online Education Conference.
Software for Engineers
to … monnigcte.weebly.com
Speech Capture, Transcription and Analysis App
Software for Engineers
Vinod Kulathumani West Virginia University
Introduction into CAD.
Egyptian Language School
Introduction into CAD.
Lab 3: Isolated Word Recognition
Activity 1 Record and edit your voice using Audacity
What you will need to Register
PROJ2: Building an ASR System
Lab 2: Information Retrieval
Automated Detection of Human Emotion
Presentation transcript:

Lab 2: Isolated Word Recognition Ying shen School of software engineering tongji university

Architecture of an ASR system 3/13/2017 HUMAN COMPUTER INTERACTION

Isolated word recognition based on HMM Train HMM main.m Record a test speech speechRecord.m Run GUI g_interfaces.m 3/13/2017 HUMAN COMPUTER INTERACTION

HUMAN COMPUTER INTERACTION Assignment Run SpeechRecognition-hmm. Get familiar with the program. Train another HMM using your own speech/words and modify GUI accordingly. Add a new function: Record test audios using Stat/End button Save training/test audios in different subdirectories 3/13/2017 HUMAN COMPUTER INTERACTION

HUMAN COMPUTER INTERACTION Report Describe what you have done in the training and test progress. The report should answer the following questions: The words to recognize The number of training samples The modifications to GUI and the codes The accuracy of speech recognition and how to improve it, if possible? Analyze the impact of different number of hmm states on recognition accuracy. Submit your work (code and report) to TA Prepare a readme file to illustrate how to run your program Compress the code (including all your .wav files and the readme file) and the report into a zip file: ID_name_lab2.zip Email address: hci2018@163.com 3/13/2017 HUMAN COMPUTER INTERACTION