The Speech Solution www.infovox.se www.babeltech.com BABEL DEMONSTRATOR RESPITE PROJECT.

Presentation transcript:

The Speech Solution BABEL DEMONSTRATOR RESPITE PROJECT

Babel demonstrator
The demo interface provides access to the ASR process at several anchor points: the sampled speech signal, acoustic features, state likelihoods, and recognized sentences.

Customisable interface
Each block can be processed independently by calling a user-defined external program. The only constraint is that the program must be compatible with the input/output data format.
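As a rough illustration of a block delegating its processing to an external program, the sketch below pipes a block's input bytes to a command and collects the output bytes. The slides do not say how data is passed to the program, so the use of standard input and output here is an assumption, and `run_block` is a hypothetical helper name.

```python
import subprocess
import sys


def run_block(command, data):
    """Run a user-defined external program on one block's data.

    Assumption (not stated in the slides): the program reads the input
    data from stdin and writes the processed data to stdout, both in the
    same binary data format.
    """
    result = subprocess.run(
        command,
        input=data,
        stdout=subprocess.PIPE,
        check=True,  # raise if the external program fails
    )
    return result.stdout


if __name__ == "__main__":
    # Stand-in "external program": a pass-through filter written in Python.
    passthrough = [
        sys.executable,
        "-c",
        "import sys; sys.stdout.buffer.write(sys.stdin.buffer.read())",
    ]
    print(run_block(passthrough, b"\x01\x02abc"))
```

Any executable that honours the same input/output format could be substituted for the pass-through command.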

Data display
Display is fully handled by the ASR interface, through specific plug-ins for the 3 different data types: acoustic features, probabilities/likelihoods, word hypotheses.
Automatic time-alignment of the different data streams on the speech signal.
Allows display of a block's internal data.
Dynamic internal data management.
The definition of the data format must include all the information required by these constraints.
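Time alignment on the speech signal amounts to converting a frame's time index into a range of sample positions. The sketch below shows that arithmetic under one assumption that the slides do not confirm: the time index marks the start of the frame. The function name `frame_to_samples` is hypothetical.

```python
def frame_to_samples(time_ms, frame_length_ms, sample_rate_hz):
    """Map a frame to the (start, end) sample indices it covers.

    Assumption: time_ms is the start time of the frame.
    """
    start = int(round(time_ms * sample_rate_hz / 1000.0))
    length = int(round(frame_length_ms * sample_rate_hz / 1000.0))
    return start, start + length


if __name__ == "__main__":
    # A 25 ms frame starting at 20 ms in an 8 kHz signal.
    print(frame_to_samples(20.0, 25.0, 8000))
```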

Data format
The format is frame based. Each frame can contain several synchronized data streams of any of the pre-defined data types.
Binary header:
  Sample rate in Hz        unsigned short
  Frame length in ms       float
  Frame shift in ms        float
  Number of streams        unsigned short
  Name stream #1           64 bytes
  Type stream #1           unsigned short
  Frame size stream #1     unsigned short
  Name stream #2           64 bytes
  Type stream #2           unsigned short
  Frame size stream #2     unsigned short
The stream names identify the window pane in which each stream is displayed.
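The header layout above can be sketched with Python's `struct` module. Several details are assumptions, since the slides do not specify them: little-endian byte order, no padding between fields, NUL-padded ASCII stream names, and an unsigned short frame size for every stream. The names `pack_header` and `unpack_header` are hypothetical.

```python
import struct

# Assumed: little-endian, packed without padding.
GLOBAL_FMT = "<HffH"   # sample rate (Hz), frame length (ms), frame shift (ms), number of streams
STREAM_FMT = "<64sHH"  # stream name (64 bytes), stream type, frame size


def pack_header(sample_rate, frame_length_ms, frame_shift_ms, streams):
    """Build a header; streams is a list of (name, type, frame_size)."""
    out = struct.pack(GLOBAL_FMT, sample_rate, frame_length_ms,
                      frame_shift_ms, len(streams))
    for name, stream_type, frame_size in streams:
        # "64s" pads short names with NUL bytes.
        out += struct.pack(STREAM_FMT, name.encode("ascii"),
                           stream_type, frame_size)
    return out


def unpack_header(buf):
    """Parse a header; return (info dict, offset of the first frame)."""
    sample_rate, frame_length_ms, frame_shift_ms, count = struct.unpack_from(
        GLOBAL_FMT, buf, 0)
    offset = struct.calcsize(GLOBAL_FMT)
    streams = []
    for _ in range(count):
        raw_name, stream_type, frame_size = struct.unpack_from(
            STREAM_FMT, buf, offset)
        offset += struct.calcsize(STREAM_FMT)
        name = raw_name.split(b"\0", 1)[0].decode("ascii")
        streams.append((name, stream_type, frame_size))
    info = {
        "sample_rate": sample_rate,
        "frame_length_ms": frame_length_ms,
        "frame_shift_ms": frame_shift_ms,
        "streams": streams,
    }
    return info, offset


if __name__ == "__main__":
    # Hypothetical two-stream header: PCM samples plus 13 features.
    header = pack_header(8000, 25.0, 10.0,
                         [("samples", 1, 80), ("plp", 2, 13)])
    print(unpack_header(header))
```

Returning the offset alongside the parsed fields lets a reader continue directly with the frame data that follows the header.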

Data format (cont'd)
The stream type is one of the following:
  1 - samples (PCM16)      signed short
  2 - features             float
  3 - probabilities        float
The actual data are then formatted as follows, for each frame:
  Time index in milliseconds for the current frame (float)
  Data stream #1
  Data stream #2
  …
The time index is used to time-align the different data streams. A time index of -1 marks the end of the current utterance.
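A minimal sketch of reading frames laid out this way follows. It assumes little-endian packing, that a stream's frame size counts values rather than bytes, and that the -1 end-of-utterance marker is not followed by stream data; none of these points is stated in the slides. `pack_frame` and `iter_frames` are hypothetical names.

```python
import struct

# Stream type -> struct code: 1 = PCM16 samples, 2 = features, 3 = probabilities.
TYPE_CODES = {1: "h", 2: "f", 3: "f"}


def pack_frame(time_ms, streams, values):
    """Serialize one frame; streams is a list of (type, frame_size)."""
    out = struct.pack("<f", time_ms)
    for (stream_type, frame_size), vals in zip(streams, values):
        out += struct.pack(f"<{frame_size}{TYPE_CODES[stream_type]}", *vals)
    return out


def iter_frames(buf, streams):
    """Yield (time_ms, [stream_values, ...]) until the -1 time index."""
    offset = 0
    while offset < len(buf):
        (time_ms,) = struct.unpack_from("<f", buf, offset)
        offset += 4
        if time_ms == -1.0:
            return  # end of the current utterance
        data = []
        for stream_type, frame_size in streams:
            fmt = f"<{frame_size}{TYPE_CODES[stream_type]}"
            data.append(struct.unpack_from(fmt, buf, offset))
            offset += struct.calcsize(fmt)
        yield time_ms, data


if __name__ == "__main__":
    layout = [(1, 4), (2, 2)]  # 4 PCM samples and 2 features per frame
    payload = (
        pack_frame(0.0, layout, [(1, -2, 3, 4), (0.5, 1.5)])
        + pack_frame(10.0, layout, [(5, 6, 7, 8), (2.5, 3.5)])
        + struct.pack("<f", -1.0)
    )
    for frame in iter_frames(payload, layout):
        print(frame)
```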

Data format (cont'd)
The word hypotheses are written in TIMIT format:
  start_time(samples) end_time(samples) word_hyp
for instance: Sil one eight six
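Lines in this `start_time end_time word_hyp` layout can be parsed and converted from samples to seconds as sketched below. The sample positions in the usage example are made up for illustration, and `parse_timit_words` is a hypothetical name.

```python
def parse_timit_words(text, sample_rate_hz):
    """Parse 'start end word' lines; return (start_s, end_s, word) tuples."""
    hypotheses = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        start, end = int(parts[0]), int(parts[1])
        word = " ".join(parts[2:])
        hypotheses.append((start / sample_rate_hz, end / sample_rate_hz, word))
    return hypotheses


if __name__ == "__main__":
    # Illustrative sample positions only, at an assumed 8 kHz sample rate.
    example = "0 4000 sil\n4000 8000 one\n8000 14000 eight\n"
    for hyp in parse_timit_words(example, 8000):
        print(hyp)
```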

Status point
3 different plug-ins, one for each data type.
  Just copy the plug-in into a specific directory to add the different functions; the wavesurfer program itself is not modified.
3 different variables available from any function:
  SAMPLES, FEATURES, PROBABILITIES, in Respite format.
On-line and off-line processing implemented.
Display function for samples and probabilities implemented.

Status point
Customizable interface: implemented for samples data.
Integration of Windows / Unix programs for data processing.

Still to be done …
Display of features data (e.g. RASTA …).
Block management for features and probabilities.
Dynamic management of the available RESPITE variables.
  The user can choose which variable to display: NOISY SAMPLES, PLP Features, Probabilities.

Still to be done …
Multiple stream processing …
  Integration in variables
  Integration in display menus
Full installation package …

Demonstration …