Distributed Rendering Tool for Voices (DRTV) Familiar, Expressive Voices & Personalities Speech Technology & Media Solutions By Dale Schalow SCHALOW Innovations.

Slides:



Advertisements
Similar presentations
IBM WebSphere Everyplace Access for Multiplatforms Managing the e-business Customer Experience.
Advertisements

Electronic Publishing DGXIII/E Kieran O’Hea Techserv Expert Services to European Commission DGXIII/E.
Page 1. Page 2 Virtual Speaker: A Virtual Studio The software: Virtual Speaker is a package that automatically creates your voice files, prompts or any.
Collaborative Customer Relationship Management (CCRM) User Group June 23 rd, 2004.
This is an audio presentation. Please turn on your computer speakers. Press to start the presentation.
Languages & The Media, 5 Nov 2004, Berlin 1 New Markets, New Trends The technology side Stelios Piperidis
UNDERSTANDING JAVA APIS FOR MOBILE DEVICES v0.01.
Technological Convergence for Institutions & Audiences
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Chapter 1 Overview The Foundation for Your Future © The McGraw-Hill Companies, Inc., 2000.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
New Technologies Are Surfacing Everyday. l Some will have a dramatic affect on the business environment. l Others will totally change the way you live.
James A. Senn’s Information Technology, 3rd Edition
Find The Better Way Expand Your Voice with VXML May 10 th, 2005.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
1 Review PowerPoint for Unit Test covering Chapter #1 & #2.
Digital Alternatives to Transcribed Records at FAO IAMLADP Working Group on Technology for Conferences, Languages and Publications Task Force on Digital.
CONFIDENTIAL | © Nuance Communications, Inc. All rights reserved. ENTERPRISE SOLUTIONS 1 Parteek Singh.
The Digital Motion Picture Archive Framework Project © 2008 AMPAS Academy of Motion Picture Arts and Sciences Science and Technology Council Nancy Silver,
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
Business Computing 550 Lesson 4. Fundamentals of Information Systems, Fifth Edition Chapter 4 Telecommunications, the Internet, Intranets, and Extranets.
Assistive Technology and Web Accessibility University of Hawaii Information Technology Services Jon Nakasone.
Putting eAccessibility at the core of information systems EPUB 3 – ebooks designed for all Markus Gylling, CTO, IDPF & DAISY Consortium 26/03/2012, Cité.
Conversational Applications Workshop Introduction Jim Larson.
1 © 2004 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Media Resource Control Protocol v2 Sarvi Shanmugham, Editor: MRCP v1/v2.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
 Using Screenr, Jing, and QuickTime Plus some alternatives!
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE) David C. Gibbs Department of Mathematics and Computing University.
Course Title: M.M.T Chapter No: 01 “Introduction to Multimedia”
The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.
1 Web-4-All What is Web-4-All? Who is it for? How does Web-4-All address a user’s needs? How does it work? Why is Web-4-All important? Government On-Line.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Silverlight Technology. Table of Contents 1.What is Silverlight Technology? 2.Silverlight Overview. 2.1 How it works 2.2 Silverlight development tools.
The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.
1 The Technical Standards and Your Bid Sarah Ormes UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by Resource: The Council for Museums, Archives.
2 Our recipe for success  The right market  The right time  The right products  The right company.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Day Podcast content is consumed on the user’s personal computers or portable devices 2. Can be used with RSS feeds to be automatically downloaded.
November 4th, 1996ICAD Industry Panel1 Audio Taken Seriously; The present and future of audio at Microsoft Ken Greenebaum Internet.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
The Localisation Industry in Transition: New Economy, New Technology Florita Mendez Localisation Ireland 2000 Dublin, November 7, 2000.
Fundamentals of Graphic Communication 3.5 Accessible Design.
AgentSheets ® Thought Amplifier AgentSheets, Inc. Boulder, CO, USA Dr. Alexander Repenning, CEO.
Accessible Media Using Video and Audio to meet the needs of a diverse populations Presented by Kaela Parks.
Information Retrieval
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
ICT in Classroom Prepared by: Ymer LEKSI Kukes
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
Digital Media Content MCD 7213 Development. Presentation outline What is media What is DIGITAL media? What is DIGITAL content? Traits of digital content.
Glencoe Introduction to Multimedia Chapter 8 Audio 1 Section 8.1 Audio in Multimedia Audio plays many roles in multimedia. Effective use in multimedia.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
Chapter1 FOUNDATIONS OF INFORMATION SYSTEMS IN BUSINESS.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Presentation of Curricula THE SCHOOL OF ELECTRICAL AND COMPUTER ENGINEERING OF APPLIED STUDIES DIGITAL BROADCASTING AND BROADBAND TECHNOLOGIES DBBT project.
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 13 Computer Programs and Programming Languages.
Presented By Sharmin Sirajudeen S7 CS Reg No :
VoiceXML Tutorial: Part 1 Introduction and User Interaction with DTMF
Discovering Computers 2011: Living in a Digital World Chapter 3
Objectives Overview Identify the four categories of application software Describe characteristics of a user interface Identify the key features of widely.
Yes, I'm able to index audio files within Alfresco
Bentley Project Reel Digitization Bentley Historical Library t
Digital vs Analogue.
CNN Script | PHP article scripts from I-Netsolution.
Media Products and Processes
Chapter 11-Business and Technology
Silverlight Technology
Replace with Application Image
TECHNOLOGICAL CONVERGENCE for Institutions & Audiences
Multimedia Systems & Interfaces
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

Distributed Rendering Tool for Voices (DRTV) Familiar, Expressive Voices & Personalities Speech Technology & Media Solutions By Dale Schalow SCHALOW Innovations Ashburn, Virginia (USA)

DRTV Goals Professionally Design, Produce, Develop Familiar-sounding Voices from today, tomorrow and the past Provide Always-On Service to Consumers, Businesses and Government Provided for Interactive and Linear Media Users as a Hosted Solution (Client/Server)

Description High-quality voices for use in Internet and Content. Managing Assets with New and Historic Sources.

Description High-quality voices for use in Internet and Content. –Entertainment and Education 3D animation, gaming Film, TV, radio –Accessibility Seniors Low Vision Motor-Impaired

Description Build and Manage Speech Assets: –Establish formal voice asset collection, storage and distribution –Facilitate asset preservation and restoration –Coordinate with Museums, Libraries, 3D Game/Film Studios, Radio, Foundations, Colleges, etc

Description Build and Manage Assets: –Refactor inventory for both audio and audio- visual physical assets (tapes, digital, reels, master sound recordings) –Maintain digital asset libraries –Maintain product voice library with dictionary of terms (paired vocabulary) –Coordinate asset management IS/IT needs and initiatives with customer or partnering group

Technology New media technology used –NLP Toolkit (Natural Language Processing) –Cross-Encoding for Embedded Media (PCs, HD, AAC, MP3/Internet Radio, etc) Standards being adopted –W3C (World-Wide Web Consortium) –Java™ and VoiceXML, SSML (Speech Synthesis Markup Language)

Team/Resources Resources allocated to this project –Support & outside services Internal software development Internet Service Provider Pro Recording Studios 3rd party vendors (hardware/software)

Speech Tech Procedures Step 1 - New Voice as Source? –Professionally Record using N-based “tape script” Output format as PCM (e.g. Wave 1-channel 16 bit) Step 2 - Existing Voice as Source –Import audio source (PCM/16 bit quality) –“Auto-Extract” using N-based “tape script” to pull phonetic-features phonemes and transcriptions Audio scanning with automatically generated text- based grammars Retaining audio output

Speech Tech Procedures Step 3 - Apply Vocabulary –Build a default dictionary of terms to allow automatic translation –Minimum 40k words (ideally more is better) Step 4 - Process Text-to-Speech (TTS) –Take as input some text (e.g. “hello”) –Use the speech synthesis engine to generate audio with the applied vocabulary Step 5 - Use the URL/file of the generated voice from Step 4 for vertical application (Web page, game, 3D import, etc)

Speech Tech Procedures Benefits Reduces time and manual effort to re-do fundamental tasks Achieved high-quality output Moving things forward on at least two-fronts –1) Voices we already know or recognize –2) Voices and creations we are yet to discover in the process Appeals to many demographics for marketability

DRTV Contact Information For more information: SCHALOW Innovations Dale B. Schalow Phone: (703) Web: