Distributed Rendering Tool for Voices (DRTV) Familiar, Expressive Voices & Personalities Speech Technology & Media Solutions By Dale Schalow SCHALOW Innovations.


1 Distributed Rendering Tool for Voices (DRTV)
Familiar, Expressive Voices & Personalities
Speech Technology & Media Solutions
By Dale Schalow, SCHALOW Innovations, Ashburn, Virginia (USA)

2 DRTV Goals
–Professionally Design, Produce and Develop Familiar-sounding Voices from today, tomorrow and the past
–Provide Always-On Service to Consumers, Businesses and Government
–Offer the Service to Interactive and Linear Media Users as a Hosted Solution (Client/Server)

3 Description
High-quality voices for use in Internet and Content.
Managing Assets with New and Historic Sources.

4 Description
High-quality voices for use in Internet and Content.
–Entertainment and Education
  3D animation, gaming
  Film, TV, radio
–Accessibility
  Seniors
  Low Vision
  Motor-Impaired

5 Description
Build and Manage Speech Assets:
–Establish formal voice asset collection, storage and distribution
–Facilitate asset preservation and restoration
–Coordinate with Museums, Libraries, 3D Game/Film Studios, Radio, Foundations, Colleges, etc.

6 Description
Build and Manage Assets:
–Refactor inventory for both audio and audio-visual physical assets (tapes, digital, reels, master sound recordings)
–Maintain digital asset libraries
–Maintain product voice library with dictionary of terms (paired vocabulary)
–Coordinate asset management IS/IT needs and initiatives with customer or partnering group

7 Technology
New media technology used
–NLP Toolkit (Natural Language Processing)
–Cross-Encoding for Embedded Media (PCs, HD, AAC, MP3/Internet Radio, etc.)
Standards being adopted
–W3C (World Wide Web Consortium)
–Java™ and VoiceXML, SSML (Speech Synthesis Markup Language)
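As an illustration of the SSML standard named on this slide, the following minimal sketch wraps plain text in an SSML 1.0 document using Python's standard library. The function name build_ssml and the voice name "demo-voice" are assumptions made for this example and are not part of DRTV; each speech engine defines its own voice identifiers.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_ssml(text: str, voice_name: str, lang: str = "en-US") -> str:
    """Wrap plain text in a minimal SSML document.

    voice_name is a hypothetical identifier used only for illustration.
    """
    # Serialize the SSML namespace as the default namespace (no prefix).
    ET.register_namespace("", SSML_NS)
    speak = ET.Element(
        f"{{{SSML_NS}}}speak",
        {"version": "1.0", f"{{{XML_NS}}}lang": lang},
    )
    voice = ET.SubElement(speak, f"{{{SSML_NS}}}voice", {"name": voice_name})
    voice.text = text  # special characters such as & are escaped on output
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("hello", "demo-voice"))
```

The resulting string could be passed to any synthesizer that accepts SSML input; which elements a given engine honors varies by product.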

8 Team/Resources
Resources allocated to this project
–Support & outside services
  Internal software development
  Internet Service Provider
  Pro Recording Studios
  3rd party vendors (hardware/software)

9 Speech Tech Procedures
Step 1 - New Voice as Source?
–Professionally Record using N-based “tape script”
  Output format as PCM (e.g. Wave 1-channel 16 bit)
Step 2 - Existing Voice as Source
–Import audio source (PCM/16 bit quality)
–“Auto-Extract” using N-based “tape script” to pull phonetic-features phonemes and transcriptions
  Audio scanning with automatically generated text-based grammars
  Retaining audio output
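The audio format named in Steps 1 and 2 (PCM Wave, 1 channel, 16 bit) can be checked before import. The sketch below uses Python's standard wave module; the function names and the 16 kHz sample rate in the helper are assumptions for illustration, since the slides do not specify a sample rate or any DRTV import API.

```python
import wave


def describe_wav(path: str) -> dict:
    """Read the header fields of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "bits_per_sample": wav.getsampwidth() * 8,
            "sample_rate_hz": wav.getframerate(),
            "duration_s": wav.getnframes() / wav.getframerate(),
        }


def is_mono_16bit(path: str) -> bool:
    """True if the file matches the 1-channel, 16-bit format from the slide."""
    info = describe_wav(path)
    return info["channels"] == 1 and info["bits_per_sample"] == 16


def write_silence(path: str, seconds: float = 0.5,
                  sample_rate_hz: int = 16000) -> None:
    """Write a mono 16-bit file of silence, handy for trying the check.

    The 16000 Hz default is an assumed value, not taken from the slides.
    """
    n_frames = int(seconds * sample_rate_hz)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes = 16 bits per sample
        wav.setframerate(sample_rate_hz)
        wav.writeframes(b"\x00\x00" * n_frames)
```

A recording that fails is_mono_16bit would need to be converted before it could serve as a source in the sense of Step 2.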

10 Speech Tech Procedures
Step 3 - Apply Vocabulary
–Build a default dictionary of terms to allow automatic translation
–Minimum 40k words (more is better)
Step 4 - Process Text-to-Speech (TTS)
–Take as input some text (e.g. “hello”)
–Use the speech synthesis engine to generate audio with the applied vocabulary
Step 5 - Use the URL/file of the generated voice from Step 4 for vertical application (Web page, game, 3D import, etc.)
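The dictionary lookup in Steps 3 and 4 can be sketched as follows. This is a toy illustration under stated assumptions, not the DRTV implementation: the two-word lexicon, the ARPAbet-style phoneme symbols and the function name apply_vocabulary are all hypothetical, and no audio is synthesized. It shows only how input text might be mapped to dictionary entries and how out-of-vocabulary words could be reported.

```python
import re

# Toy lexicon: word -> phoneme symbols (ARPAbet-style, illustrative only).
# A production dictionary would hold tens of thousands of entries.
LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def apply_vocabulary(text, lexicon):
    """Return (phonemes, missing_words) for the given text.

    Words found in the lexicon contribute their phonemes in order;
    words not found are collected so they can be added to the dictionary.
    """
    words = re.findall(r"[a-z']+", text.lower())
    phonemes, missing = [], []
    for word in words:
        entry = lexicon.get(word)
        if entry is None:
            missing.append(word)
        else:
            phonemes.extend(entry)
    return phonemes, missing


if __name__ == "__main__":
    print(apply_vocabulary("Hello, world!", LEXICON))
    print(apply_vocabulary("Hello DRTV", LEXICON))
```

Reporting the missing words separately gives one concrete reason a larger dictionary helps: every word outside the lexicon needs some fallback before it can be spoken.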

11 Speech Tech Procedures Benefits
Reduces time and manual effort to re-do fundamental tasks
Achieves high-quality output
Moves things forward on at least two fronts
–1) Voices we already know or recognize
–2) Voices and creations we are yet to discover in the process
Appeals to many demographics for marketability

12 DRTV Contact Information
For more information:
SCHALOW Innovations
Dale B. Schalow
Phone: (703) 625-7367
Email: dale@schalow.com
Web: http://schalow.com

