November 4th, 1996, ICAD Industry Panel
Audio Taken Seriously: The Present and Future of Audio at Microsoft
Ken Greenebaum
Internet Platforms and Tools Division, Microsoft Corporation
Slides and other materials online: ics/kgreene/icad
Overview
• Today
  - Solid media foundations (DirectX, ActiveMovie)
• Soon
  - Advanced media (ActiveAnimation, Whisper/Whistler)
• Tomorrow
  - Conversational interfaces
Today: DirectSound
• Streaming audio
• Reasonable latency
• Input (soon)
• Device independence
• Mixes audio from multiple apps
• DSound3D
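The "audio from multiple apps" bullet can be illustrated with a minimal sketch. This is not the DirectSound API; `mix_streams` is a hypothetical helper that assumes every application delivers signed 16-bit PCM at the same sample rate, sums the streams sample by sample, and saturates the result instead of letting it wrap around.

```python
def mix_streams(streams):
    """Mix equal-rate signed 16-bit PCM streams into one output stream.

    Hypothetical illustration only: a real mixer also handles sample-rate
    conversion, per-stream volume, and buffering.
    """
    length = max((len(s) for s in streams), default=0)
    mixed = []
    for i in range(length):
        # Streams that have already ended contribute silence.
        total = sum(s[i] for s in streams if i < len(s))
        # Saturate to the 16-bit range rather than overflowing.
        mixed.append(max(-32768, min(32767, total)))
    return mixed


app_a = [1000, -2000, 30000]
app_b = [500, 500, 10000, 42]
print(mix_streams([app_a, app_b]))  # [1500, -1500, 32767, 42]
```

Summing in a wider accumulator and clamping only at the end is one reason a mixer benefits from more headroom than the 16-bit output format provides.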
Today: ActiveMovie
• Graph-based media architecture
• Movie playback
• Movie record (soon!)
• Open filter API
• Audio plugin technology
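As a rough sketch of what a graph-based media architecture with pluggable filters means, the following chains a source through two filters. The class names (`Source`, `Gain`, `Reverse`) and the `pull` method are invented for illustration and are not the ActiveMovie filter API; the only idea shown is that each filter reads from its upstream neighbor, so new filters can be inserted without changing the others.

```python
class Source:
    """Hypothetical graph node that produces a fixed block of samples."""

    def __init__(self, samples):
        self.samples = samples

    def pull(self):
        return list(self.samples)


class Gain:
    """Hypothetical filter that scales every sample from its upstream node."""

    def __init__(self, upstream, factor):
        self.upstream = upstream
        self.factor = factor

    def pull(self):
        return [s * self.factor for s in self.upstream.pull()]


class Reverse:
    """Hypothetical filter that plays its upstream block backwards."""

    def __init__(self, upstream):
        self.upstream = upstream

    def pull(self):
        return self.upstream.pull()[::-1]


# Build the graph: Source -> Gain -> Reverse, then pull from the last node.
graph = Reverse(Gain(Source([1, 2, 3]), factor=2))
print(graph.pull())  # [6, 4, 2]
```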
Today: NetShow
• Streaming network audio/video
• Multicast audio using RTP (Real-time Transport Protocol)
• ASF file format, conversion, editing tools
• NT server
Today: Interactive Music
(Formerly BlueRibbon’s AudioActive)
• Intelligent interactive music
• Composes and delivers music
• Based on an expert system
• Human composer ‘authors’ templates
• Music always sounds fresh and original
• Look for it: PowerPoint ‘97, MSN Riff
Soon: DirectMusic
Contact:
• Consistent playback of MIDI music
• Internet support for music
• DLS downloadable sample sets
• Optional software MIDI synth
• Internet MIDI jamming?
Soon: “Appelles”
• Expect an announcement soon!
• Animation description language
• Functional paradigm
• Media integration
• Implicit time
• Language integration (Java)
• Enables sophisticated Web animation
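One way to picture the "functional paradigm" and "implicit time" bullets is a sketch in which an animated value is simply a function of time, and larger animations are composed from smaller ones without writing a frame loop. The helper names below (`clock`, `constant`, `lift`) are assumptions made for this illustration and are not taken from the Appelles language itself.

```python
import math


def constant(value):
    """A time-varying value that never changes."""
    return lambda t: value


def clock():
    """The time-varying value that is the current time itself."""
    return lambda t: t


def lift(fn, *behaviors):
    """Apply an ordinary function pointwise to time-varying values."""
    return lambda t: fn(*(b(t) for b in behaviors))


# Declare how things relate; time is never mentioned explicitly.
position = lift(math.sin, clock())                 # object sways left and right
volume = lift(lambda p: 0.5 + 0.5 * p, position)   # sound level follows it

# Only the renderer samples the declarations at concrete times.
for t in (0.0, math.pi / 2):
    print(round(position(t), 3), round(volume(t), 3))
```

Because the sound level is defined in terms of the position rather than updated alongside it, audio and animation stay synchronized by construction, which is the kind of media integration the slide lists.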
Appelles Audio Capabilities:
• All audio types orthogonal
• Parametric synthesis
• MIDI
• AudioActive music synthesis
• Streaming audio
• PCM audio
• 3D spatialized sound embedded in geometry
Soon: “Talisman” Audio
• Hardware acceleration of:
  - DSound/DSound3D
  - Echo cancellation
  - ActiveMovie filter accelerator
  - 32-bit mixer
  - DLS-compatible synthesizer
  - Modem/telephony
Soon: “Whisper”
• Windows Highly Intelligent Speech Recognizer
• Based on Sphinx-II
• Continuous speech recognition
• Speaker independent
• Context-free grammar decoding
Soon: “Whistler”
• Trainable text-to-speech synthesizer
• Training from human speech; maintains:
  - Natural prosody
  - Characteristics of the original human speaker
• Emotional control
• Uses NLP technology to parse text
Tomorrow: Conversational Interfaces
• Motivation:
  - Given the choice, people communicate with speech
  - People prefer natural language over ‘command languages’
  - Anthropomorphism is unavoidable with spoken interaction
Persona Project
• Conversational assistant as UI
  - Spoken conversation (voice recognition/synthesis)
  - Natural language (in limited domains)
  - Assistant with rich visual presence
  - Simulates verbal and non-verbal cues
Here’s Peedy and Gene:
Conclusion:
• Microsoft is:
  - Taking media very seriously
  - Offering a solid foundation today
  - Designing the future