Download presentation
Presentation is loading. Please wait.
Published byRalf Richards Modified over 9 years ago
1
1 Telepresence: An Umbrella Research Topic Jim Gray Microsoft Research Gray@Microsoft.com http://research.Microsoft.com/~Gray/
2
2 Federal Research Support Nerve Center of Science If it’s not broke, don’t fix it. But…. l US Science is the engine of progress BUT….. l Best and brightest are spending increasing time fund raising l Seems excessive to me. l Venture capital community is richer and more generous than Federal Support THE LONG BOOM
3
3 Cyberspace is a New World. l We have discovered a “new continent”. l It is changing how we learn, work, and play. –1 T$/y industry –1 T$ new wealth since 1993 –30% of US economic growth since 1993 l There is a gold rush to stake out territory. l But we also need explorers: Lewis & Clark expeditions Universities to teach the next generation(s) l Governments, industry, and philanthropists should fund long-term research. THE LONG BOOM
4
4 1970196019801990 Workstations Lisp machine, Stanford Xerox Alto Apollo, Sun Networking Arpanet, Internet Ethernet, Pup, Datakit DECnet, LANs, TCP/IP Graphics Sketchpad, Utah GM/IBM, LucasFilm E&S, SGI, PIXAR,.. Windows Englebart, Rochester Alto, Smalltalk Star, Mac, Microsoft Research Investments Pay Off CSTB –NRC Evolving the High-Performance Computing and Communications Imitative to Support the nations Information Infrastructure, NA Press, Washington DC, 1995. Time-sharing CTSS, Multics, SSD Unix SDS 940, 360/67 VMS Government funded Industrial Billion Dollar/year Industry
5
5 1970198019902000 Relational Data Bases Berkeley, Wisc,… IBM Oracle, IBM,… Parallel DBs Tokyo,Wisconsin, UCLA ICL, IBM ICL, Teradata, Tandem Research Investments Pay Off Data Mining (complex queries) Wisc, Stanford, … IBM, Arbor,… IRI, Arbor, Plato, …
6
6 Why Can’t Industry Fund IT Research? l It does : IBM (5.8%), Intel (13%), Lucent (12%), Microsoft (14.%), Sun (12%),... –R&D is ~5%-15% (50 B$ of 500 B$) l AD is 10% of that (5 B$) –Long-Range Research is 10% of that 500 M$ 2,500 researchers and university support –Compaq: 4.8% R&D (1.3 B$ of 27.3 B$). AOL: 3.7% D, ?R (96 M$ of 2.6 B$) –Dell:1.6% R&D (204 M$ of 12.6 B$), EDS, MCI-WorldCom, …. l To be competitive, some companies cannot make large long-term research investments. The Xerox/PARC story: created Mac, Adobe, 3Com…
7
7 PITAC Report Presidential IT Advisory Committee http://www.ccic.gov/ac/report/ http://www.ccic.gov/ac/report/ l Findings: –Software construction is a mess: needs breakthroughs. –We do not know how to scale the Internet 100x l Security, manageability, services, terabit per second issues. –USG needs high-performance computing ( Simulation) but market is not providing vector-supers – just providing processor arrays. –Trained people are in very short supply. l Recommendations: –Lewis & Clark expeditions to 21 st century. –Increase long-term research funding by 1.4B$/y. –Re-invigorate university research & teaching. –Facilitate immigration of technical experts.
8
8 Outline (ambitious!) l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
9
9 Microsoft Research -- 1991 l Founded in 1991 l Goal: pursue strategic technologies for Microsoft l Original research groups: –Natural Language Processing –Operating Systems –Programming Languages l Overall size < 20 at the end of 1992
10
10 Microsoft Research -- 1999 l 400 Researchers in 25 areas –Operating systems to Statistical Physics l Research lab locations: –Redmond, Cambridge, San Francisco, Beijing l Internationally recognized research teams –Hundreds of publications, presentations –Leadership roles in professional societies, journals, conferences
11
11 MS Research Areas l Operating systems, languages, compilers, virtual machines, networking, wireless computing, fault-tolerance, large scale servers, security l Natural language, speech, vision, graphics, decision theory, information retrieval, UI, collaboration, statistics, signal processing l Cryptography, statistical physics and discrete mathematics
12
12 Growing Fast l Grew 20x from ‘92 to ‘99 l Decided in ‘97 to grow by a 3x in 3 years –200 in FY97 => 600 in FY00, primarily in Redmond l Major impact on MS products –Virtually all MS products shipped today use technology from MS Research l Key role in MS growth –Pioneering research in software that allows computers to see, hear, speak and understand
13
13 Microsoft Research Philosophy l University organizational model –Flat structure, critical mass groups l Open research environment –Aggressive publication of research results in literature and on world wide web –Frequent visitors, daily seminars –Over 100 visiting professors and interns in 1998 –Over 110 visiting researchers in 1998
14
14 Some Key Senior Researchers l Systems –Rick Rashid, Butler Lampson, Gordon Bell –Anoop Gupta, Roger Needham, Chuck Thacker l Databases & Data Mining –David Lomet, Jim Gray, Usama Fayyad l Graphics –Jim Kajiya, Jim Blinn, Alvy Ray Smith, Michael Cohen l Speech & Language –Karen Jensen, George Heidorn, X.D. Huang, Alex Acero, Hsiao-Wuen Hon, Scott Meredith
15
15 Some Key Senior Researchers l UI Design, Intelligent Systems, IR –George Robertson, Linda Stone, Susan Dumais, David Heckerman, Eric Horvitz, Jack Breese l Computer Vision & Signal Processing –Steve Shafer, Rick Szeliski, P. Anandan, Rico Malvar l Cryptography & Theory –Yacov Yacobi, Jennifer Chayes, Christian Borg, Michael Freedman l Languages & Compilers –Daniel Weise, Chris Fraser, Amitabh Srivastava, Luca Cardelli, David Hanson, Charles Simonyi, Todd Proebsting
16
16 Microsoft Research l 1997 Business Week Poll of Academia: –Voted #7 lab (overall) in Computer Science –Voted #3 industrial research lab (after Bell Labs and IBM Research) –Voted #2 most desirable lab to work (after Stanford)
17
17 What I Do. l Work for the government! –CSTB, PITAC(software, ngi), LoC study,.... l Work on scaleable systems: –1 Billion Transactions Per Day Cluster –TerraServer –New: Sloan Digital Sky Survey
18
18 Outline (ambitious!) l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
19
19 Gordon Bell on Tele Presentations http://research.microsoft.com/barc/GBell/
20
20 Motivation: Telepresentations Presenter and/or audience telepresent NOT: meeting or collaboration settings Forget the nasty social issues! Mostly one-way
21
21 Telepresentation Elements è Slides è Audio l Video l Script, text comments, hyperlinks, etc.
22
22 Telepresentations: The Essentials l Slide and audio a must l Add some video (low quality) to make us feel good l Storage and transmission costs low
23
23 Telepresentations: The Killer App l Increased attendance & lower travel costs l Practical and low-cost NOW l e.g. ACM97 - 2,000 visitors in real space, 20,000 visitors on Internet http://research.microsoft.com/acm97
24
24 Today’s Experiment l Would you like to pause, rewind, browse? l Do you wish you could have seen this –At home? –At another time? l How much does a present speaker add? How much would you pay for real presence?
25
25 University Lectures Online l Research lectures on-line & on-demand l http://murl.microsoft.com/ http://murl.microsoft.com/ l Will get UVC content l Available to anyone anywhere –T1 good, 28.8 OK l Generated by CMU, MIT, MSR, Stanford, UW, Xerox l Hosted by MSR
26
26 Outline (ambitious!) l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
27
27 Changing role of computation l Past: Computers for: –computing (Cray) –business data processing (IBM) –“document” creation (PC) l Future: Computers for: –understanding & learning –communicating –consuming & entertaining l Requires new User Interface to machines
28
Flows
29
29 Making “Flows” a Reality l Computer Graphics –Creating realistic looking environments, people l Computer Vision –Analyzing posture, gaze, gestures l Speech input/output l Natural Language –Analysis, IR l Implicit requests for information
30
Building life-like human characters
31
31 How to fail at Tele-Conferences 1. Eliminate gaze awareness and sense of space of a normal group setting 2. Have long audio latencies & poor audio quality 3. Use incompatible equipment 4. Make it much harder to initiate the call to make a phone call
32
32 Gaze Awareness & Sense of Space l Is anyone paying attention? l Who is talking (where is sound coming from?
33
33 Gaze Awareness l Looking at screen: the forehead shot l Looking at camera: the glowering shot l Looking at YOU.
34
34 You can’t just move the eyes l Glowering l Surprise l Boredom l Interest
35
35 Mona Lisa Effect l Eyes and nose indicated gaze
36
36 Spatialized Audio & Video Pointing “nose vector” at target l Map video onto wire frame l Rotate frame to point in space l Move (fake) eyes in frame (>30°) to point at target l Project voice on that vector.
37
Recognizing gestures Live video Area of motion H flow V flow
38
38 Generating life-like speech from textual data l Data-driven stochastic speech –Natural sounding –Rapid, automatic customizability l Examples –Synthetic voice w/ transplanted speech contours
39
39 l AT&T Voder, 1962, by Homer Dudley –Daisy (Inspiration for HAL’s voice in 2001) l Microsoft Research Whistler, 1997 –Scarborough Fair Artificial singing
40
40 Analyzing language l Language recognition shipped in Word 97 l General purpose text-critiquing, summarization, Japanese word-breaking
41
41 Inside The Office Grammar Checker
42
42 Understanding language: MindNet l A huge language knowledge base l Automatically created from dictionaries l Words (nodes) linked by relationships l Millions of links l Recently added (Encarta) encyclopedia knowledge
43
Is_a mouth Locn_of MindNet -- “Going to the birds” face peck limb goose creature make Is_a Typ_obj sound Typ_subj preen Is_a Part feather Not_is_a plant Is_a gaggle Is_a Part Part_of catch Typ_subj_of Typ_obj claw wing Is_a turtle beak Is_a strike Means hawk Typ_subj opening Is_a chatter Means Typ_subj Typ_obj Is_a cleansmoothbill Is_a duck Is_a Typ_0bj_of keepanimal quack Is_a Cause Purpose bird meat egg poultry Is_a supply Purpose Typ_obj Quesp hen chicken Is_a leg arm Is_a Typ_subj_of fly
44
44 Changing balance between user & software systems l Yesterday: –Applications were single programs running in isolation –Users used to (more or less) understand systems that they used l Today: –Componentized applications operate in concert –Sophisticated users understand only small percentage of systems they use
45
45 Tomorrow’s Systems and Applications l Users will not be able to predict –where computations will be performed, –when they will be performed or –by what software components l Gap between system capabilities and user understanding will grow to the point that the only way user will be able to use system is through assisting agents
46
46 Examples of user agents & implicit actions l Lumiere (Office 97) –Monitoring user and program events to provide user help and assistance l Implicit queries –Inferring information needs from browsing l Lookout/SpamKiller –Monitoring mail activity to auto-categorize it
47
User Modeling l Models of a user’s informational goals –User’s query (when available…) –User’s background –Acute and long-term search activity –Acute actions with objects and documents –Program data structures l Explicit and implicit information access and display
48
48 Outline (ambitious!) l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
49
49 Some Tera-Byte Databases Kilo Mega Giga Tera Peta Exa Zetta Yotta l The Web: 1 TB of HTML l TerraServer 1 TB of images l Several other 1 TB (file) servers l Hotmail: 15 TB of email l Sloan Digital Sky Survey: 40 TB raw, 2 TB cooked l EOS/DIS (picture of planet each week) –15 PB by 2007 l Federal Clearing house: images of checks –15 PB by 2006 (7 year history) l Nuclear Stockpile Stewardship Program –10 Exabytes (???!!)
50
50 Kilo Mega Giga Tera Peta Exa Zetta Yotta A novel A letter Library of Congress (text) All Disks All Tapes A Movie LoC (image) Info Capture l You can record everything you see or hear or read. l What would you do with it? l How would you organize & analyze it? Video 8 PB per lifetime (10GBph) Audio 30 TB (10KBps) Read or write:8 GB (words) See: http://www.lesk.com/mlesk/ksg97/ksg.html
51
51 Kilo Mega Giga Tera Peta Exa Zetta Yotta A novel A letter Library of Congress (text) All Disks All Tapes A Movie LoC (image) All Photos LoC (sound + cinima) All Information!
52
52 Alan Newell’s & Michael Lesk’s Points www.lesk.com/mlesk/ksg97/ksg.html l Soon everything can be recorded and kept l Most data will never be seen by humans l Precious Resource: Human attention Auto-Summarization Auto-Search will be a key enabling technology.
53
53 Outline (ambitious!) l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
54
54 Put Everything in Future (Disk) Controllers (it’s not “if”, it’s “when?”) Acknowledgements : Dave Patterson explained this to me a year ago Kim Keeton Erik Riedel Catharine Van Ingen Helped me sharpen these arguments
55
55 Remember Your Roots
56
56 Technology Drivers: Disks l Disks on track l 100x in 10 years 2 TB 3.5” drive l Shrink to 1” is 200GB l Disk replaces tape? l Disk is super computer! Kilo Mega Giga Tera Peta Exa Zetta Yotta
57
57 Data Gravity Processing Moves to Transducers l Move Processing to data sources l Move to where the power (and sheet metal) is l Processor in –Modem –Display –Microphones (speech recognition) & cameras (vision) –Storage: Data storage and analysis
58
58 It’s Already True of Printers Peripheral = CyberBrick l You buy a printer l You get a –several network interfaces –A Postscript engine l cpu, l memory, l software, l a spooler (soon) –and… a print engine.
59
59 Tera Byte Backplane l TODAY –Disk controller is 10 mips risc engine with 2MB DRAM –NIC is similar power l SOON –Will become 100 mips systems with 100 MB DRAM. l They are nodes in a federation (can run Oracle on NT in disk controller). l Advantages –Uniform programming model –Great tools –Security –economics (CyberBricks) –Move computation to data (minimize traffic) All Device Controllers will be Cray 1’s Central Processor & Memory
60
60 Basic Argument for x-Disks l Future disk controller is a super-computer. –1 bips processor –128 MB dram –100 GB disk plus one arm l Connects to SAN via high-level protocols –RPC, HTTP, DCOM, Kerberos, Directory Services,…. –Commands are RPCs –Management, security,…. –Services file/web/db/… requests –Managed by general-purpose OS with good dev environment l Apps in disk saves data movement –need programming environment in controller
61
61 The Slippery Slope l If you add function to server l Then you add more function to server l Function gravitates to data. Nothing = Sector Server Everything = App Server Something = Fixed App Server
62
62 Why Not a Sector Server? (let’s get physical!) l Good idea, that’s what we have today. l But –cache added for performance –Sector remap added for fault tolerance –error reporting and diagnostics added –SCSI commends (reserve,.. are growing) –Sharing problematic (space mgmt, security,…) l Slipping down the slope to a 2-D block server
63
63 Why Not a 1-D Block Server? Put A LITTLE on the Disk Server l Tried and true design –HSC - VAX cluster –EMC –IBM Sysplex (3980?) l But look inside –Has a cache –Has space management –Has error reporting & management –Has RAID 0, 1, 2, 3, 4, 5, 10, 50,… –Has locking –Has remote replication –Has an OS –Security is problematic –Low-level interface moves too many bytes
64
64 Why Not a 2-D Block Server? Put A LITTLE on the Disk Server l Tried and true design –Cedar -> NFS –file server, cache, space,.. –Open file is many fewer msgs l Grows to have –Directories + Naming –Authentication + access control –RAID 0, 1, 2, 3, 4, 5, 10, 50,… –Locking –Backup/restore/admin –Cooperative caching with client l File Servers are a BIG hit: NetWare™ –SNAP! is my favorite today
65
65 Why Not a File Server? Put a Little on the Disk Server l Tried and true design –Auspex, NetApp,... – Netware l Yes, but look at NetWare –File interface gives you app invocation interface –Became an app server l Mail, DB, Web,…. –Netware had a primitive OS l Hard to program, so optimized wrong thing
66
66 Why Not Everything? Allow Everything on Disk Server (thin client’s) l Tried and true design –Mainframes, Minis,... –Web servers,… –Encapsulates data –Minimizes data moves –Scaleable l It is where everyone ends up. l All the arguments against are short-term.
67
67 The Slippery Slope l If you add function to server l Then you add more function to server l Function gravitates to data. Nothing = Sector Server Everything = App Server Something = Fixed App Server
68
68 Disk = Node l has magnetic storage (100 GB?) l has processor & DRAM l has SAN attachment l has execution environment OS Kernel SAN driverDisk driver File SystemRPC,... ServicesDBMS Applications
69
69 Technology Drivers: System on a Chip l Integrate Processing with memory on one chip –chip is 75% memory now –1MB cache >> 1960 supercomputers –256 Mb memory chip is 32 MB! –IRAM, CRAM, PIM,… projects abound l Integrate Networking with processing on one chip –system bus is a kind of network –ATM, FiberChannel, Ethernet,.. Logic on chip. –Direct IO (no intermediate bus) l Functionally specialized cards shrink to a chip.
70
70 Technology Drivers: What if Networking Was as Cheap As Disk IO? l TCP/IP –Unix/NT 100% cpu @ 40MBps l Disk –Unix/NT 8% cpu @ 40MBps Why the Difference? Host Bus Adapter does SCSI packetizing, checksum,… flow control DMA Host does TCP/IP packetizing, checksum,… flow control small buffers
71
71 Technology Drivers: The Promise of SAN/VIA:10x in 2 years http://www.ViArch.org/ l Today: –wires are 10 MBps (100 Mbps Ethernet) –~20 MBps tcp/ip saturates 2 cpus –round-trip latency is ~300 us l In the lab –Wires are 10x faster Myrinet, Gbps Ethernet, ServerNet,… – Fast user-level communication l tcp/ip ~ 100 MBps 10% of each processor l round-trip latency is 15 us
72
72 Gbps Ethernet: 110 MBps SAN: Standard Interconnect PCI: 70 MBps UW Scsi: 40 MBps FW scsi: 20 MBps scsi: 5 MBps l LAN faster than memory bus? l 1 GBps links in lab. l 100$ port cost soon l Port is computer RIP FDDI RIP ATM RIP SCI RIP SCSI RIP FC RIP ?
73
73 Technology Drivers: 100 GBps Ethernet replaces SCSI l Why I love SCSI –Its fast (40MBps) –The protocol uses little processor power l Why I hate SCSI –Wires must be short –Cables are pricey –pins bend
74
74 Functionally Specialized Cards l Storage l Network l Display M MB DRAM P mips processor ASIC Today: P=50 mips M= 2 MB In a few years P= 200 mips M= 64 MB
75
75 Technology Drivers Plug & Play Software l RPC is standardizing: (DCOM, IIOP, HTTP) –Gives huge TOOL LEVERAGE –Solves the hard problems for you: l naming, l security, l directory service, l operations,... l Commoditized programming environments –FreeBSD, Linix, Solaris,…+ tools –NetWare + tools –WinCE, WinNT,…+ tools –JavaOS + tools l Apps gravitate to data. l General purpose OS on controller runs apps.
76
76 Basic Argument for x-Disks l Future disk controller is a super-computer. –1 bips processor –128 MB dram –100 GB disk plus one arm l Connects to SAN via high-level protocols –RPC, HTTP, DCOM, Kerberos, Directory Services,…. –Commands are RPCs –management, security,…. –Services file/web/db/… requests –Managed by general-purpose OS with good dev environment l Move apps to disk to save data movement –need programming environment in controller
77
77 Summary l Microsoft Research (census) l Tele-Presentations (Gordon Bell, Jim Gemmell) l Microsoft Research initiative on Telepresence l What if you could record everything you see & hear? l The architecture revolution: processing moves to transducers
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.