MGB 2003 © 2003 Microsoft Corporation. All rights reserved.

Slides:



Advertisements
Similar presentations
Symantec 2010 Windows 7 Migration EMEA Results. Methodology Applied Research performed survey 1,360 enterprises worldwide SMBs and enterprises Cross-industry.
Advertisements

Symantec 2010 Windows 7 Migration Global Results.
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Chapter 5 Transfer of Training
Oracle to MySQL Database Migration SQLWays - Migration Software Presentation Copyright (c) Ispirer Systems Ltd. All Rights Reserved.
1. XP 2 * The Web is a collection of files that reside on computers, called Web servers. * Web servers are connected to each other through the Internet.
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Chapter 14 Intranets & Extranets. Awad –Electronic Commerce 1/e © 2002 Prentice Hall 2 OBJECTIVES Introduction Technical Infrastructure Planning an Intranet.
Distributed Systems Architectures
Trying to Use Databases for Science Jim Gray Microsoft Research
Online Science -- The World-Wide Telescope Archetype
U Computer Systems Research: Past and Future u Butler Lampson u People have been inventing new ideas in computer systems for nearly four decades, usually.
World Wide Telescope mining the Sky using Web Services Information At Your Fingertips for astronomers Jim Gray Microsoft Research Alex Szalay Johns Hopkins.
The Access Grid Ivan R. Judson 5/25/2004.
1 Online Science the New Computational Science Jim Gray Microsoft Research Alex Szalay Johns Hopkins.
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research Talk at
1 Online Science -- The World-Wide Telescope as an Archetype Jim Gray Microsoft Research Collaborating with: Alex Szalay, Peter Kunszt, Ani
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
The New Economy in Indianapolis The New Economy in Indianapolis Rob Atkinson Vice President and Director, Technology and New Economy Project Progressive.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
ECMA Open XML File Formats and the Evolution of Open File Formats Mark Lange Senior Policy Counsel Microsoft EMEA.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
1 Preliminary results of the Environmental Data Exchange Network for Inland Waters (EDEN-IW) project Practical lessons. P. Haastrup.
Introduction to HTML, XHTML, and CSS
Astronomy Data Bases Jim Gray Microsoft Research.
40 Tips Leveraging the New APICS.org to the Benefit of Your Organization, Members, and Customers! 1.
Alexey Miroshnikov InfoStroy Ltd. Locatioin: St.Petersburg, Russia Established: 1990 APL: since 1979 First APL conference: 1990, Copenhagen People: 42+
Introduction Lesson 1 Microsoft Office 2010 and the Internet
Internet Applications
Services Course Office Web Apps Participant Guide.
Server Access The REST of the Story David Cleary
Computer Literacy BASICS
Why should I consider Implementing a Document Imaging / Management System? Created by Harold Hegerhorst North American Technology. LLC © North American.
Virtualization & Disaster Recovery
Creating and Capturing Customer Value
The Longevity Economy The Emerging Market in Plain Sight.
Gilbert Kalb, GMD-IK GMD was established in GMD is a non-profit company Owners are the Federal Republic of Germany (90%) and some federal states.
31242/32549 Advanced Internet Programming Advanced Java Programming
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Introduction to Computer Administration Introduction.
Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
Cloud Storage in Czech Republic Czech national Cloud Storage and Data Repository project.
1 Tracking Innovation in NC Patterns and Implications for NC's Eastern Region John Hardin, Executive Director NC Board of Science & Technology
Macromedia Dreamweaver MX 2004 – Design Professional Dreamweaver GETTING STARTED WITH.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
® Microsoft Office 2010 Browser and Basics.
Missouri Enterprise Helping Missouri Manufacturers Make More, Sell More, Earn More Missouri Manufacturer Survey: The Top Ten Things You Told Us.
ANSC644 Bioinformatics-Database Mining 1 ANSC644 Bioinformatics §Carl J. Schmidt §051 Townsend Hall §
DEV392: Extending SharePoint Products And Technologies Through Web Parts And ASP.NET Clint Covington, Program Manager Data And Developer Services - Office.
Project 1 Introduction to HTML.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Welcome Back. Microsoft Research Faculty Summit 2008.
The Internet and World Wide Web.  Understand how the Internet evolved  Describe common Internet communication methods and activities  Setting up your.
Alex Szalay, Jim Gray Analyzing Large Data Sets in Astronomy.
CS480 Computer Science Seminar Introduction to Microsoft Solutions Framework (MSF)
Sky Survey Database Design National e-Science Centre Edinburgh 8 April 2003.
WEB SERVER SOFTWARE FEATURE SETS
Grids 2003 The Great Academia/Industry Grid Debate Dan Fay | Microsoft Research Grid, grid, everywhere a Grid Blocking out the scenery, breaking my mind.
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Microsoft Research San Francisco (aka BARC: bay area research center) Jim Gray Researcher Microsoft Research Scalable servers Scalable servers Collaboration.
Applicatieplatform congres 12 & 13 maart. Microsoft Application Platform A Lifecycle View Sam Guckenheimer Group Product Planner Visual Studio Team System.
Building Enterprise Applications Using Visual Studio®
Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
BARC Scaleable Servers
Rick, the SkyServer is a website we built to make it easy for professional and armature astronomers to access the terabytes of data gathered by the Sloan.
Jim Gray Researcher Microsoft Research
Unit# 5: Internet and Worldwide Web
Jim Gray Microsoft Research
Presentation transcript:

MGB 2003 © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

Managing Innovation: How Microsoft Research Works MGB 2003 Managing Innovation: How Microsoft Research Works Jim Gray Distinguished Engineer Microsoft Corporation © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

Actionable Ideas Co-lo if possible Adopt a “university model” Recruit from the top Recruit for passion and a desire to have impact Install a Research Program Management organization to orchestrate tech-transfer Institute an annual TechFest

Innovation Build versus Buy versus Invest Build: Have in-house research Bell Labs, IBM, GM, Pfizer, Merc, Microsoft… Buy: Acquire startups or whole companies IBM, Cisco, Intel, Microsoft, Pfizer, Merc… Invest: All boats rise Government research funding All 3 approaches valid Complement one another

Companies Are Different IBM S G&A 23% Product 31% Gross 38% other 2% R&D 6% Intel R&D 15% S G&A 16% Product 19% Gross 50% Microsoft Gross 40% Product 18% S G&A 27% R&D 15% Oracle R&D 12% Gross 36% Product 26% S G&A Accenture Gross 32% Product 47% S G&A 21% R&D 0% HP S G&A 16% Product 44% Gross 27% other 7% R&D 6% Cisco S G&A 25% Product 33% Gross 26% R&D 16% DELL R&D 1% Gross 18% Product 73% S G&A 8% EDS S G&A 9% Product 69% Gross 8% other 14% R&D 0% Selected IT company FY02 R&D budgets: Notice that R&D is correlated with margin IBM and HP have large service revenues So, their “real” R&D investment rate is higher Dell, Accenture, EDS have modest R&D – innovate in other ways

Microsoft Is Different It is a software company: Almost entirely an IP company Margins on successful products are enormous The cost of failure is enormous – missed market It is BIG and so must look for BIG bets High-velocity business: Product mix shifts every decade. If you miss a shift, you are dead.

Most R&D Is D How to Do Basic Research in Industry Most R&D Is D How to Do Basic Research in Industry? Critical questions (from Rick Rashid) How can I create and maintain a world class research organization in an industrial setting? How do I keep the lines of communication open between product teams and researchers? How do I get new technology into products quickly?

Approach Adapt the Academic Model Organizational goal: Advance state of the art University organizational model Flat structure, critical mass groups Open research environment Aggressive publication in peer-reviewed literature Frequent visitors, daily seminars Strong ties to University Research Nearly 15% of basic research budget directly invested in Universities Lab grants, research grants, fellowships, etc. Hundreds of interns and visitors

Microsoft Research Today MGB 2003 Microsoft Research Today Founded in 1991 Staff of over 700 in over 55 areas Internationally recognized research teams Research lab locations : Redmond, Washington, 75% San Francisco, California 1% Cambridge, United Kingdom 10% Beijing, People’s Republic of China 10% Mountain View, California 5% Founded in 1991 to pursue technologies that are of strategic importance to MS’s future. We hire the best and the brightest and we’re all deeply dedicated to working closely with MS product groups © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

Microsoft Research Expanding the State of the Art Thousands of peer-reviewed publications 10%…30% of papers at our focus conferences graphics, programming, systems, data management… Community leadership Professional societies Journals Conferences Mentoring Interns Hosting academic summers and sabbaticals Special workshops

How To Build A Group Identify a promising area Hire the leader (internal or external) Support her/him Build team around senior researcher Look for people who Want to have impact Have passion for their ideas Same template works for whole labs Cambridge, Beijing, Silicon Valley

Keeping Open The Lines Of Communication To Product Teams Co-location helps: 75% “on campus” “How can I help?” attitude demonstrates willingness to “get dirty” to help product succeed Product group spin-offs build strong ties Over time a number of product groups evolved from research (e.g., Windows Media) Researchers involved in all corporate product reviews

MSR Relationship To MS Products Virtually every research group actively engaged with product groups E.G., Windows, Office, streaming media, SQL, Exchange, IIS, commerce server, visual studio, office, consumer products, MSN, etc. Tech transfer: Ideas Code People Contacts Recruiting

Focused Technology Transfer Quickly getting technology into products Program management team with sole focus on tech transfer Researchers on product “advisory” boards “Mind-swaps” – joint product/research off-sites Joint product/research teams, e.g., ClearType (Windows XP) Datamining (SQL 2000) Natural Language & Speech (Office) TabletPC Smart Personal Objects (SPOT) Encourage and recognize contributions

MSR Techfest Internal open house for Microsoft Research Annual event since 2001 ~ 7000 attendees 170 demos, 26 lectures “Research in progress” Breadboard demos This is research idea/prototype Great networking event: Breaks down barriers Serendipitous connections.

Examples Of Technology Transfer Critical support technologies Memory Optimization Technology enabled sim-ship of Win95/Office95 Automated bug detection in Windows 2000 Key technologies that drive products E.G., MS audio 4.0, ClearType, intelligent search, collaborative filtering, Intellimirror, etc. Incubated major products Windows streaming media Windows CE, TabletPC, eBook Ecommerce, Datamining Natural language and speech technologies, etc.

MSR Mission Statement Expand the state of the art in each of the areas in which we do research Rapidly transfer innovative technologies into Microsoft products Ensure that Microsoft products have a future

Personal Examples of R&D Scaleable Servers TerraServer SkyServer Databases Data Cube, Snapshot Isolation SQL Stress testing Reliable Multicast Personal Media Management

TerraServer & TerraService MGB 2003 TerraServer & TerraService TerraServer TerraService http://terraserver-usa.com USGS Photo and Topo maps 16TB of data Online since 1997 7 billon pages served 120 TB served Shows Scalability Availability Manageability SQL + Windows http://terraservice.net A .NET web service OpenGIS Place Search TerraServer Map Server Landmarks & annotations layered on imagery Used by thousands of real apps today Shows Web Services Performance © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

TerraServer Today

TerraServer Tomorrow Mirrored System versus SAN 3 mirrored DB servers + spare versus 4 DB servers Commodity versus Enterprise White box Dual Xeon versus 8-way branded DAS 250GB SATA versus FC-SAN 73GB SCSI No Tape versus LTO Tape Robot $0.1M versus $1.8M Geoplex: 2 sites You can afford 2! KVM / IP

World Wide Telescope http://www.voforum.org/ Premise: Most Astro data is online So, the Internet is the world’s best telescope: Has data on every part of the sky In every measured spectral band As deep as the best instruments It is up when you are up; the “seeing” is always great (no working at night, no clouds no moons no…) It’s a smart telescope: links objects and data to literature on them

Next-Generation Data Analysis Looking for Needles in haystacks – the Higgs particle Haystacks: Dark matter, Dark energy Needles are easier than haystacks Global statistics have poor scaling Correlation functions are N2, likelihood techniques N3 As data and computers grow at same rate, we can only keep up with N logN A way out? data is fuzzy, answers are approximate Requires combination of statistics and computer science

Data Federations Of Web Services Massive datasets live near their owners: Near the instrument’s software pipeline Near the applications Near data knowledge and curation Super Computer centers become Super Data Centers Each Archive publishes a web service Schema: documents the data Methods on objects (queries) Scientists get “personalized” extracts Uniform access to multiple Archives A common global schema Challenge: What is the object model for your science? Federation

Web Services – The Key? Web SERVER: Web SERVICE: Your program Web Service http Web SERVER: Given a url + parameters Returns a web page (often dynamic) Web SERVICE: Given a XML document (soap msg) Returns an XML document Tools make this look like an RPC. F(x,y,z) returns (u, v, w) Distributed objects for the web. + naming, discovery, security,.. Internet-scale distributed computing Web page Your program Web Service soap Data In your address space object in xml

Federating Astronomy Archives IRAS 25m Great Test for data mining algorithms It is real and well documented data High-dimensional data (with confidence intervals) Spatial data Temporal data Many different instruments from many different places and many different times Federation is a goal There is a lot of it (petabytes) Can share cross company University researchers 2MASS 2m DSS Optical IRAS 100m WENSS 92cm NVSS 20cm ROSAT ~keV GB 6cm

SkyServer – One such archive SkyServer.SDSS.org Sloan Digital Sky Survey Pixels + Data Mining 400 attributes per “object” Spectrograms for 1% Demo: pixel space record space set space teaching

SkyQuery: Federating Archives http://skyquery.net/ Distributed Query tool using a set of web services Federates ten astronomy archives from Pasadena, Chicago, Baltimore, Cambridge (England) Implemented in C# and .NET Allows queries like: SELECT o.objId, o.r, o.type, t.objId FROM SDSS:PhotoPrimary o, TWOMASS:PhotoPrimary t WHERE XMATCH(o,t)<3.5 AND AREA(181.3,-0.76,6.5) AND o.type=3 and (o.I - t.m_j)>2

SkyQuery Structure Portal Each SkyNode publishes Plans Query (2 phase) Integrates answers Is itself a web service Each SkyNode publishes Schema Web Service Database Web Service Image Cutout INT SDSS SkyQuery Portal FIRST 2MASS

Databases Theory to practice Data Cube Wrote paper SQL Server product and ISO Standard adopted idea Snapshot Isolation Paper in 1996 Product in 2004 old Reader version new

Databases Stress Test Generate millions of random SQL queries SqlServer Generate millions of random SQL queries Send them to 4 different products Compare the answers: If all agree, good! If not, a bug somewhere Found many bugs in DB products Much appreciated by MS DB group Tool cloned by other DB vendors DB2 = Oracle Informix

SQL Automated Test Example Four SQL systems on 2,000 statements Case W X Y Z 1672 1672 1672 1672 232 234 241 31 1 1 1 1 31 15 12 28 1 12 5 116 0 29 32 4 18 18 19 25 45 19 18 113 All four agree 84% W,X, and Y agree 95% Problem with intermediate table. Error

PGM Pretty Good Multicast Reliable multicast protocol Scales using hierarchy, suppression, and FEC “on-demand” (FEC on-demand is our contribution) Joint work with Cisco and others IETF standard Implemented prototype (Multicast PowerPoint) Shipped in Windows XP

MyLifeBits “A lifetime store of everything” The experiment: digitizing Gordon Bell’s life The software: Based on SQL server Tools to capture web pages, IM chats, TV, radio & telephone Reports, links, full text search, pivot by time or any other attribute

MyLifeBits Software Internet MyLifeBits store Radio capture tool Telephone capture tool PocketPC transfer tool PocketRadio player TV capture tool MyLifeBits store Internet Radio EPG tool TV EPG download tool MAPI interface Legacy email client Browser tool database files Legacy applications MyLifeBits Shell Voice annotation tool Text annotation tool

Research Failures Not everything is a success We had technology transfer failures We had projects with little impact Success and Failure depend on environment Even if you have a GREAT! idea There are many exogenous factors in technology transfer And, sometimes the idea or focus is wrong Allow people to fail once or twice.

Summary Actionable Ideas Co-lo if possible Adopt a “university model” Recruit from the top Recruit for passion and a desire to have impact Install a Research Program Management organization to orchestrate tech-transfer Institute an annual TechFest

© 2003 Microsoft Corporation. All rights reserved. MGB 2003 © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary. © 2003 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.