Private & Confidential

Parallax - A New Operating System for Scalable, Distributed, and Parallel Computing
DIME Network Architecture (DNA) for a New Generation of Many-core Computing

Dr. Rao Mikkilineni, Kawa Objects
Ian Seyler, Return Infinity
May 16, 2011
SMTPS11

Agenda
- The hardware upheaval and the von Neumann bottleneck
- A possible solution using a parallel DIME™ network computing model with telecom-grade trust
- Parallax: a potentially new operating system (OS)
- Proof-of-concept demo

The history of the evolution of current OSs is filled with lessons on wasted billions (does anyone remember Multics or OS/2?), unmet expectations (who would have thought UNIX, the original System V, would vanish), surprise winners (Windows and Linux), and stealthy survivors (Mach in a Mac).

Many-core Servers
- SeaMicro: custom servers with 512 1.66 GHz 64-bit x86 Intel Atom cores in 10 RU; 2,048 CPUs per rack
- Calxeda: highly integrated Server-on-Chip built around a new-generation ARM processor; 480 cores
- Silicon Graphics Altix UV: 2,048 cores and 16 TB of memory per Single System Image; scales to 32,768 processor sockets, providing up to 262,144 Intel Xeon cores (8 cores per socket)
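
The core counts quoted on this slide can be checked with a few lines of arithmetic. The sketch below is illustrative only. The figure of four SeaMicro chassis per rack is an assumption, inferred from 2,048 CPUs per rack divided by 512 cores per 10 RU chassis; the slide does not state it.

```python
# Figures taken from the slide.
ALTIX_MAX_SOCKETS = 32_768
ALTIX_CORES_PER_SOCKET = 8
SEAMICRO_CORES_PER_CHASSIS = 512
SEAMICRO_CHASSIS_HEIGHT_RU = 10

# Assumption (not stated on the slide): four 10 RU chassis fit in one rack.
SEAMICRO_CHASSIS_PER_RACK = 4

altix_max_cores = ALTIX_MAX_SOCKETS * ALTIX_CORES_PER_SOCKET
seamicro_cores_per_rack = SEAMICRO_CORES_PER_CHASSIS * SEAMICRO_CHASSIS_PER_RACK
seamicro_rack_units_used = SEAMICRO_CHASSIS_HEIGHT_RU * SEAMICRO_CHASSIS_PER_RACK

print(altix_max_cores)           # 262144, matching the slide
print(seamicro_cores_per_rack)   # 2048, matching the slide
print(seamicro_rack_units_used)  # 40 RU under the stated assumption
```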

Hardware Upheaval and von Neumann Bottleneck
- Network infrastructure with complex management systems
- Layers of management infrastructure
- Up to 46,080 processing cores or 29.8 petabytes of storage per container
- Running an OS that cannot see beyond tens of cores
- Operating system gap: no operating system provides application-centric resource management in real time
[Figure labels: 512 Cores, 480 Cores]

Current Economics of IT
[Chart: % of TCO over five years; values shown: $61.2M, $31.6M]
Hardware upheaval is not matched by software innovation!

SPC Element Network & von Neumann Bottleneck

Distributed application, serial processing:
- Service regulation executable instructions and service package executable instructions are interleaved (...code... ...mngt code... ...code... ...mngt code...)
- Network, storage, virtualization, application, etc. management

Distributed Intelligent Managed Element (DIME) network:
- Parallel FCAPS* management of the stored program computing (SPC) element using a signaling channel (...mngt code...): real-time application management (provisioning, monitoring & control)
- Managed Intelligent Computing Element (...code...): the application, a service component in a distributed workflow
1. Signaling and self-management of the node
2. Workflow with DIME network management

* FCAPS: Fault, Configuration, Accounting, Performance and Security (node and network)

[Demo labels: Hello World, Stop, Start]

End-to-end distributed transaction response is no longer controlled by the individual node OS in a shared resource environment.
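
The separation this slide describes, where management commands travel on a signaling channel in parallel with the running service, can be illustrated with a small sketch. This is not the Parallax implementation. The class `Dime`, its methods, and the command names `start`, `stop`, and `shutdown` are hypothetical, and Python threads stand in for the cores a real DIME would use.

```python
import queue
import threading
import time


class Dime:
    """Toy DIME: a service worker plus a separate signaling channel (hypothetical)."""

    def __init__(self, name, service):
        self.name = name
        self.service = service            # the service package (any callable)
        self.signals = queue.Queue()      # signaling channel, separate from the work
        self.running = threading.Event()  # set while the service may execute
        self.output = []                  # results produced by the service
        self.events = []                  # log of management commands handled
        self._alive = True
        self._manager = threading.Thread(target=self._manage, daemon=True)
        self._worker = threading.Thread(target=self._work, daemon=True)
        self._manager.start()
        self._worker.start()

    def signal(self, command):
        """Send a management command and wait until the manager has applied it."""
        done = threading.Event()
        self.signals.put((command, done))
        done.wait()

    def _manage(self):
        # Management loop: runs in parallel with the service, never inside it.
        while self._alive:
            command, done = self.signals.get()
            self.events.append(command)
            if command == "start":
                self.running.set()
            elif command == "stop":
                self.running.clear()
            elif command == "shutdown":
                self.running.clear()
                self._alive = False
            done.set()

    def _work(self):
        # Service loop: executes only while the manager allows it.
        while self._alive:
            if self.running.wait(timeout=0.01):
                self.output.append(self.service())
                time.sleep(0.01)

    def join(self):
        self._manager.join()
        self._worker.join()


def demo():
    dime = Dime("worker-1", lambda: "Hello World")
    dime.signal("start")
    time.sleep(0.1)               # service runs
    dime.signal("stop")
    time.sleep(0.05)              # let any in-flight call finish
    paused_count = len(dime.output)
    time.sleep(0.1)               # service stays paused
    still_paused_count = len(dime.output)
    dime.signal("shutdown")
    dime.join()
    return paused_count, still_paused_count, dime.events
```

Calling `demo()` starts the Hello World service, stops it through the signaling channel, and confirms that no further output appears while it is stopped. The service code itself contains no management logic, which is the point of the contrast with the interleaved serial stream.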

DIMEs in a Multi-Core Server

[Figure: memory maps of Physical Server 1 (repeated for Server 1, Server 2, and Server 3, each running applications A and B). Legend: P = Parallax OS, A and B = applications, F = free memory, S = shared memory. Other labels: MICE, Signaling, I/O, Network, DIME Sub-network Managers, FCAPS, Run-time Orchestrator, Linux, Service (Service Regulator and Service Package)]

Proof-of-concept features:
- DIME instantiation
- Discovery
- Workflow orchestration
- Scaling
- Dynamic reconfiguration
- Fault management

DNA in a Multi-core Server

- von Neumann, J., "The General and Logical Theory of Automata", in A. H. Taub (Ed.), John von Neumann Collected Works, Vol. 5, p. 259. Chicago: University of Illinois Press (1951).
- George B. Dyson, "Darwin among the Machines: The Evolution of Global Intelligence", Helix Books, Addison-Wesley Publishing Company, Inc., Reading, MA, 1997, p. 123.

Service Deployment

[Figure labels: F (x3), DIME Sub-network Managers, FCAPS, Run-time Orchestrator, Linux, Network, Hello World]

Roles:
- Service Component Developer (service creation)
- Service Workflow Creator (service delivery)
- Service Control Manager (service assurance)

Workers: Node 1 Worker 1, Node 1 Worker 2, Node 2 Worker 1, Node 2 Worker 2, Node 3 Worker 1, Node 3 Worker 2
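
A minimal sketch of the deployment flow on this slide, assuming three nodes with two workers each as shown. The `Orchestrator` class and its methods `discover`, `deploy`, and `report_fault` are hypothetical names chosen to mirror the discovery, service delivery, and service assurance roles; the actual run-time orchestrator is not described in enough detail here to reproduce.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Worker:
    node: int
    slot: int
    healthy: bool = True
    service: Optional[str] = None  # name of the service currently assigned


class Orchestrator:
    """Toy run-time orchestrator (hypothetical): places services on free workers."""

    def __init__(self, nodes: int = 3, workers_per_node: int = 2):
        self.workers: List[Worker] = [
            Worker(node, slot)
            for node in range(1, nodes + 1)
            for slot in range(1, workers_per_node + 1)
        ]

    def discover(self) -> List[Worker]:
        """Discovery: healthy workers with no service assigned."""
        return [w for w in self.workers if w.healthy and w.service is None]

    def deploy(self, service: str, copies: int) -> List[Worker]:
        """Service delivery: assign the service to the first free workers."""
        free = self.discover()
        if len(free) < copies:
            raise RuntimeError("not enough free workers")
        chosen = free[:copies]
        for worker in chosen:
            worker.service = service
        return chosen

    def report_fault(self, node: int, slot: int) -> Optional[Worker]:
        """Service assurance: retire a failed worker and move its service."""
        failed = next(w for w in self.workers if (w.node, w.slot) == (node, slot))
        failed.healthy = False
        service, failed.service = failed.service, None
        if service is None:
            return None
        return self.deploy(service, 1)[0]


orchestrator = Orchestrator()
placed = orchestrator.deploy("Hello World", copies=4)
replacement = orchestrator.report_fault(node=1, slot=2)
print([(w.node, w.slot) for w in placed])   # [(1, 1), (1, 2), (2, 1), (2, 2)]
print((replacement.node, replacement.slot)) # (3, 1)
```

Here the fault on Node 1 Worker 2 causes its Hello World instance to be redeployed on Node 3 Worker 1, leaving one free worker. A real orchestrator would also have to carry state and use the signaling channel, which this sketch leaves out.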

Lessons From Biology

"The basic principle of dealing with malfunctions in nature is to make their effect as unimportant as possible and to apply correctives, if they are necessary at all, at leisure. In our dealings with artificial automata, on the other hand, we require an immediate diagnosis. Therefore, we are trying to arrange the automata in such a manner that errors will become as conspicuous as possible, and intervention and correction follow immediately."
--- John von Neumann, "The General and Logical Theory of Automata", John von Neumann Collected Works, Edited by A. H. Taub, Volume 5, p. 289 (Hixon Symposium 1948)

"It's very likely that on the basis of philosophy that every error has to be caught, explained, and corrected, a system of the complexity of the living organism would not run for a millisecond."
--- von Neumann, Theory of Self-Reproducing Automata (1948) at the Hixon Symposium, Pasadena, California

Questions?