System Software OptIPuter System Software Andrew A. Chien SAIC Chair Professor, Computer Science and Engineering, UCSD Director, Center for Networked Systems.

Slides:



Advertisements
Similar presentations
-Grids and the OptIPuter Software Architecture Andrew A. Chien Director, Center for Networked Systems SAIC Chair Professor, Computer Science and Engineering.
Advertisements

Distributed Data Processing
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
COS 461 Fall 1997 Networks and Protocols u networks and protocols –definitions –motivation –history u protocol hierarchy –reasons for layering –quick tour.
Data Grids Jon Ludwig Leor Dilmanian Braden Allchin Andrew Brown.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Xingfu Wu Xingfu Wu and Valerie Taylor Department of Computer Science Texas A&M University iGrid 2005, Calit2, UCSD, Sep. 29,
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
8.
DWDM-RAM: DARPA-Sponsored Research for Data Intensive Service-on-Demand Advanced Optical Networks DWDM RAM DWDM RAM BUSINESS WITHOUT BOUNDARIES.
Distributed Virtual Computer (DVC): Simplifying the Development of High-Performance Grid Applications Nut Taesombut and Andrew A. Chien Department of Computer.
1 In VINI Veritas: Realistic and Controlled Network Experimentation Jennifer Rexford with Andy Bavier, Nick Feamster, Mark Huang, and Larry Peterson
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Protocols and the TCP/IP Suite Chapter 4 (Stallings Book)
Protocols and the TCP/IP Suite
Asper School of Business University of Manitoba Systems Analysis & Design Instructor: Bob Travica System architectures Updated: November 2014.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Globus Ian Foster and Carl Kesselman Argonne National Laboratory and University of Southern California
Chapter 9: Moving to Design
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Chapter 1: Hierarchical Network Design
Moving to Design.
Distributed Real-Time Systems for the Intelligent Power Grid Prof. Vincenzo Liberatore.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Feb 6-7, OptIPuter Software Research and Architecture Andrew A. Chien Computer Science and Engineering University of California, San Diego OptIPuter.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
IT Infrastructure Chap 1: Definition
DataGrid Middleware: Enabling Big Science on Big Data One of the most demanding and important challenges that we face as we attempt to construct the distributed.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
PPDG and ATLAS Particle Physics Data Grid Ed May - ANL ATLAS Software Week LBNL May 12, 2000.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Service Oriented Architectures Presentation By: Clifton Sweeney November 3 rd 2008.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
9 Systems Analysis and Design in a Changing World, Fourth Edition.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
The Globus Toolkit®: The Open Source Solution for Grid Computing
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Optical Architecture Invisible Nodes, Elements, Hierarchical, Centrally Controlled, Fairly Static Traditional Provider Services: Invisible, Static Resources,
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
CHAPTER 4 PROTOCOLS AND THE TCP/IP SUITE Acknowledgement: The Slides Were Provided By Cory Beard, William Stallings For Their Textbook “Wireless Communication.
7. Grid Computing Systems and Resource Management
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
TeraPaths: A QoS Enabled Collaborative Data Sharing Infrastructure for Petascale Computing Research The TeraPaths Project Team Usatlas Tier 2 workshop.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
RobuSTore: Performance Isolation for Distributed Storage and Parallel Disk Arrays Justin Burke, Huaxia Xia, and Andrew A. Chien Department of Computer.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
9 Systems Analysis and Design in a Changing World, Fifth Edition.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Chapter 1: Explore the Network
Clouds , Grids and Clusters
Introduction to Data Management in EGI
University of Technology
GGF15 – Grids and Network Virtualization
A GUI Based Aid for Generation of Code-Frameworks of TMOs
The Anatomy and The Physiology of the Grid
Presentation transcript:

System Software OptIPuter System Software Andrew A. Chien SAIC Chair Professor, Computer Science and Engineering, UCSD Director, Center for Networked Systems September 2003

System Software OptIPuter System Software Team Challenge –~20 Lead Researchers, Many More in Entire Team –Diverse Researcher Backgrounds and Focus –Broad Research Agenda, Abstract Shared Perspective Process –Innumerable Phone Calls and 1-on-1 Meetings, Fall 2002-Spring 2003 –Team Meeting with UCSD and UCI Teams (October 4, 2002) –Straw Man OptIPuter System Software Architecture (January 2003) –Goals, Context, Organization, Relationship of Efforts –OptIPuter All Hands Meeting, February 6-7, 2003 –First Presentation to Entire Team –Feedback, Revision, Improvement, Deeper Understanding, Shared Perspective –Optical Signalling and Network Management Meeting (May 22, 2003) –Mambretti Organized –OptIPuter Software Architecture Version 1.0 (July 2003) –Structure Stabilized, interfaces Becoming Concrete

System Software ’s Transform Distributed Systems Key Technology Changes –Massive Bandwidth – x Increases Wide-Area Systems –“End To End” -Connections –Private Networks, Guaranteed Bandwidth –Endpoints are Parallel Clusters –Large-Scale Network-Attached –Storage –Instruments –Displays –Other Peripherals –Grids and Flexible Wide-Area Sharing Opportunities –Communication –Tight Wide-area Resource Coupling –Simpler Distributed Applications –Proactive Computing and Communication Challenge is Abstractions, Technologies, and Protocols (SOFTWARE!) to Deliver these Capabilities to Applications

System Software Towards Middleware for -Networked Systems Fabric Resource Access and Control: Computers, Storage, Networks Connectivity Globus_IO/XIO & GSI Resource GRAM, GridFTP, GRIS, Co- allocation Collective DUROC, GARA, Replica Catalogs, Metadata Servers, Brokers, Workflow Application Leverage Investment and Capabilities (e.g. Globus 2.2 and 3.0) –Carl Kesselman OptIPuter Participant –Ian Foster, OptIPuter Frontier Advisory Board Explore What Must Change –New Software/Protocols for Managing Lambdas –Simplify, Deliver Higher Performance and New Capabilities Globus Architecture

System Software OptIPuter Software Architecture for Distributed Virtual Computers v1.1 Layer 4: XCP Node Operating Systems -configuration, Net Management Grid and Web Middleware – (Globus/OGSA/WebServices/J2EE) Physical Resources DVC #1 OptIPuter Applications DVC #2DVC #3 Layer 5: SABUL, RBUDP, Fast, GTP Real-Time Objects Security Models Data Services: DWTP Higher Level Grid Services Visualization DVC/ Middleware High-Speed Transport Optical Signaling/Mgmt

System Software OptIPuter Links Three Major Sets of Technology Activities Distributed Virtual Computers –Provide a Simple Abstractions –Aggregate Component Technology Capabilities –Surface Novel Capabilities High speed Transport Protocols [Bannister’s Talk] –Long Thread of High Bandwidth-Delay Product Network Protocols –Span The Range “Reach” For Dedicated Optical Connections –Complete Integration with IP Network Management –Hybrid – to Local Packet-Switched Networks –Separate – End-to-end Optical Network Signaling and Management [Mambretti’s Talk] –Single Domain and Inter-Domain –Hybrid Circuit and Packet-Switched Networks –Planning and Execution

System Software Distributed Virtual Computers

System Software Exploiting ’s for an Application Network View: Ad Hoc connections –Applications Request -Connections –Network Recognizes High BW flows and Configures System View: Enclave of Resources and Connections –a Distributed Virtual Computer (a SYSTEM) –How to Specify, Implement, and Exploit?

System Software DVC Examples Virtual Cluster (Hide Complexity of Grid; Resource Flexibility) –Shared Single Domain (Spans Multiple) –Private Connections; Simple Network Naming –Simple Resource Discovery and Access –Uniform Performance Characteristics –Direct Access to Everything (Storage, Displays, etc.) Real-Time Virtual Cluster for Distributed Collaborative Visualization –Grid Resources + Real-Time (TMO) Collaborative Visualization Cluster –Grid Resources + Photonic Multicast or LambdaRAM (Leigh) SIO/NCMIR UCI or UIC SDSC UCSD CSE

System Software Realizing Distributed Virtual Computers Research Challenges –Application-driven Definition of Abstractions –Useful Collections which Match Application Paradigms and Needs –Incorporates New Collective Models –DVC Description –Namespaces, Communication, Performance, Real-Time, … –Standard Specifications; Most Applications Parameterize –Integration Of Component Technologies Executing the DVC on a Grid –Planner That Identifies Resources –Selects from Virtual Grid Resources –Negotiates with Resource Managers and Brokers –Executor and Monitor for DVC –Acquires and Configures –Monitors for Failures and Performance –Adapts and Reconfigures

System Software OptIPuter Component Technologies

System Software Current Storage Views Network-attached Storage (NAS) –Filesystem protocols; Integrated Access-Control and Security –Low performance; Little Aggregation and Parallelism Grid View: High-Level Storage Federation –GridFTP (Distributed File Sharing) –GSI-based Access/Authentication –Put/Get, Third-Party Transfers, Whole File and Segments Single-System view: Lower-level storage federation –Secure Single System View –SAN – Block Level Disk and Controller Protocols –High Performance, Efficient sharing Research Areas –Network-Attached Secure Disk –Direct Access File Systems

System Software We Need a Distributed Storage Solution for e-Science Distributed Data Generators BIRN: Distributed Data, Intensive Analysis –100GB Data Elements; Petabyte Data Sets –Comparative and Collective Analysis across Data Elements –Visualization of Multi-Scale Data Objects

System Software Storage Research Directions From Performance to Performability –Manage and Exploit Multi-Latency Performance –Parallel Performance, Stability, and Isolation –Integration of Device, Network, Site Reliability Concerns OptIPuter Storage Directions –Application-Driven Design –Needs, Performance, Device/Site/Network Flexibility, Coding and Selection –Integrate Dynamic ’s and SAN Networks –Peering, Protocol Interfacing, Performance –Performance Robust Storage –Erasure/Other Redundancy; Large-Scale Parallelism; Statistical Approaches to Performance Isolation –Secure Shared Storage: Threshold Cryptography Approach

System Software OptIPuter Security Considerations OptIPuter as a Computing Platform –Information Assurance and Security Needed for Applications –Current Plan: use Globus Security Infrastructure OptIPuter as a Research Platform –Current Efforts –Distributed Security Services (Goodrich & Tamassia) –Incremental IP Trace-Back via Packet Marking for DOS Defense (Goodrich) –Enhanced Forensic Analysis By Design (Karin & Peisert) –Planned Efforts –Minimum Round Trip Latency Control (Goodrich) –Hardening Against Attacks by Multi-Path Routing (Goodrich, Karin) –End-to-End Application and Session Security Through Dedicated Lambdas (Karin) Source: Karin, UCSD and Goodrich, UCI

System Software Multi-Lambda Security Opportunities Security Frequently Defined Through Three Measures: –Integrity, Confidentiality, And Reliability (“Uptime”) Can These Measures be Enhanced by Employing Multiple Lambdas? Can Confidentiality be Improved by Dividing the Transmission Over Multiple Lambdas? –Fundamentally or Using “Cheap” Encryption? Can Integrity be Ensured or Reliability Improved by Exploiting Redundancy? –Source Coding and Performance –Adaptive Techniques Source: Goodrich, Karin

System Software Vision – Real-Time Tightly Coupled Wide-Area Distributed Computing Real- Time Object network Goals High-precision Timings of Critical Actions Tight Bounds on Response Times Ease of Programming –High-Level Prog –Top-Down Design Ease of Timing Analysis Dynamically formed Distributed Virtual Computer Source: Kim, UCI

System Software Real-Time: from LAN to WAN Time-Triggered Message-Triggered Object (TMO) Middleware Subsystem Model that can be Easily Implemented on Both Windows and Linux Platforms Developed a Global Time-Based Coordination for use in Fair and Efficient Distributed On-Line Game Systems and LAN Feasibility Demonstration –a Step towards Distributed OptIPuter Environment Demonstration –Paper will be Presented at IDPT 2003 Conference, December 2003 var TT Method 2 Service Method 1 TT Method 1 AAC        Compo- nents of a C++ object No thread, No priority High-level Programming Style Deadlines Service Method 2 Source: Kim, UCI

System Software TMO and OptIPuter Software TMO will be Integrated into the Overall OptIPuter Software Architecture Begin Design TMO Programming Framework for the OptIPuter Prototype Implementation TMO Support on Linux Platforms, Including OptIPuter Visualization Cluster (UIC – Leigh, UCI -- Jenks) Kernel TMOSM FT Support Middleware Lambda mux / demux Kernel TMOSM FT Support Middleware Lambda mux / demux data " Let us start a chorus at 2pm " " e-Science " An API Wrapping the Services of the RT Middleware Enables High-Level RT Programming Without a new Compiler Source: Kim, UCI

System Software Prophesy: Application Performance Modeling Performance Modeling of Applications on OptIPuter Cross Platform Comparison (vs. Traditional Grid & Parallel) Yr1: Completed Data Analysis Module Yr2: Work with Applications and High Speed Transport Protocols Target applications include: –SIO Geophysical Data Visualization –NCMIR/BIRN Neuroscience Applications Source: Taylor, TAMU Web-based GUI Profiling & Instrumentation Actual Execution Performance Database Template Database Systems Database Model Builder Symbolic Predictor DATA COLLECTION DATABASES DATA ANALYSIS

System Software Summary OptIPuter System Software Team Organization –Development of a Concrete, Shared Perspective –Organization into Tightly-Coupled Teams OptIPuter Software Architecture 1.0 (July 2003) –Provides Focus on Key Problems, Clusters Related Activities –Framework for Integrating Diverse Capabilities, Identifying Gaps, Integrating and Delivering Solutions Research Activity Clusters –Distributed Virtual Computers –Including Real-Time, Security, Storage, Performance Modeling –High Speed Transport Protocols –Optical Signaling and Network Management