Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.

Slides:



Advertisements
Similar presentations
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Advertisements

Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Distributed Systems 1 Topics  What is a Distributed System?  Why Distributed Systems?  Examples of Distributed Systems  Distributed System Requirements.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Distributed components
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Chapter 12 Distributed Database Management Systems
Introduction to client/server architecture
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
Network File System (NFS) in AIX System COSC513 Operation Systems Instructor: Prof. Anvari Yuan Ma SID:
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
1 The Google File System Reporter: You-Wei Zhang.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
RUNNING PARALLEL APPLICATIONS BEYOND EP WORKLOADS IN DISTRIBUTED COMPUTING ENVIRONMENTS Zholudev Yury.
Distributed Systems Concepts and Design Chapter 10: Peer-to-Peer Systems Bruce Hammer, Steve Wallis, Raymond Ho.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Workshop.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
LEGO – Rennes, 3 Juillet 2007 Deploying Gfarm and JXTA-based applications using the ADAGE deployment tool Landry Breuil, Loïc Cudennec and Christian Perez.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Large-scale Deployment in P2P Experiments Using the JXTA Distributed Framework Gabriel Antoniu, Luc Bougé, Mathieu Jan & Sébastien Monnet PARIS Research.
Distributed File Systems Overview  A file system is an abstract data type – an abstraction of a storage device.  A distributed file system is available.
1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Grid Data.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Building Hierarchical Grid Storage Using the GFarm Global File System and the JuxMem Grid Data-Sharing Service Gabriel Antoniu, Lo ï c Cudennec, Majd Ghareeb.
The JuxMem-Gfarm Collaboration Enhancing the JuxMem Grid Data Sharing Service with Persistent Storage Using the Gfarm Global File System Gabriel Antoniu,
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
MapReduce and GFS. Introduction r To understand Google’s file system let us look at the sort of processing that needs to be done r We will look at MapReduce.
Wide-Area Service Composition: Performance, Availability and Scalability Bhaskaran Raman SAHARA, EECS, U.C.Berkeley Presentation at Ericsson, Jan 2002.
Ch 10 Shared memory via message passing Problems –Explicit user action needed –Address spaces are distinct –Small Granularity of Transfer Distributed Shared.
Latest news on JXTA and JuxMem-C/DIET Mathieu Jan GDS meeting, Rennes, 11 march 2005.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
Architecture Models. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
GFS. Google r Servers are a mix of commodity machines and machines specifically designed for Google m Not necessarily the fastest m Purchases are based.
Going Large-Scale in P2P Experiments Using the JXTA Distributed Framework Mathieu Jan & Sébastien Monnet Projet PARIS Paris, 13 February 2004.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 1: Characterization of Distributed & Mobile Systems Dr. Michael R.
7. Grid Computing Systems and Resource Management
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Making a DSM Consistency Protocol Hierarchy-Aware: An Efficient Synchronization Scheme Gabriel Antoniu, Luc Bougé, Sébastien Lacour IRISA / INRIA & ENS.
1 VLDB - Data Management in Grids B. Del-Fabbro, D. Laiymani, J.M. Nicod and L. Philippe Laboratoire d’Informatique de l’Université de Franche-Comté Séoul,
© 2008 by Wind River; made available under the EPL v1.0 | 19-Nov-2008 TCF The Target Communication Framework Michael Scharf, Wind River wiki.eclipse.org/DSDP/TM/TCF_FAQ.
November, 19th GDS meeting, LIP6, Paris 1 Hierarchical Synchronization and Consistency in GDS Sébastien Monnet IRISA, Rennes.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
Distributed Systems Architectures. Topics covered l Client-server architectures l Distributed object architectures l Inter-organisational computing.
Replication & Fault Tolerance CONARD JAMES B. FARAON
Introduction to Distributed Platforms
Definition of Distributed System
A Survey on Distributed File Systems
Providing Secure Storage on the Internet
Outline Midterm results summary Distributed file systems – continued
Distributed Systems Bina Ramamurthy 11/30/2018 B.Ramamurthy.
Distributed Systems Bina Ramamurthy 12/2/2018 B.Ramamurthy.
CLUSTER COMPUTING.
Distributed File Systems
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003

2 Why Are DSM Systems Interesting? Allow to share mutable data in a distributed environment Transparent access (local/remote) Transparent data localization (remote) (Lots of) consistency models and protocols Node 0Node 1 Migration ? Replication ?

3 What Do DSM Systems Usually Assume? Protocols: implicit hypotheses! Static configuration (number of nodes) Every node knows every node No node failure No dynamic node departure/arrival Homogeneous architecture (processors, OS) Designed for small-scale environment Typically clusters of workstations Node 0Node 1 Migration ? Replication ?

4 Challenge: Data Sharing on the Grid (1) Distributed numerical simulations (code coupling) Solid mechanics Thermodynamics Optics Dynamics Satellite design

5 Challenge: Data Sharing on the Grid (2) Challenge for DSM systems: get larger! Need to integrate new hypotheses Scalability Fault tolerance Dynamicity Heterogeneity Just the opposite of DSM systems!

6 Large-Scale Data Sharing: Peer-to-Peer (P2P) Systems Client Internet server Client Cache Proxy server Congestion zone Client/ Server server Client/ Server Congestion zone Client/server model Peer-to-peer model

7 Data Sharing at a Large Scale: Peer-to-Peer Systems Features: Excellent scalability: millions of nodes High volatility tolerance But: Sharing read-only data Few exceptions: Oceanstore, Ivy, etc. Question: What consistency models and protocols for a large scale, dynamic environment?

8 DSM Systems vs. P2P Systems DSMP2P Scale DynamicityNullHigh Resource homogeneity Homogeneous (clusters) Heterogeneous (Internet) Control and trust HighLow TopologyFlat Data typeMutableImmutable Typical applications Scientific computation File sharing and storage

9 Data Sharing: the Gap! DSM Small-scale Static Homogeneous P2P Large-scale Dynamic Heterogeneous 10 3 ?

10 Idea: Hybrid Approach DSM systems: consistency and transparent access P2P systems: scalability and high dynamicity DSMData Sharing ServiceP2P Scale DynamicityNullMediumHigh Resource homogeneity Homogeneous (clusters) Rather heterogeneous (clusters of clusters) Heterogeneous (Internet) Control and trust HighMediumLow TopologyFlatHierarchicalFlat Data typeMutable Immutable Typical applications Scientific computation Scientific computation and data storage File sharing and storage

11 Why Such a Service? Data sharing service for ASP environments Persistent data Transparent localization Consistency Automatic redistribution AGENT(s) S1 S2 S3S4 Client A, B, C Answer (C) S2 ! Request Op1(C, A, B) Server Op2(C, A, B) Data Sharing Service

12 A Data Sharing Service for the Grid Internet Persistence

13 A Data Sharing Service for the Grid Internet Data transfer ? Transparent data location

14 A Data Sharing Service for the Grid Internet Scalability Internet

15 A Data Sharing Service for the Grid Internet Volatility tolerance

16 JXTA: a Framework for P2P Services Open-source platform for programming P2P applications Peer Uniquely identified (ID) Address independent of physical location Multiple network access points (TCP, HTTP, etc) Peer group Peer Firewall Peer TCP/IP HTTP Peer ID Firewall

17 JuxMem: an Architecture Proposal juxmem group cluster A group cluster B group cluster C group data group Physical architecture Logical architecture

18 JuxMem API Alloc (size, attribs) Map (id, attribs) Put (id, value) Get (id) Lock (id) Unlock (id)

19 Managing Shared Data Blocks Allocate a memory block = create a data group Data blocks replicated on providers Data blocks identified by the ID of the peer group Transparent access for clients via data ID Consistency Current model: SC Assume data-race-free programs Simultaneous updates of replicas (logical multicast) Clients are not notified of updates Synchronization One lock per data block Other mechanisms: in progress

20 Handling Peer Volatility Provider volatility A manager per peer group Dynamic monitoring of available peers (cluster) Automatic replication of data blocks (data) Manager volatility Periodic exchange of heartbeats Dynamic replication of managers if needed juxmem group cluster A group cluster B group cluster C group data group

21 Implementation and Preliminary Evaluation Implementation JXTA service, 5000 Java code lines Experimental setup PentiumII: 450 Mhz and 256 MB of RAM FastEthernet 100 Mb/s Linux 2.4 Number of nodes: 20 Experiment Study provider volatility

22 Study: Provider Volatility (1) juxmem group cluster group data group Data size: one byte Replication degree = 3 Data manager not killed 1 client: 100 iterations lock-put-unlock 16 providers

23 Study: Provider Volatility (2) juxmem group cluster group data group 1 client: 100 iterations lock-put-unlock 16 providers Data size: one byte Replication degree = 3 Data manager not killed

24 Study: Provider Volatility (3) Internal locking during replication Guarantee consistency during replica creation Client is blocked juxmem group cluster group data group

25 Study: Provider Volatility (4) JXTA/Java Expensive underlying JXTA-level dynamic channel management Reconfiguration time  11 seconds Targeted volatility is weaker ( >> 80 seconds)

26 Summary A hierarchical architecture for a data sharing service for the grid Hybrid approach: DSM and P2P systems Transparent access to data blocks Persistent storage Mutable data SC memory model for DRF access Active support for peer volatility

27 Ongoing Work Studies Replication strategies for fault tolerance Consistency protocols in a dynamic environment Co-scheduling computation and data distribution Manage data-data affinity Integrate high-speed networks: Myrinet, SCI Goal: build a Grid Data Service GDS project: Extensive evaluation on realistic codes Actual execution: 100 nodes Simulation: 1,000-10,000 nodes

28 Questions?

29 Managing Memory Resources cluster group juxmem group Size: 8 MB Memory provided Provider advertisements: cluster group Cluster advertisements: juxmem group

30 Allocation: How Does It Work? 2 1 3a 3b MB?