JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Workshop.

Slides:



Advertisements
Similar presentations
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Advertisements

Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.
P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
High Performance Computing Course Notes Grid Computing.
Reliability on Web Services Presented by Pat Chan 17/10/2005.
1 On Death, Taxes, & the Convergence of Peer-to-Peer & Grid Computing Adriana Iamnitchi Duke University “Our Constitution is in actual operation; everything.
Extensible Networking Platform IWAN 2005 Extensible Network Configuration and Communication Framework Todd Sproull and John Lockwood
Distributed components
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Other File Systems: AFS, Napster. 2 Recap NFS: –Server exposes one or more directories Client accesses them by mounting the directories –Stateless server.
Based on last years lecture notes, used by Juha Takkinen.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
1 Client-Server versus P2P  Client-server Computing  Purpose, definition, characteristics  Relationship to the GRID  Research issues  P2P Computing.
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
1 The Google File System Reporter: You-Wei Zhang.
Wide-Area Cooperative Storage with CFS Robert Morris Frank Dabek, M. Frans Kaashoek, David Karger, Ion Stoica MIT and Berkeley.
Cli/Serv.: JXTA/151 Client/Server Distributed Systems v Objective –explain JXTA, a support environment for P2P services and applications ,
RUNNING PARALLEL APPLICATIONS BEYOND EP WORKLOADS IN DISTRIBUTED COMPUTING ENVIRONMENTS Zholudev Yury.
Distributed Systems Concepts and Design Chapter 10: Peer-to-Peer Systems Bruce Hammer, Steve Wallis, Raymond Ho.
Checkpoint & Restart for Distributed Components in XCAT3 Sriram Krishnan* Indiana University, San Diego Supercomputer Center & Dennis Gannon Indiana University.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
LEGO – Rennes, 3 Juillet 2007 Deploying Gfarm and JXTA-based applications using the ADAGE deployment tool Landry Breuil, Loïc Cudennec and Christian Perez.
Deploying DIET and JuxMem: GoDIET + JDF Mathieu Jan PARIS Research Group IRISA INRIA & ENS Cachan / Brittany Extension Rennes Lyon, July 2004.
1 A P2P Collaborative System Using JXTA Hosei Graduation School ITPC 02R3315 Katsuhiro CHIBA.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Large-scale Deployment in P2P Experiments Using the JXTA Distributed Framework Gabriel Antoniu, Luc Bougé, Mathieu Jan & Sébastien Monnet PARIS Research.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
Februar 17, 2006GDS meeting - LIP1 MOve: an application-Malleable Overlay UIUC / INRIA Collaboration.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Grid Data.
Building Hierarchical Grid Storage Using the GFarm Global File System and the JuxMem Grid Data-Sharing Service Gabriel Antoniu, Lo ï c Cudennec, Majd Ghareeb.
The JuxMem-Gfarm Collaboration Enhancing the JuxMem Grid Data Sharing Service with Persistent Storage Using the Gfarm Global File System Gabriel Antoniu,
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Dr. Ian Wang Cardiff University, U.K.. Ian Wang, Cardiff University Peer-to-Peer and Grids What does the Peer-to-Peer paradigm offer Grids? Scalable Decentralized.
Towards high-performance communication layers for JXTA on grids Mathieu Jan GDS meeting, Lyon, 17 February 2006.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Latest news on JXTA and JuxMem-C/DIET Mathieu Jan GDS meeting, Rennes, 11 march 2005.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
GLOBE DISTRIBUTED SHARED OBJECT. INTRODUCTION  Globe stands for GLobal Object Based Environment.  Globe is different from CORBA and DCOM that it supports.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
University of Pennsylvania 7/15/98 Asymmetric Bandwidth Channel (ABC) Architecture Insup Lee University of Pennsylvania July 25, 1998.
Going Large-Scale in P2P Experiments Using the JXTA Distributed Framework Mathieu Jan & Sébastien Monnet Projet PARIS Paris, 13 February 2004.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 1: Characterization of Distributed & Mobile Systems Dr. Michael R.
18-1 Summary (Day 2) Learning Summary – What is JXTA ? – Understand the fundamental concepts of JXTA – Learn about the various implementations of.
7. Grid Computing Systems and Resource Management
Peer to Peer Network Design Discovery and Routing algorithms
Performance Evaluation of JXTA-* Communication Layers Mathieu Jan PARIS Research Group Paris, November 2004.
Making a DSM Consistency Protocol Hierarchy-Aware: An Efficient Synchronization Scheme Gabriel Antoniu, Luc Bougé, Sébastien Lacour IRISA / INRIA & ENS.
A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.
1 VLDB - Data Management in Grids B. Del-Fabbro, D. Laiymani, J.M. Nicod and L. Philippe Laboratoire d’Informatique de l’Université de Franche-Comté Séoul,
Bruce Hammer, Steve Wallis, Raymond Ho
November, 19th GDS meeting, LIP6, Paris 1 Hierarchical Synchronization and Consistency in GDS Sébastien Monnet IRISA, Rennes.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
Distributed Systems Architecure. Architectures Architectural Styles Software Architectures Architectures versus Middleware Self-management in distributed.
GGF 17 - May, 11th 2006 FI-RG: Firewall Issues Overview Document update and discussion The “Firewall Issues Overview” document.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
Data Management on Opportunistic Grids
Introduction to Distributed Platforms
Introduction to Load Balancing:
Distributed DBMS Concepts of Distributed DBMS
Plethora: Infrastructure and System Design
Distributed File Systems
Presentation transcript:

JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Workshop on Adaptive Grid Middleware New Orleans, September 2003

2 Context: Data Management on the Grid Distributed numerical simulations (code coupling) Problem: data management Solid mechanics Thermodynamics Optics Dynamics Satellite design

3 Existing Data Management Systems Non-transparent large scale data management GridFTP (Globus) and MPI-IO Security, heterogeneity Internet Backplane Protocol (IBP) Control Explicit transfer No consistency guarantee

4 Existing Data Management Systems Transparent small-scale data management Distributed shared memory (DSM) Transparent access Transparent data localization Consistency models and protocols Static, homogeneous architecture

5 Another Approach: Peer-to-Peer Systems Peer-to-peer systems (P2P) Distributed (large-scale) Volatile peers Peers have the same capacities and responsibilities Sharing immutable data Centralized (Napster) Flooding (Gnutella, KaZaA) Distributed hash table (CFS, PAST) Sharing mutable data One writer per data, static assumptions (OceanStore) Manual conflict resolution (Ivy)

6 DSM systems and P2P systems Comparing basic hypotheses DSMP2P Scale DynamicityNullHigh Resource homogeneity Homogeneous (clusters) Heterogeneous (Internet) Control and trust HighLow TopologyFlat Data typeMutableImmutable Typical applications Scientific computation File sharing and storage

7 Idea: Data Sharing Service Proposal: hybrid approach DSM systems: consistency and transparent access P2P systems: scalability and high dynamicity DSMGrid Data ServiceP2P Scale DynamicityNullMediumHigh Resource homogeneity Homogeneous (clusters) Rather heterogeneous (clusters of clusters) Heterogeneous (Internet) Control and trust HighMediumLow TopologyFlatHierarchicalFlat Data typeMutable Immutable Typical applications Scientific computation Scientific computation and data storage File sharing and storage

8 A Data Sharing Service for the Grid Internet Persistence

9 A Data Sharing Service for the Grid Internet Data transfer ? Transparent data location

10 A Data Sharing Service for the Grid Internet Scalability Internet

11 A Data Sharing Service for the Grid Internet Volatility tolerance

12 JXTA: a Framework for P2P Services Open-source platform for programming P2P applications A peer Uniquely identified (ID) Address independent of physical location Multiple network access points (TCP, HTTP, etc) Peer Firewall Peer TCP/IP HTTP Peer ID Firewall

13 JXTA: Peer Groups Set of peers that share a common set of interests Specific management policy Peer group services Peer ID NetPeerGroup PeerGroupA PeerGroupB

14 JXTA: Advertisements Every resource is described by an advertisement Peers Peer groups Communication channels Services … PeerGroup Advertisement: urn:jxta: uuid- BCBCDEABDBBBABEABBBABA urn:jxta:uuid- BFEFDEDFBABAFRUDBACE My Group This group is to be used for my own testing

15 JuxMem: an Architecture Proposal juxmem group cluster A group cluster B group cluster C group data group Physical architecture Logical architecture

16 JuxMem API Alloc (size, attribs) Map (id, attribs) Put (id, value) Get (id) Lock (id) Unlock (id)

17 Managing Memory Resources cluster group juxmem group Size: 8 MB Memory provided Provider advertisements: cluster group Cluster advertisements: juxmem group

18 Allocation: How Does It Work? 2 1 3a 3b MB?

19 Managing Shared Data Blocks Allocate a memory block = create a data group Data blocks identified by the ID of the peer group Transparent access for clients via data ID Consistency Data blocks replicated on providers Simultaneous updates (logical multicast) Clients are not notified of updates Synchronization One lock per data block Other mechanisms: in progress

20 Handling Peer Volatility Provider volatility A manager per peer group (cluster and data) Dynamic monitoring of available peers (cluster) Automatic replication of data blocks (data) Manager volatility Periodic exchange of heartbeats Dynamic replication of managers if needed on other peers

21 Implementation and Preliminary Evaluation Implementation JXTA service, 5000 Java code lines Experimental setup PentiumII: 450 Mhz and 256 MB of RAM FastEthernet 100 Mb/s Linux 2.4 Number of nodes: 20 Experiment Study provider volatility

22 Study: Provider Volatility (1) juxmem group cluster group data group Data size: one byte Replication degree = 3 Data manager not killed 1 client: 100 iterations lock-put-unlock 16 providers

23 Study: Provider Volatility (2) juxmem group cluster group data group 1 client: 100 iterations lock-put-unlock 16 providers Data size: one byte Replication degree = 3 Data manager not killed

24 Study: Provider Volatility (3) Internal locking during replication Guarantee consistency during replica creation Client is blocked juxmem group cluster group data group

25 Study: Provider Volatility (4) JXTA/Java Expensive underlying JXTA-level dynamic channel management Reconfiguration time  11 seconds Targeted volatility is weaker ( >> 80 seconds)

26 Conclusion A hierarchical architecture for a data sharing service for the grid Hybrid approach: DSM and P2P systems Transparent access to data blocks Persistent storage Mutable data: consistency guarantees Active support for peer volatility

27 Ongoing Work Studies Replication strategies for fault tolerance Consistency protocols in a dynamic environment Co-scheduling computation and data distribution Manage data-data affinity Integrate high-speed networks: Myrinet, SCI. Goal: build a Grid Data Service GDS project: Extensive evaluation on realistic codes