Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.

Slides:



Advertisements
Similar presentations
Copyright © 2007, GemStone Systems Inc. All Rights Reserved. Optimize computations with Grid data caching OGF21 Jags Ramnarayan Chief Architect, GemFire.
Advertisements

Database Tuning Principles, Experiments and Troubleshooting Techniques Baseado nos slides do tutorial com o mesmo nome da autoria de: Dennis Shasha
Dynamic Data Partitioning for Distributed Graph Databases Xavier Martínez Palau David Domínguez Sal Josep Lluís Larriba Pey.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.10/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed DBMS©M. T. Özsu & P. Valduriez Ch.15/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed DBMSPage 6. 1© 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database Design.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.6/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed DBMS © M. T. Özsu & P. Valduriez Ch.7/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Pertemuan <<12>> Paralel dan Basis Data Terdistribusi
Physical Database Design Data Migration/Conversion.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Query Evaluation Techniques for Cluster Database Systems Andrey V. Lepikhov, Leonid B. Sokolinsky South Ural State University Russia 22 September 2010.
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMS© 2001 M. Tamer Özsu & Patrick Valduriez Page 1.1 Outline  Introduction à What is a distributed DBMS à Problems à Current state-of-affairs.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMSPage 4. 1© 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background  Distributed DBMS Architecture  Datalogical Architecture.
Distributed DBMSPage 5. 1 © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture  Distributed Database.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Preface Exponential growth of data volume, steady drop in storage costs, and rapid increase in storage capacity Inadequacy of the sequential processing.
Chapter 1 Introduction 1.1A Brief Overview - Parallel Databases and Grid Databases 1.2Parallel Query Processing: Motivations 1.3Parallel Query Processing:
Distributed DBMSPage 5. 1 © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture  Distributed Database.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
Distributed DBMS© 2001 M. Tamer Özsu & Patrick Valduriez Page 1.1 Outline  Introduction à What is a distributed DBMS à Problems à Current state-of-affairs.
Distributed DBMS© 1998 M. Tamer Özsu & Patrick Valduriez Page 13.1 Outline Introduction Background Distributed DBMS Architecture Distributed Database Design.
1 Distributed Databases CS347 Lecture 13 May 23, 2001.
CPSC-608 Database Systems Fall 2010 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes 1.
Outline Introduction Background Distributed Database Design
Distributed DBMS © M. T. Özsu & P. Valduriez Ch.7/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Object-based Storage Long Liu Outline Why do we need object based storage? What is object based storage? How to take advantage of it? What's.
PMIT-6102 Advanced Database Systems
Chapters 17 & 18 Physical Database Design Methodology.
01/02/20031 Welcome to Distributed Databases Course.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
HadoopDB project An Architetural hybrid of MapReduce and DBMS Technologies for Analytical Workloads Anssi Salohalla.
Distributed DBMS© 2001 M. Tamer Özsu & Patrick Valduriez Page 0.1 Outline Introduction Background Distributed DBMS Architecture Distributed Database Design.
Web Application Servers Dean Jacobs BEA WebLogic.
DBSQL 12-1 Copyright © Genetic Computer School 2009 Chapter 12 Recent Concepts and Application of Databases.
Intro – Part 2 Introduction to Database Management: Ch 1 & 2.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Preventive Replication in Database Cluster Esther Pacitti, Cedric Coulon, Patrick Valduriez, M. Tamer Özsu* LINA / INRIA – Atlas Group University of Nantes.
1 By Paul Murray Claire McQuade Kashif Rafiq David Miller.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
Databases Illuminated
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Database Architecture Course Orientation & Context.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
1 Distributed Databases architecture, fragmentation, allocation Lecture 1.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Distributed DBMS© 2001 M. Tamer Özsu & Patrick Valduriez Page 1.1 Outline n Introduction Background Distributed DBMS Architecture Distributed Database.
MarkLogic The Only Enterprise NoSQL Database Presented by: Aashi Rastogi ( ) Sanket Patel ( )
Distributed Database Management Systems © 2001 M
Outline Background Introduction Distributed DBMS Architecture
CSE-291 Cloud Computing, Fall 2016 Kesden
OVERVIEW OF RELATIONAL DBMS PERTEMUAN 1 IR. NIZIRWAN ANWAR, MT
CS 244A Expanded Course Outline Fall 2007
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
Distributed Databases
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
Outline Introduction Background Distributed DBMS Architecture
Database Architecture
Database System Architectures
Distributed Database Management Systems
Outline Introduction Background Distributed DBMS Architecture
Presentation transcript:

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control Distributed Query Processing Multidatabase Query Processing Distributed Transaction Management Data Replication Parallel Database Systems ➡ Data placement and query processing ➡ Load balancing ➡ Database clusters Distributed Object DBMS Peer-to-Peer Data Management Web Data Management Current Issues

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/2 The Database Problem Large volume of data  use disk and large main memory I/O bottleneck (or memory access bottleneck) ➡ Speed(disk) << speed(RAM) << speed(microprocessor) Predictions ➡ Moore’s law: processor speed growth (with multicore): 50 % per year ➡ DRAM capacity growth : 4  every three years ➡ Disk throughput : 2  in the last ten years Conclusion : the I/O bottleneck worsens

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/3 The Solution Increase the I/O bandwidth ➡ Data partitioning ➡ Parallel data access Origins (1980's): database machines ➡ Hardware-oriented  bad cost-performance  failure ➡ Notable exception : ICL's CAFS Intelligent Search Processor 1990's: same solution but using standard hardware components integrated in a multiprocessor ➡ Software-oriented ➡ Standard essential to exploit continuing technology improvements

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/4 Multiprocessor Objectives High-performance with better cost-performance than mainframe or vector supercomputer Use many nodes, each with good cost-performance, communicating through network ➡ Good cost via high-volume components ➡ Good performance via bandwidth Trends ➡ Microprocessor and memory (DRAM): off-the-shelf ➡ Network (multiprocessor edge): custom The real chalenge is to parallelize applications to run with good load balancing

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/5 Data Server Architecture client interface query parsing data server interface communication channel Application server Data server database application server interface database functions Client

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/6 Objectives of Data Servers Avoid the shortcomings of the traditional DBMS approach ➡ Centralization of data and application management ➡ General-purpose OS (not DB-oriented) By separating the functions between ➡ Application server (or host computer) ➡ Data server (or database computer or back-end computer)

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/7 Data Server Approach: Assessment Advantages ➡ Integrated data control by the server (black box) ➡ Increased performance by dedicated system ➡ Can better exploit parallelism ➡ Fits well in distributed environments Potential problems ➡ Communication overhead between application and data server ✦ High-level interface ➡ High cost with mainframe servers

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/8 Parallel Data Processing Three ways of exploiting high-performance multiprocessor systems:  Automatically detect parallelism in sequential programs (e.g., Fortran, OPS5)  Augment an existing language with parallel constructs (e.g., C*, Fortran90)  Offer a new language in which parallelism can be expressed or automatically inferred Critique  Hard to develop parallelizing compilers, limited resulting speed-up  Enables the programmer to express parallel computations but too low-level  Can combine the advantages of both (1) and (2)

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/9 Data-based Parallelism Inter-operation ➡ p operations of the same query in parallel op.3 op.1 op.2 op. R R1R1 R2R2 R3R3 R4R4 Intra-operation ➡ The same op in parallel

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/10 Parallel DBMS Loose definition: a DBMS implemented on a tighly coupled multiprocessor Alternative extremes ➡ Straighforward porting of relational DBMS (the software vendor edge) ➡ New hardware/software combination (the computer manufacturer edge) Naturally extends to distributed databases with one server per site

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/11 Parallel DBMS - Objectives Much better cost / performance than mainframe solution High-performance through parallelism ➡ High throughput with inter-query parallelism ➡ Low response time with intra-operation parallelism High availability and reliability by exploiting data replication Extensibility with the ideal goals ➡ Linear speed-up ➡ Linear scale-up

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/12 Linear Speed-up Linear increase in performance for a constant DB size and proportional increase of the system components (processor, memory, disk) new perf. old perf. ideal components

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/13 Linear Scale-up Sustained performance for a linear increase of database size and proportional increase of the system components. components + database size new perf. old perf. ideal

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/14 Barriers to Parallelism Startup ➡ The time needed to start a parallel operation may dominate the actual computation time Interference ➡ When accessing shared resources, each new process slows down the others (hot spot problem) Skew ➡ The response time of a set of parallel processes is the time of the slowest one Parallel data management techniques intend to overcome these barriers

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/15 Parallel DBMS – Functional Architecture RM task n DM task 12 DM task n2 DM task n1 Data Mgr DM task 11 Request Mgr RM task 1 Session Mgr User task 1 User task n

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/16 Parallel DBMS Functions Session manager ➡ Host interface ➡ Transaction monitoring for OLTP Request manager ➡ Compilation and optimization ➡ Data directory management ➡ Semantic data control ➡ Execution control Data manager ➡ Execution of DB operations ➡ Transaction management support ➡ Data management

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/17 Parallel System Architectures Multiprocessor architecture alternatives ➡ Shared memory (SM) ➡ Shared disk (SD) ➡ Shared nothing (SN) Hybrid architectures ➡ Non-Uniform Memory Architecture (NUMA) ➡ Cluster

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/18 Shared-Memory DBMS on symmetric multiprocessors (SMP) Prototypes: XPRS, Volcano, DBS3 + Simplicity, load balancing, fast communication - Network cost, low extensibility

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/19 Shared-Disk Origins : DEC's VAXcluster, IBM's IMS/VS Data Sharing Used first by Oracle with its Distributed Lock Manager Now used by most DBMS vendors + network cost, extensibility, migration from uniprocessor - complexity, potential performance problem for cache coherency

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/20 Shared-Nothing Used by Teradata, IBM, Sybase, Microsoft for OLAP Prototypes: Gamma, Bubba, Grace, Prisma, EDS + Extensibility, availability - Complexity, difficult load balancing

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/21 Hybrid Architectures Various possible combinations of the three basic architectures are possible to obtain different trade-offs between cost, performance, extensibility, availability, etc. Hybrid architectures try to obtain the advantages of different architectures: ➡ efficiency and simplicity of shared-memory ➡ extensibility and cost of either shared disk or shared nothing 2 main kinds: NUMA and cluster

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/22 NUMA Shared-Memory vs. Distributed Memory ➡ Mixes two different aspects : addressing and memory ✦ Addressing: single address space vs multiple address spaces ✦ Physical memory: central vs distributed NUMA = single address space on distributed physical memory ➡ Eases application portability ➡ Extensibility The most successful NUMA is Cache Coherent NUMA (CC-NUMA)

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/23 CC-NUMA Principle ➡ Main memory distributed as with shared-nothing ➡ However, any processor has access to all other processors’ memories Similar to shared-disk, different processors can access the same data in a conflicting update mode, so global cache consistency protocols are needed. ➡ Cache consistency done in hardware through a special consistent cache interconnect ✦ Remote memory access very efficient, only a few times (typically between 2 and 3 times) the cost of local access

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/24 Cluster Combines good load balancing of SM with extensibility of SN Server nodes: off-the-shelf components ➡ From simple PC components to more powerful SMP ➡ Yields the best cost/performance ratio ➡ In its cheapest form, Fast standard interconnect (e.g., Myrinet and Infiniband) with high bandwidth (Gigabits/sec) and low latency

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/25 SN cluster vs SD cluster SN cluster can yield best cost/performance and extensibility ➡ But adding or replacing cluster nodes requires disk and data reorganization SD cluster avoids such reorganization but requires disks to be globally accessible by the cluster nodes ➡ Network-attached storage (NAS) ✦ distributed file system protocol such as NFS, relatively slow and not appropriate for database management ➡ Storage-area network (SAN) ✦ Block-based protocol thus making it easier to manage cache consistency, efficient, but costlier

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/26 Discussion For a small configuration (e.g., 8 processors), SM can provide the highest performance because of better load balancing Some years ago, SN was the only choice for high-end systems. But SAN makes SN a viable alternative with the main advantage of simplicity (for transaction management) ➡ SD is now the preferred architecture for OLTP ➡ But for OLAP databases that are typically very large and mostly read-only, SN is used Hybrid architectures, such as NUMA and cluster, can combine the efficiency and simplicity of SM and the extensibility and cost of either SD or SN

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/27 Parallel DBMS Techniques Data placement ➡ Physical placement of the DB onto multiple nodes ➡ Static vs. Dynamic Parallel data processing ➡ Select is easy ➡ Join (and all other non-select operations) is more difficult Parallel query optimization ➡ Choice of the best parallel execution plans ➡ Automatic parallelization of the queries and load balancing Transaction management ➡ Similar to distributed transaction management

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/28 Data Partitioning Each relation is divided in n partitions (subrelations), where n is a function of relation size and access frequency Implementation ➡ Round-robin ✦ Maps i -th element to node i mod n ✦ Simple but only exact-match queries ➡ B-tree index ✦ Supports range queries but large index ➡ Hash function ✦ Only exact-match queries but small index

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/29 Partitioning Schemes Round-RobinHashing Interval a-gh-mu-z

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/30 Replicated Data Partitioning High-availability requires data replication ➡ simple solution is mirrored disks ✦ hurts load balancing when one node fails ➡ more elaborate solutions achieve load balancing ✦ interleaved partitioning (Teradata) ✦ chained partitioning (Gamma)

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/31 Interleaved Partitioning Node Primary copy R 1 R 2 R 3 R 4 Backup copy r 1.1 r 1.2 r 1.3 r 2.3 r 2.1 r 2.2 r 3.2 r 3.2 r

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/32 Chained Partitioning Node Primary copy R 1 R 2 R 3 R 4 Backup copy r 4 r 1 r 2 r

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/33 Placement Directory Performs two functions ➡ F 1 (relname, placement attval) = lognode-id ➡ F 2 (lognode-id) = phynode-id In either case, the data structure for f 1 and f 2 should be available when needed at each node

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/34 Join Processing Three basic algorithms for intra-operator parallelism ➡ Parallel nested loop join: no special assumption ➡ Parallel associative join: one relation is declustered on join attribute and equi-join ➡ Parallel hash join: equi-join They also apply to other complex operators such as duplicate elimination, union, intersection, etc. with minor adaptation

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/35 Parallel Nested Loop Join send partition node 3node 4 node 1node 2 R1:R1: S1S1 S2S2 R2:R2:

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/36 Parallel Associative Join node 1 node 3node 4 node 2 R1:R1: R2:R2: S1S1 S2S2

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/37 Parallel Hash Join node node 1 node 2 R1:R1: R2:R2: S1:S1: S2:S2:

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/38 Parallel Query Optimization The objective is to select the “best” parallel execution plan for a query using the following components Search space ➡ Models alternative execution plans as operator trees ➡ Left-deep vs. Right-deep vs. Bushy trees Search strategy ➡ Dynamic programming for small search space ➡ Randomized for large search space Cost model (abstraction of execution system) ➡ Physical schema info. (partitioning, indexes, etc.) ➡ Statistics and cost functions

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/39 Execution Plans as Operator Trees R2 R1 R4 Result j2 j3 Left-deep Right-deep j1 R3 R2R1 R4 Result j5 j6 j4 R3 R2R1 R3 j7 R4 Result j9 Zig-zag Bushy j8 Result j10 j12 j11 R2R1 R4 R3

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/40 Equivalent Hash-Join Trees with Different Scheduling R3 Probe3 Build3 R4 Temp2 Temp1 Build3 R4 Temp2 Probe3 Build3 Probe2 Build2 Probe1 Build1 R2R1 R3 Temp1 Probe2 Build2 Probe1 Build1 R2 R1

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/41 Load Balancing Problems arise for intra-operator parallelism with skewed data distributions ➡ attribute data skew (AVS) ➡ tuple placement skew (TPS) ➡ selectivity skew (SS) ➡ redistribution skew (RS) ➡ join product skew (JPS) Solutions ➡ sophisticated parallel algorithms that deal with skew ➡ dynamic processor allocation (at execution time)

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/42 Data Skew Example Join1 Res1 Res2 Join2 AVS/TPS JPS RS/SS Scan1 S2 R2 S1 R1 Scan2

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/43 Load Balancing in a DB Cluster Choose the node to execute Q ➡ round robin ➡ The least loaded ✦ Need to get load information Fail over ➡ In case a node N fails, N’s queries are taken over by another node ✦ Requires a copy of N’s data or SD In case of interference ➡ Data of an overloaded node are replicated to another node Q1Q1 Q2Q2 Load balancing Q3Q3 Q4Q4 Q4Q3Q2Q1Q4Q3Q2Q1

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/44 Oracle Transparent Application Failover Client Node 1 connect1 Node 2 Ping

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/45 Client PCs Enterprise network Microsoft Failover Cluster Topology Internal network Fibre Channel

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/46 Main Products VendorProductArchitecturePlatforms IBM DB2 Pure Scale DB2 Database Partitioning Feature (DPF) SD SN AIX on SP Linux on cluster Microsoft SQL Server SQL Server 2008 R2 Parallel Data Warehouse SD SN Windows on SMP and cluster Oracle Real Application Cluster Exadata Database Machine SDWindows, Unix, Linux on SMP and cluster NCRTeradata SN Bynet network NCR Unix and Windows OracleMySQLSNLinux Cluster

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/47 The Exadata Database Machine New machine from Oracle with Sun Objectives ➡ OLTP, OLAP, mixed workloads Oracle Real Application Cluster ➡ 8+ servers bi-pro Xeon, 72 GB RAM Exadata storage server : intelligent cache ➡ 14+ cells, each with ✦ 2 processors, 24 Go RAM ✦ 385 GB of Flash memory (read is 10* faster than disk) ✦ 12+ SATA disks of 2 To or 12 SAS disks of 600 GB

Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/48 Exadata Architecture Real Application Cluster Infiniband Switches Storage cells