Distributed Data Management Graeme Kerr Oracle in R&D Programme
Distributed Data Management Provides access to data when and where you need it Optimizes information access regardless of physical location of data. Enables virtualisation of data in Grids. Types of Data: Flat Files, Relational. Structured & unstructured. Text, Binary, proprietary. Scalar, user defined typed, XML, RDF Images, DICOM, LOB, Spatial. High Volumes.
Methods of Data Distribution Distribute Federate Consolidate
Distribute Information Share information between users, applications, databases, organisations and partners. Move or copy / replicate information as needed Traditionally implemented as replication or message queuing Wide range of technology solutions to achieving distribution. Continuous feed and periodic refresh capabilities are key requirements. Common within Scientific Grid Approach Works well when replicas are read-only. Considerable issues when replicas are updateable.
Data Distribution - Example SOURCE SITE REPLICA SITES Continuous Feeds
Data Distribution - Example SOURCE SITE REPLICA SITES Continuous Feeds Periodic Refresh
Data Distribution - Example SOURCE DATABASE REPLICA DATABASES Oracle STREAMS Transportable Tablespaces, RMAN, Data Pump
CERN LCG Example RALBNLCNAFIN2P3GridkaASGCTRIUMPFSARAPIC Nordu Grid Online Detector Offline CERN IT
Methods of Data Distribution Distribute Federate Consolidate
Federate Information Federate data in multiple data stores into a single virtual database Access both Database and non-Database data Access both structured and unstructured data Hide physical location of data from applications Vendor and open-source solutions. Used mostly as point solution within Commercial World rather than as a key design point.
Data Federation - Example DB2 Excel Web Services
Data Federation - Example DB2 XML Excel Flat Files Text Web Services Flat Files Sybase
Data Federation - Example DB2 XML Excel Flat Files Text Web Services Flat Files Sybase
Data Federation - Example DB2 XML Excel Flat Files Text Web Services Flat Files Sybase
Methods of Data Distribution Distribute Federate Consolidate
Consolidate Information Consolidate heterogeneous data into a central pool. Single database or pool of databases Any data (audio, video, XML, , messages, etc…) Any platform (Linux, Windows, Solaris, HPUX, AIX, Tru64, OpenVMS, OS/390) Proven scalability Hundreds of Very Large Database customers Industry leading benchmarks Very common in commercial world.
Typical Data Centre View DB Clusters Disk DB Users
Consolidated Clusters with RAC & ASM DB Cluster Pool Disk Pool DB Users
DB Server & Disk Virtualisation DB Cluster Pool Disk Pool DB Users
DB Server & Disk Virtualisation DB Cluster Pool Disk Pool DB Users
DB Server & Disk Virtualisation DB Cluster Pool Disk Pool DB Users
Distributed Data Management : Summary Oracle supports efficient utilization of all resources: Consolidate Information—Scalable access to large amounts of information Federate Information—Data can be located where needed but accessed as single database Share Information—Multiple users and applications can share data/information to operate more effectively Information Integration & fast connectivity critical to Grid Computing
Oracle Database Versions. Database 10G Express Edition Entry-Level, small footprint, free to develop, deploy & distribute. 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva Database 10G Enterprise Edition Flagship, Full-featured version. Database 10G Standard Edition Limited featured version, RAC, ASM, Streams Apply supported Database 10G Standard Edition 1 SE on limited hardware. Streams Apply supported. Data Grid / Distributed Caching TimesTen TangoSol Coherence
Oracle Database Versions. Database 10G Express Edition Entry-Level, small footprint, free to develop, deploy & distribute. 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva Database 10G Enterprise Edition Flagship, Full-featured version. Database 10G Standard Edition Limited featured version, RAC, ASM, Streams Apply supported Database 10G Standard Edition 1 SE on limited hardware. Streams Apply supported. Data Grid / Distributed Caching TimesTen TangoSol Coherence
For More Information Database: Express Edition: Tangosol: Times 10: or Database | Tangosol | etc
EMEA Oracle in R&D Programme