Data Replication with Advanced Replication & Oracle Streams John Abrahams Technology Sales Consultant Oracle Nederland.

Slides:



Advertisements
Similar presentations
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Advertisements

Oracle Materialized Views for Replication COUG Presentation, Feb 20, 2014 Jane Lamont,
BY LECTURER/ AISHA DAWOOD DW Lab # 3 Overview of Extraction, Transformation, and Loading.
Chapter 24 Replication and Mobile Databases Transparencies © Pearson Education Limited 1995, 2005.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data
Chapter 13 (Web): Distributed Databases
Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco.
Oracle Advanced Queuing Features Overview
Overview Distributed vs. decentralized Why distributed databases
Hands-On Microsoft Windows Server 2003 Administration Chapter 5 Administering File Resources.
Data Warehouse success depends on metadata
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
CS 603 Data Replication in Oracle February 27, 2002.
EIM April 19, Robin Weaver 13 Years with IBM Prior to Assignment at UNC Charlotte Range of Database Development/Data Management Projects and Products.
Working with SQL and PL/SQL/ Session 1 / 1 of 27 SQL Server Architecture.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
Passage Three Introduction to Microsoft SQL Server 2000.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Module 9: SQL Server 2005 Replication. Overview Overview of Replication Enhancements New Types of Replication Configuring Replication.
ADVANCE REPLICATION Oracle Coretech Surabaya (OCS) One Day Seminar Surabaya, August 30 th 2008.
Jason Morrill NCOAUG Training Day February, 2008
SQL Server Replication By Karthick P.K Technical Lead, Microsoft SQL Server.
Gary MacDougall Premjit Singh Managing your Distributed Data.
Overview of SQL Server Alka Arora.
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
Distributing Data for Availability and Scalability Don Vilen Program Manager SQL Server Microsoft Corporation.
Data Warehousing Seminar Chapter 5. Data Warehouse Design Methodology Data Warehousing Lab. HyeYoung Cho.
Oracle Streams--Simplifying Information Sharing in Oracle10 g Patricia McElroy Product Manager Oracle Corporation Session id:
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
DAT201: Introduction To SQL Server ™ Replication Matt Hollingsworth Program Manager Microsoft Corporation.
1 Oracle Database 11g – Flashback Data Archive. 2 Data History and Retention Data retention and change control requirements are growing Regulatory oversight.
Data: Migrating, Distributing and Audit Tracking Michelle Ayers, Advisory Solution Consultant
5 Copyright © 2009, Oracle. All rights reserved. Right-Time Data Warehousing with OWB.
By Lecturer / Aisha Dawood 1.  You can control the number of dispatcher processes in the instance. Unlike the number of shared servers, the number of.
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
Lecture 5: Sun: 1/5/ Distributed Algorithms - Distributed Databases Lecturer/ Kawther Abas CS- 492 : Distributed system &
Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia
FORUM II Best Practices in Data Warehousing in Higher Education: A Framework for Higher Education Reporting April 18, 2005 Slide 1 Cornell University’s.
Module 1: Exploring Replication. Overview Understanding SQL Server Replication Setting Up Replication Understanding Agents in Replication Securing Replication.
Overview – Chapter 11 SQL 710 Overview of Replication
Module 6: Implementing SQL Server Replication in an Enterprise Environment.
Module 11: Introducing Replication. Overview Introduction to Distributed Data Introduction to SQL Server Replication SQL Server Replication Agents SQL.
1 Distributed Databases BUAD/American University Distributed Databases.
Page 1. Data Integration Using Oracle Streams A Case Study Session #:
Oracle® Streams for Near Real Time Asynchronous Replication Nimar S. Arora Oracle USA.
SQL Server 2005 Implementation and Maintenance Chapter 12: Achieving High Availability Through Replication.
Ing. Erick López Ch. M.R.I. Replicación Oracle. What is Replication  Replication is the process of copying and maintaining schema objects in multiple.
MBA 664 Database Management Systems Dave Salisbury ( )
MGA Duplica Replication Tool. 1. High Availability and Avoidance of Data Loss  Replicate to alternate databases 2. Split activities across databases.
7 Strategies for Extracting, Transforming, and Loading.
Module 5: Implementing Merge Replication. Overview Understanding Merge Replication Architecture Implementing Conflict Resolution Planning and Deploying.
DATABASE REPLICATION DISTRIBUTED DATABASE. O VERVIEW Replication : process of copying and maintaining database object, in multiple database that make.
Features Of SQL Server 2000: 1. Internet Integration: SQL Server 2000 works with other products to form a stable and secure data store for internet and.
ViaSQL Technical Overview. Viaserv, Inc. 2 ViaSQL Support for S/390 n Originally a VSE product n OS/390 version released in 1999 n Identical features.
HyperKVS Group Meeting Oracle Streams Dr. Volker Kuhr.
Chapter 1 Database Access from Client Applications.
Oracle9i Performance Tuning Chapter 11 Advanced Tuning Topics.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
C Copyright © 2006, Oracle. All rights reserved. Integrating with Oracle Streams.
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
20 Copyright © 2006, Oracle. All rights reserved. Best Practices and Operational Considerations.
Distributed Databases
11 Copyright © 2009, Oracle. All rights reserved. Enhancing ETL Performance.
Oracle Clustering and Replication Technologies UK Metadata Workshop - Oxford Barbara Martelli Gianluca Peco.
Chapter Name Replication and Mobile Databases Transparencies
Overview of MDM Site Hub
Maximum Availability Architecture Enterprise Technology Centre.
Technical Capabilities
Data Warehousing Concepts
Presentation transcript:

Data Replication with Advanced Replication & Oracle Streams John Abrahams Technology Sales Consultant Oracle Nederland

What is Replication  Multiple copies of data at different sites  Increased availability  Manual data replication implementations – Export/Import – CREATE TABLE AS SELECT FROM REMOTETABLE – COPY

Oracle9i Features for Information Sharing  Features Introduced In Prior Releases – Data Guard – Physical Standby Database – Advanced Queuing -- Message Queuing – Advanced Replication -- Replication of Data – Change Data Capture -- DW Loading  Features available in Oracle9iR2 – Oracle Streams -- a comprehensive information sharing solution – Data Guard -- Logical Standby Database based on Oracle Streams

Oracle Advanced Replication

Oracle9i Replication Technology  Oracle9i provides built-in technology to create and manage replicated environments – Integrated, no add-ons, no special commands – Managed with Oracle Enterprise Manager  Advanced data replication technology – Bi-directional, all copies potentially updatable – Automatic conflict detection and resolution – Tables and supporting objects – Full copies or subsets – DDL (schema changes) as well as DML (transactions) – Continuous or on demand replication

Replication Usage Examples Information Dissemination Move data (such as price lists) locally for improved response times Offload queries from master site Only subsets of data need to be replicated

Replication Usage Examples Call Centers Same data available at all sites Very useful for balancing usage Viable failover strategy; if one sites fails, others remain available Updates can be done anywhere

Replication Usage Examples Branch Office Automation Data is located close to users at local sites Data is consolidated at central site for processing and rollups Only subsets of data need to be replicated at each site Updates can be done anywhere

Replication Usage Examples Mass Deployment Mobile users must be able to operate even when disconnected from central sites Mobile users can replicate only data they need to their laptop Synchronization can be done when re-connected

Architectural Overview Single master replication Single, updatable master Multiple updatable or read-only materialized views (snapshots) – Full copies or subsets of master All conflicts resolved at parent site of materialized view Oracle9i standard edition master materialized view master materialized view

Support for multiple, n-way connected, updatable masters Improves scalability and availability Oracle9i Enterprise Edition master M View Master M View M View Architectural Overview Multiple master replication

Master vs. Materialized View Replication MultimasterMaterialized View Server-to-Server Mass Deployment Large number of small remote sites HQ and regional offices Continuous, near real- time data propagation Periodic Bulk Transfer Updates per transaction Final Values of Changed Rows Only Full Copies Subsets or Full Copies

Advanced Replication Key Features  Near real-time replication (multimaster) – Parallel data propagation – Multiple, pre-defined conflict resolution methods  Mass deployment (materialized views) – subquery materialized views – deployment templates – multitier materialized views  Oracle Enterprise Manager for configuration and administration  Specialized options: procedural and synchronous replication

Near Real-time Replication  Benefits: – availability, scalability, failover  Uses: – Telesales, support  Requirements: – Efficient data capture and storage – Efficient data propagation – Continuous data propagation – Automatic resolution of conflicting updates

Architectural Overview Multimaster Groups of related schema objects kept in synch at multiple locations Sites communicate by broadcasting changes to all other sites master

Replication Objects  Database object replication to multiple servers  The following database objects can be replicated – Tables, Indexes, Views, Synonyms, Triggers – Packages, Procedures, Functions – Advanced Data Types  User-Defined Types, Indextypes  Tables with column objects, object tables  Nested Tables, Varrays

Efficient Data Capture and Storage  Committed changes are added to queue for later propagation to remote sites – Enqueued using advanced queueing mechanism – Captured and applied using internal C code – Minimum data needed to apply change is captured Updates Internal Trigger Advanced Queue Source tables

Efficient Data Propagation  Queued changes are pushed to remote sites in parallel for improved performance – Single parallel stream – Maintains transactional consistency – Automatically detects transaction dependencies If update … If delete … If insert... If update … If delete … If insert... internal procedure background process

Parallel Data Propagation  Dependencies – Transaction “B” is dependent on transaction “A” if “B” accesses data “A” has updated – Dependency detection is dynamic and light- weight  Ordering – Dependent transactions are propagated in dependency order – All other transactions are propagated in parallel

Continuous Propagation  Changes can be continuously propagated or at a fixed interval, fixed time, or on demand  Different intervals can be used for each location  Different intervals can be used for each group  Dynamic Views to monitor propagation activity and throughput

Automatic Conflict Resolution  Automatic conflict detection with user- selectable conflict resolution routines – latest timestamp, earliest timestamp, maximum or minimum value, overwrite, priority group, discard, site priority, average, or additive  User-definable resolution routines  Detection and resolution based on column groups

Mass Deployment Replication  Benefits: – disconnected, updatable  U ses: – field sales, field service  Requirements: – Easily define unique subsets – Easily deploy to 100’s of sites – Support mobile users  refresh on demand

– Full transactional consistency – Efficient, batch-oriented refresh l Scheduled or on demand l Refresh groups preserve master-detail relationships – Updatable materialized views use deferred transactions to push changes to master Architectural Overview Materialized Views SELECT … FROM... network Master table log Materialized View Updatable or read-only copy of a table, or portion of a table, at a point in time

Managing Advanced Replication

Specialized Options  Synchronous Replication – always up to date, no conflicts – slower response, network dependent  Procedural Replication – faster for batch processing – must be serialized, best done in off hours – useful for purging

Replication Summary  Full and subset  Near realtime or on demand  Graphical administration tool  Sophisticated Functionality – Efficient data capture and storage – Parallel data propagation – Automatic conflict detection and resolution – Subquery subsetting – Deployment templates – Multitier materialized views – Specialized options

Oracle Streams

Oracle Streams – Unified Messaging and Data Movement Oracle Advanced Queuing (AQ) Oracle Advanced Replication Update Standby Feed Data Warehouse Messaging Replication Standby Data Warehousing Oracle Streams Intelligent, Unified, Time-Ordered Information Stream

Oracle Streams  A new solution for information sharing  Provides a unified architecture for all information sharing solutions – uniquely flexible replication – message queuing – data warehouse loading – event management and notification  The foundation of Data Guard Logical Standby Database

Streams Basic Elements  Three basic elements in each database – Capture – Staging – Consumption (apply) Consumption StagingCapture

Multi-Database Streams  A stream can contain multiple elements from multiple databases  Events flow between staging areas Consumption Staging Capture Consumption Staging Capture

Capture  Streams captures events – Implicitly: log-based capture of DML and DDL – Explicitly: Direct enqueue of user messages  Captured events are published in the staging area Capture

Log-Based Change Capture  Low overhead, low latency change capture – Changes to the database are written to the online redo log – Oracle Streams can extract changes from the log as it is written (mining the active log) – Changes are formatted as a Logical Change Record (LCR), a SQL like representation of the change Capture

Direct Enqueue  User applications can explicitly enqueue user messages into the staging area – Multiple open interfaces supported: JMS, C, PLSQL, SOAP (XML/HTTP), XML/SMTP – Allows applications to communicate at a higher level – Allows users to introduce events into the stream from non-Oracle systems Capture

Staging  Streams publishes captured events into a staging area – Implemented as a queue – Supports for new self-describing type “any” datatype allows a single staging area to hold any type of data – All events, LCRs and user-messages, can be staged in the same queue – Messages remain in staging area until consumed by all subscribers Staging

Staging Area Propagation  Other staging areas can subscribe to events – in same database – in a remote database  Events can be routed through a series of staging areas Propagation Staging Staging

Transformations  Transformations can be performed – as events enter the staging area – as events leave the staging area – as events propagate between staging areas  Transformation examples – change format, data type, column name, table name Staging

Consumption  Staged events are consumed by subscribers – Implicitly: Apply Process  Default Apply  User-Defined Apply – Explictly: Application dequeue via open interfaces  JMS, C, PLSQL, SOAP (XML/HTTP), XML/SMTP Consumption

Default Apply  The default apply engine will directly apply the DML or DDL represented in the LCR – apply to local Oracle table – apply via DB Link to non-Oracle table  Automatic conflict detection with optional resolution – unresolved conflicts placed in exception queue  Parallel apply maximizes concurrency Consumption

User-defined Apply  User-written custom apply functions  Written in PL/SQL, Java, C, C++  Uses: – custom transformations – column subsetting – normalizing or denormalizing data – populating related fields or tables Consumption

Rule-based Subscription  Consumers subscribe to published events  Content-based subscriptions limit delivered events to those meeting the subscription criteria  Rules govern capture, staging, and consumption Staging Area Rules Engine Publish Subscribe UPDATE EMP... WHERE OBJECT = ‘EMP’

NY(master) London(subset) Milan(subset)Paris(subset) Directed Networks  Propagation independent of Apply  Rules-based subscription determine if event is locally applied – London applies UK only  WAN Friendly – Send once, fan out – NY-->London, London-->Milan, London-->Paris INSERT … VALUES (‘EUROPE’,’ ITALY’) ITALY EUROPE FRANCE

Heterogeneous Support  Oracle to non-Oracle apply via gateway – Apply process on Oracle node applies change  Non-Oracle to Oracle change capture supported via explicit enqueue of LCRs  Message Gateways – MQ Series – Tibco LCR or user message Gateway MessageGateway MQ Series Sybase

Streams Deployments  Streams can be deployed to meet a variety of information sharing requirements – Replication – Data Warehouse Loading – Event Notification – Message Queuing – Data Guard Logical Standby Database

Replication  Streams asynchronously maintains multiple copies of objects via automatic apply – Identical objects – Related via a transformation or function  Streams automatically captures, propagates, and applies DML and DDL changes – Detects and optionally resolves conflicts  Supports flexible data movement and subsetting  Gateways and APIs for heterogeneous support  Compatible with Materialized Views

Replication  Benefits: – No quiesce for DDL – Lower overhead on production system – Reduced network traffic – Flexible configurations Log-based Capture Capture Stage DefaultApply Log-based Stage DefaultApply Propagation

Data Warehouse Loading  Streams can load data warehouse staging areas and Operational Data Stores – Updates captured from a production system – Messages and business events from a process flow  Supports continuous or batch loading  Automatically transforms data to appropriate format and schema during Operational Data Store load

Data Warehouse Loading  Benefits: – low overhead – automatic transformation – near real-time loading of operation data stores Stage User-definedApply Propagation Stage Log-based Capture Capture Production Database Staging Table or ODS

Event Notification  Streams can notify subscribers that events of interest have occurred – Pager notification of flight delays (Orbitz) – Notification of price drops (CNET Shopper) – Notification to sales manager of Gold Customer purchase (CRM App)  Streams can evaluate DML events and send notifications to applications that send s, page users, etc – Users get information they want

Event Notification  Benefits: – scalable – reduced custom development Log-based Capture Capture Stage ExplicitDequeue

Message Queuing  Streams can be deployed as an enhanced database integrated message queuing solution – Point-to-point messaging, publish and subscribe – Single data, security and transactional model for database and message queuing operations – Centrally managed and multi-consumer queues to simplify configuration – Content-based subscriptions, internet access – Automatic dequeue to server-run user function – Automatic transform DML/DDL into messages

Message Queuing  Benefits: – Reduced development costs – Easy database integration – Single development, operational, security model – Reliability and integrity of database Stage ExplicitDequeue Propagation Stage ExplicitEnqueue Source Database Destination Database

Data Guard Logical Standby Database  Multiple copies of data protects from human and data errors, and disasters  Special case of replication – Entire database (by default) – One direction only  Streams supports reporting from standby as updates are applied  Data Guard adds higher level interface, tailored GUIs and broker

Data Guard Logical Standby Database  Benefits: – Open while protecting data – Support near real-time reporting from standby – Protects from physical corruptions – Additional indexes and materialized views DefaultApply Remote Logging Log-based Capture Capture Production Database Logical Standby Database

Other Oracle9i Information Sharing Features  Advanced Replication – provides compatible replication with Oracle 8, 8i, and 9i databases – Migration path to Oracle Streams in future release  Advanced Queuing – Compatible with Oracle Streams – Most functionality offered in Streams – API’s retained for compatibility – Migration path to Oracle Streams in future release

Other Oracle9i Information Sharing Features  Data Guard Physical Standby Database – Uses media recovery mechanism to apply changes to database – Creates an exact copy of the production database  block-for-block copy  Same version of Oracle, same hardware/software architecture – Supports very high transaction workloads – Will coexist with Data Guard Logical Standby Database

Summary  Oracle Streams unifies all enterprise information into a single Stream – Unifies database, messaging, replication, publish/subscribe APIs and capabilities  Allows deployment of a variety of solutions  Provides a single, unified solution to the problem of Information Sharing Consumption Staging Capture