Oracle Database Native Sharding: a customer perspective ©2016 PayPal Inc. Confidential and proprietary. John Kanagaraj, Sr. Member of Technical Staff,

Slides:



Advertisements
Similar presentations
Database Architectures and the Web
Advertisements

Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
2 Proprietary & Confidential What is Sharding Benefits of Sharding Alternatives of Sharding When to start Sharding Agenda.
Introduction to DBA.
Transaction.
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
© 2011 Citrusleaf. All rights reserved.1 A Real-Time NoSQL DB That Preserves ACID Citrusleaf Srini V. Srinivasan Brian Bulkowski VLDB, 09/01/11.
© 2014 ScaleArc. All Rights Reserved. 1 Creating an Agile Data Environment for Apps in the Cloud Summer 2014.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
Confidential ODBC May 7, Features What is ODBC? Why Create an ODBC Driver for Rochade? How do we Expose Rochade as Relational Transformation.
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
How WebMD Maintains Operational Flexibility with NoSQL Rajeev Borborah, Sr. Director, Engineering Matt Wilson – Director, Production Engineering – Consumer.
Object Oriented Databases by Adam Stevenson. Object Databases Became commercially popular in mid 1990’s Became commercially popular in mid 1990’s You.
Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.
Databases with Scalable capabilities Presented by Mike Trischetta.
Database Design Table design Index design Query design Transaction design Capacity Size limits Partitioning (shard) Latency Redundancy Replica overhead.
Object-based Storage Long Liu Outline Why do we need object based storage? What is object based storage? How to take advantage of it? What's.
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers Objectives (what do.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
Distributed Data Stores and No SQL Databases S. Sudarshan Perry Hoekstra (Perficient) with slides pinched from various sources such as Perry Hoekstra (Perficient)
© , OrangeScape Technologies Limited. Confidential 1 Write Once. Cloud Anywhere. Building Highly Scalable Web applications BASE gives way to ACID.
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Data Dependent Routing may not be necessary when using Oracle RAC Ken Gottry Apr-2003 Through Technology Improvements in: Oracle 9i - RAC Oracle 9i - CacheFusion.
Scott Klein Technical Evangelist. Scott Klein.
 2009 Calpont Corporation 1 Calpont Open Source Columnar Storage Engine for Scalable MySQL Data Warehousing April 22, 2009 MySQL User Conference Santa.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
Intuitions for Scaling Data-Centric Architectures
Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Your Data Any Place, Any Time Performance and Scalability.
Copyright ©2003 Dell Inc. All rights reserved. Scaling-Out with Oracle® Grid Computing on Dell™ Hardware J. Craig Lowery, Ph.D. Software Architect and.
Introduction. System Design Hardware/Software Platform Selection Software Architectures Database Design Human-Computer Interaction (HCI) Interface Object.
CMPE 226 Database Systems May 3 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
1 Cloud-Native Data Warehousing Bob Muglia. 2 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be.
Intro to NoSQL Databases Tony Hannan November 2011.
Eric Grancher CERN IT department Overview of Database Technologies Computing and Astroparticle Physics 2 nd ASPERA Workshop /1.
Introduction to Oracle Forms Developer and Oracle Forms Services
Platform as a Service (PaaS)
CSCI5570 Large Scale Data Processing Systems
Platform as a Service (PaaS)
Windows Azure SQL Federation
Efficient data maintenance in GlusterFS using databases
Introduction to Oracle Forms Developer and Oracle Forms Services
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Open Source distributed document DB for an enterprise
Every Good Graph Starts With
Sebastian Solbach Consulting Member of Technical Staff
Introduction to Oracle Forms Developer and Oracle Forms Services
Modern Databases NoSQL and NewSQL
Did your feature got in, out or planned?
Introduction to NewSQL
In-Memory Performance
Introduction of Week 6 Assignment Discussion
Capitalize on modern technology
Data and Application Modeling in the Brave New World of Oracle Sharding BUS1845: Oracle OpenWorld 2018 John Kanagaraj, Sr. MTS, PayPal • 23-Oct-2018.
Design and Maintenance of Web Applications in J2EE
Massively Parallel Cloud Data Storage Systems
Arrested by the CAP Handling Data in Distributed Systems
NoSQL Databases An Overview
Introduction to Databases Transparencies
Cloud computing mechanisms
MORE ON ARCHITECTURES The main reasons for using an architecture are maintainability and performance. We want to structure the software into reasonably.
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Presentation transcript:

Oracle Database Native Sharding: a customer perspective ©2016 PayPal Inc. Confidential and proprietary. John Kanagaraj, Sr. Member of Technical Staff, PayPal Core Data Platform 1

Speaker Qualifications Currently Sr. Database/Data PayPal Has been working with Oracle Databases and UNIX for 28 years Working on various NoSQL technologies for the past 3 years Has worked on many Sharded applications – Both Oracle and NoSQL Author, Technical editor, Oracle ACE, Frequent speaker Loves to mentor new speakers and authors!

©2016 PayPal Inc. Confidential and proprietary. 3 Sharding at PayPal PayPal runs 800+ Oracle databases Oracle is used for all site-critical applications; NoSQL is onboarding Core transactional systems on Oracle Sharding based scaling for selected use cases Custom, fixed # sharding for high scale workloads started in 2009 Maintaining custom sharding code and infrastructure challenging NoSQL provides sharding out of box Evolving standard pattern for sharded access (client, connection pooling, routing, logical-to-physical mapping, etc.) Oracle Sharding will allow us to more easily and effectively scale our mission- critical transactional databases

Custom Sharding Architecture evolution ©2016 PayPal Inc. Confidential and proprietary App code determines shard to query Dev needs to understand sharding! Tables T1 – T24 spread on 3 physicals Shards hardcoded to Physical servers Connection pool per physical Shard migration is Hard! Data Access Layer determines Shard to query (not App code!) Logical Shards mapped to Physical shards Logical to Physical mapping metadata cached in Data Access Layer (Shard Catalog!) One Connection pool per logical schema Moving Shard is Easy! (or rather: Easier) Logical Shard # Physical Shard #

©2016 PayPal Inc. Confidential and proprietary. 5 Sharding Principles Sharding is mostly a Scalability play ”Key is Key” – Choose one STRONG key and align. E.g. Account/User ID Data model changes (drastically!) – Normalization is relaxed Cross-key transaction boundaries are broken Non-shard key access can be expensive; forces “scatter-gather” pattern CAP Theorem: 2 of 3 for (C)onsistency, (A)vailability and Network (P)artition “Lookup” requirement for Common data elements Scheme for mapping logical to physical is critical for future scale-out Joins and ACID principles usually not available in Sharded systems (e.g. NoSQL) Oracle Sharding supports the sharding principles while providing consistency, transactions, and relational capabilities such as Joins

©2016 PayPal Inc. Confidential and proprietary. 6 Oracle Sharding – Feedback from Beta software What we liked Auto-configuration (Create and Manage Shards and Routing) Out of box support for Shard Catalog, Routing layer, Common Tables for Lookup Cross-shared query support (Not supported in custom sharding) Familiar SQL for sharding management Two level sharding (not supported in other technologies such as NoSQL) Consistent hashing to prevent large data movement as new shards are added Dynamically changing the number of shards Things to be careful of Sharding requires major rework in Data Model and Data Design Duplicated Table – Make sure not to overuse; Not for write-heavy use cases Enable connection pool limits per Shard

Backup Slides ©2016 PayPal Inc. Confidential and proprietary. 7

©2015 PayPal Inc. Confidential and proprietary. 8 Challenges at Scale [Remove this slide for the presentation – retain in backup] Pushing the limits Connections Memory Interconnect CPU DDL on busy tables RAC reconfiguration Redo rate I/O latencies SAN Storage limits Replication latencies Solutions Separate reads for RO scaleout (ADG/GG) Microservices Connection multiplexing Write Isolation Custom caching …. And finally.... Sharding!