In-Memory Data Grid Use Cases & Patterns Jean-Noel Moyne TIBCO Fellow © Copyright 2000-2014 TIBCO Software Inc.



- The State of the Market
- Challenges Faced
- The Right Technology to Address These Challenges
- Sample Use Cases and Patterns
- Next Steps and Q&A

The State of the Market
- Used to be: a centralized SQL DBMS was the only tool
- SSD
- Column oriented
- Caching
- In-memory
- Distributed data stores
- Map/reduce
- What data store should I use?

The Challenges
- Big Data: scaling databases
- Cloud: scaling in virtual environments
- Faster data access: for reads and writes
- Elasticity: handling spikes
- Achieving fault-tolerance and disaster recovery
- Bridging the gap between ‘data at rest’ and ‘data in motion’
- Eliminating incoherence due to data being copied over many datastores
It’s not just about “Big Data”, it’s also about “Fast Data”!

The Right Technology to Address These Challenges

TIBCO ActiveSpaces In-Memory Datagrid
“Like a database”:
- Store and retrieve data:
  - Key/value store
  - Queries with indexing
- Data stored with a schema, self-describing
  - Tuples
- Work on data
  - Immediately consistent
  - ACID properties
  - Locking and ‘Compare and Set’ operations
“Like a messaging system”:
- Real-time ‘push’ over the network
  - Listeners
  - Continuous queries
“Like a compute grid”:
- Remote invocation for map/reduce processing
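The ‘Compare and Set’ and listener ideas on this slide can be illustrated with a toy, single-process store. This is only a sketch: `TupleStore`, `compare_and_set` and `add_listener` are hypothetical names chosen for illustration and are not the ActiveSpaces API.

```python
import threading


class TupleStore:
    """Toy in-process key/value store (hypothetical, not the ActiveSpaces API)."""

    def __init__(self):
        self._data = {}
        self._lock = threading.Lock()
        self._listeners = []

    def add_listener(self, callback):
        # callback(key, old_value, new_value) runs after each successful write
        self._listeners.append(callback)

    def get(self, key):
        with self._lock:
            return self._data.get(key)

    def compare_and_set(self, key, expected, new):
        """Write `new` only if the current value equals `expected`.

        A missing key is treated as the value None in this sketch.
        """
        with self._lock:
            if self._data.get(key) != expected:
                return False
            self._data[key] = new
        # Notify listeners outside the lock ("push" of the change)
        for callback in self._listeners:
            callback(key, expected, new)
        return True


store = TupleStore()
events = []
store.add_listener(lambda key, old, new: events.append((key, old, new)))

first = store.compare_and_set("balance:42", None, 100)  # key absent: accepted
stale = store.compare_and_set("balance:42", 90, 80)     # stale expectation: rejected
second = store.compare_and_set("balance:42", 100, 80)   # matches: accepted
```

The rejected write leaves the stored value untouched and produces no event, which is the property that lets concurrent writers detect that they worked from stale data.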

ActiveSpaces Architecture
- True peer-to-peer distributed design
  - Distributed storage using monotonic hashing algorithm
  - Horizontal scalability
  - Elasticity
- In-memory storage with durability
  - Replication
  - Persistence to disk
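The slide does not say which hashing algorithm is used. One well-known scheme with a monotonic property (adding a node only moves keys onto the new node, never between existing nodes) is consistent hashing, so the sketch below assumes that scheme. `HashRing`, the node names and the virtual-node count are all illustrative assumptions.

```python
import bisect
import hashlib


def _hash(value: str) -> int:
    # 64-bit integer derived from MD5; used only for placement, not security
    return int.from_bytes(hashlib.md5(value.encode()).digest()[:8], "big")


class HashRing:
    """Consistent-hash ring with virtual nodes (illustrative sketch)."""

    def __init__(self, nodes, vnodes=64):
        self._vnodes = vnodes
        self._ring = []  # sorted list of (hash, node) points
        for node in nodes:
            self.add(node)

    def add(self, node):
        for i in range(self._vnodes):
            bisect.insort(self._ring, (_hash(f"{node}#{i}"), node))

    def owner(self, key):
        # First ring point at or after the key's hash, wrapping around
        idx = bisect.bisect_left(self._ring, (_hash(key),)) % len(self._ring)
        return self._ring[idx][1]


ring = HashRing(["node-a", "node-b", "node-c"])
keys = [f"key-{i}" for i in range(1000)]
before = {k: ring.owner(k) for k in keys}

ring.add("node-d")  # scale out by one node
moved = [k for k in keys if ring.owner(k) != before[k]]
```

Every key in `moved` now belongs to `node-d`; the other keys stay where they were, which is what makes elastic scale-out cheap under this scheme.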

Persistence
- Shared-Nothing
- Shared-All

ActiveSpaces Architecture (continued)
- Platform independent middleware
  - Stores Tuples (rows), not objects
  - C, Java, .NET API
- Embeddable
  - Written in C/C++, stores everything outside of the heap
- Multisite
- Secure

Key Features of ActiveSpaces
- Scalability and elasticity: true peer-to-peer distributed design for linear scalability
- Software-only
- Not “just for caching”
- Can be deployed in multiple data centers
- More than just storage (e.g. eventing, distributed processing)
- Integrated into (and used by) the TIBCO stack of products

The ActiveSpaces “Sweet Spot”
- When you have the “need for speed”
- When you know what kind of questions you are repeatedly going to ask about the data
- When you know the analysis you want to run, and you run it all the time (repeatedly, as soon as the data changes)
- When you need both distributed scalability and ACID properties
- When you want scalability in software on commodity hardware (or a virtualized environment)
- When you want fault-tolerance without the need for special hardware
- When you want events about the changes to the data
- When you need data-store and eventing capabilities in a single package

Compared to Other Offerings
- Pure in-memory with optional disk persistence
- Distributed and replicated built-in disk persistence
- Can be used to provide cache-through access to existing DB tables
- Queries return even ‘evicted’ data
- Everything stored off-heap
- Immediately consistent with ACID properties
- Platform independent middleware
- SQL-like query language
- Events pushed over the network in real time
- Built-in transactional cross-site replication
- Event-driven map/reduce processing
- Speed and throughput!
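Cache-through access to an existing DB table can be sketched as a read-through, write-through wrapper. The example below is a minimal illustration using SQLite from the Python standard library; the `product` table, the `CacheThrough` class and its methods are assumptions made for the sketch, not part of any TIBCO product.

```python
import sqlite3


class CacheThrough:
    """Read-through / write-through cache over one table (illustrative sketch)."""

    def __init__(self, conn):
        self._conn = conn
        self._cache = {}
        self.db_reads = 0  # counts how often the database was actually queried

    def get(self, sku):
        if sku in self._cache:
            return self._cache[sku]
        # Cache miss (never loaded, or evicted): fall through to the database
        self.db_reads += 1
        row = self._conn.execute(
            "SELECT name FROM product WHERE sku = ?", (sku,)
        ).fetchone()
        if row is None:
            return None
        self._cache[sku] = row[0]
        return row[0]

    def put(self, sku, name):
        # Write to the system of record first, then refresh the cached copy
        self._conn.execute(
            "INSERT OR REPLACE INTO product (sku, name) VALUES (?, ?)", (sku, name)
        )
        self._conn.commit()
        self._cache[sku] = name

    def evict(self, sku):
        self._cache.pop(sku, None)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (sku TEXT PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO product VALUES ('A1', 'kettle')")

grid = CacheThrough(conn)
first = grid.get("A1")    # loaded from the table
second = grid.get("A1")   # served from memory
grid.put("B2", "toaster")
grid.evict("B2")
after_evict = grid.get("B2")  # evicted entry is reloaded from the table
```

The last call shows the behaviour the slide describes for evicted data: the entry is gone from memory but a lookup still returns it, at the cost of one database read.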

Use Cases and Patterns

ActiveSpaces Use Cases
- Generic: Low latency data access
- Telco: Real-time offer generation and fulfillment
- Retail: In-memory product catalog, in-memory inventory
- Retail Banking: Fast temporary shared storage for EDI context data, fast account lookup
- Transportation: Real-time tracking and incident management
- Capital markets: Real-time processing

Generic: Low Latency Data Access
When you need low latency access to data, including:
- Data that you read all the time (or repeatedly)
- Data that you write all the time
- Temporary (or ‘working’) data
- Any data with a short shelf-life (e.g. events)
And when every millisecond (microsecond, even) counts!

Telco: Real-Time Offer Generation and Fulfillment
Offer rules and event volumes:
- Reload: give 100 free SMS to a subscriber who tops up more than $xxx. Total: 12 million top-ups per day. Peak: 300 top-ups per second.
- Purchase 3G Package: cross-sell a Voice/SMS package to a subscriber who purchases a 3G Mobile Package. Total: 3 million per day. Peak: 50 events per second.
- Voice Call: give a discounted VoIP package to a subscriber who makes an IDD call. Total: 200 million per day. Peak: 12,000 events per second.
- SMS Usage: give a discounted SMS package to a subscriber who sends more than 10 SMS a day. Total: 750 million per day. Peak: 27,000 events per second.
Diagram labels:
- Event Cloud: Purchase BB Package, Reload, Voice Call, IDD Call, OnNet Call, SMS Usage
- Event Handling and Processing
- Touchpoint Integration: Billing, Offer Fulfilled
- Fulfillment: Fulfill SMS Package, Fulfill 3G Package, Fulfill Voice Package
- Messages: Offer Message, Reminder Message, Fulfillment Message
Scale:
- 46.7 million subscribers
- 2,000 SMS notifications per second
- 500 offer fulfillments per second

Telco: Real-Time Offer Generation and Fulfillment
- Event handling: CDRs come in from the cell network, and each event is compared against lists
- Campaign trigger: an offer is qualified according to the events that have occurred and been tracked for the subscriber
The Numbers
- 1 billion events per day
- Peaks of 40,000 to 50,000 events per second from the network (for hours, during the peak usage period of the day)
- 2 BW servers, 2 AS servers (active-active)
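As an illustration of a campaign trigger, the sketch below tracks SMS events per subscriber per day and qualifies the discounted SMS package offer once the count exceeds 10. The class name, the event shape and the once-per-day behaviour are assumptions made for the example; the deck does not describe the actual rule implementation.

```python
from collections import defaultdict

SMS_THRESHOLD = 10  # "more than 10 SMS a day", taken from the SMS Usage rule


class SmsOfferRule:
    """Tracks SMS events and qualifies one offer per subscriber per day (sketch)."""

    def __init__(self, threshold=SMS_THRESHOLD):
        self._threshold = threshold
        self._counts = defaultdict(int)  # (subscriber, day) -> SMS count
        self._offered = set()            # (subscriber, day) already qualified

    def on_event(self, subscriber, day, event_type):
        if event_type != "SMS":
            return None
        key = (subscriber, day)
        self._counts[key] += 1
        if self._counts[key] > self._threshold and key not in self._offered:
            self._offered.add(key)
            return {"subscriber": subscriber, "offer": "discounted SMS package"}
        return None


rule = SmsOfferRule()
offers = []
for n in range(1, 13):  # twelve SMS events on the same day
    offer = rule.on_event("sub-1", "day-1", "SMS")
    if offer is not None:
        offers.append((n, offer))

voice = rule.on_event("sub-1", "day-1", "VOICE")  # ignored by this rule
```

The offer fires on the 11th SMS and only once. For scale, 1 billion events per day averages to roughly 11,600 events per second, so the quoted peaks are about 3.5 to 4.3 times the daily average.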

Retail: In-Memory Product Catalog
- Product MDM implemented using TIBCO MDM
- Used by approximately applications
- The challenge: the load generated by all those applications is too high for MDM to handle directly
- ActiveSpaces is used to speed up data access
  - Applications read from and write to the data grid rather than hitting MDM directly
  - A service layer on top of ActiveSpaces is implemented with BusinessWorks (ActiveSpaces plugin)
The Numbers
- 800 million records
- Peak load: 200,000 operations per second
- AS: 16 VMs (8 cores, 128 GB of RAM each)
- 50% growth expected in the next 6 months
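The numbers on this slide allow a rough back-of-the-envelope sizing. The calculation below assumes decimal gigabytes and that all RAM holds a single copy of the data, so it gives an upper bound on the memory budget per record; any replication or overhead would reduce it. The even spread of load across VMs is also an assumption.

```python
VMS = 16
RAM_PER_VM_GB = 128
RECORDS = 800_000_000
PEAK_OPS_PER_SEC = 200_000
GROWTH = 0.50  # 50% growth expected in the next 6 months

total_ram_bytes = VMS * RAM_PER_VM_GB * 10**9          # 2,048 GB in total
bytes_per_record_now = total_ram_bytes / RECORDS        # upper bound today
records_after_growth = int(RECORDS * (1 + GROWTH))
bytes_per_record_later = total_ram_bytes / records_after_growth
ops_per_vm = PEAK_OPS_PER_SEC / VMS                     # if load spreads evenly

print(f"{bytes_per_record_now:.0f} bytes/record now")
print(f"{bytes_per_record_later:.0f} bytes/record after growth")
print(f"{ops_per_vm:.0f} operations/second per VM at peak")
```

Under these assumptions the budget is at most about 2.5 KB per record today and about 1.7 KB after the expected growth, unless more VMs are added.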

Retail: In-Memory Inventory
- Real-time store inventory for US national retailers
- Problem: too many layers and too much delay introduce incoherence
- Buy online, pick up at the store
- Smart fulfillment
- Opens new possibilities: “triggers” outside of the System of Record (SOR) servers

Retail Banking
- Fast temporary shared storage for EDI context data
- Fast account lookup for Monetary Authority of Singapore G3 rules
- Real-time transaction settlement
- Real-time alerts on account activity
Diagram labels:
- EDI Service: Sender EDI Process, Receive EDI Process
- EAI hub (TIBCO BusinessWorks): Sender EDI Process, Receive EDI Process
- External Partners and Banks, VAN, Space
- Steps: send message requesting to partner; put message in AS with GUID key; send request message; receive message; take message from AS with GUID key; reply asynchronously with sender’s GUID
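The put and take steps with a GUID key amount to correlating an asynchronous reply with the context of its original request. A minimal sketch of that pattern follows; `ContextSpace`, `send_request`, `on_reply` and the message shapes are hypothetical and stand in for the space and the BusinessWorks processes shown in the diagram.

```python
import uuid


class ContextSpace:
    """Toy shared store for in-flight request context (illustrative sketch)."""

    def __init__(self):
        self._entries = {}

    def put(self, key, value):
        self._entries[key] = value

    def take(self, key):
        # Read and remove in one step, so a context is consumed at most once
        return self._entries.pop(key, None)

    def __len__(self):
        return len(self._entries)


def send_request(space, outbox, payload):
    guid = str(uuid.uuid4())
    space.put(guid, {"payload": payload})           # park the context under the GUID
    outbox.append({"guid": guid, "body": payload})  # message sent to the partner
    return guid


def on_reply(space, reply):
    context = space.take(reply["guid"])
    if context is None:
        return None  # unknown GUID or reply already handled
    return {"request": context["payload"], "response": reply["body"]}


space = ContextSpace()
outbox = []
guid = send_request(space, outbox, "payment-order-1")

# Later, possibly in a different process, the partner's reply arrives
result = on_reply(space, {"guid": guid, "body": "accepted"})
duplicate = on_reply(space, {"guid": guid, "body": "accepted"})
```

Because `take` removes the entry, the temporary context does not accumulate and a duplicate reply is ignored.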

Transportation: Real-time Tracking/Incident Management
- Real-time tracking of packages, containers, ships, etc.
- Incident management
  - Equipment breaking triggers a plane change
- Real-time ODS for train schedule status

Capital Markets
- Real-time distributed processing
- Thousands of accounts, each with hundreds of positions
  - Positions to be updated for all accounts as market data updates arrive
  - Risk calculation
- Thousands of market data updates per second result in many hundreds of thousands of position updates per second
- AS shows linear scalability into the millions of updates per second
- “Affinity” to leverage data locality is key to performance and scalability
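One way to read “affinity” is that positions are partitioned by the same key as the market data that updates them, so each price update is processed entirely on one node. The sketch below assumes partitioning by instrument symbol; the `Grid` and `Partition` classes, the CRC32 placement and the symbols are illustrative assumptions only.

```python
import zlib
from collections import defaultdict


class Partition:
    """Holds the positions for the symbols it owns (illustrative sketch)."""

    def __init__(self):
        self.positions = defaultdict(dict)  # symbol -> {account: quantity}
        self.values = {}                    # (account, symbol) -> market value

    def on_price(self, symbol, price):
        # Revalue every local position in this symbol; no remote lookups needed
        updated = 0
        for account, quantity in self.positions.get(symbol, {}).items():
            self.values[(account, symbol)] = quantity * price
            updated += 1
        return updated


class Grid:
    def __init__(self, partition_count):
        self.partitions = [Partition() for _ in range(partition_count)]

    def owner(self, symbol):
        index = zlib.crc32(symbol.encode()) % len(self.partitions)
        return self.partitions[index]

    def add_position(self, account, symbol, quantity):
        self.owner(symbol).positions[symbol][account] = quantity

    def on_price(self, symbol, price):
        # Route the update to the single partition that owns the symbol
        return self.owner(symbol).on_price(symbol, price)


grid = Grid(partition_count=4)
grid.add_position("acct-1", "AAA", 100)
grid.add_position("acct-2", "AAA", 50)
grid.add_position("acct-1", "BBB", 10)

updated = grid.on_price("AAA", 24.0)  # one market data update, two position updates
holders = sum(1 for p in grid.partitions if "AAA" in p.positions)
```

A single price update fans out into several position updates, all on the partition that already holds the data, which is how a few thousand market data updates per second can turn into hundreds of thousands of position updates without cross-node traffic.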

Emergence of a Generic Pattern
[Diagram labels: ActiveSpaces ODS, Correlation, Aggregation, Historical CEP Analysis, MDM, Closing the loop, Application]

Next Steps
- Check out a demo at our booth
  - Win an AnkiDrive artificial intelligence race car!
- Download a 90-day trial: activespaces.tibco.com
- Learn more, discuss, support: tibco.com, tibcommunity.com

Questions?