© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Enabling Real Time Data Analysis.

Slides:

Advertisements

Similar presentations

Fast Data at Massive Scale Lessons Learned at Facebook Bobby Johnson.

Advertisements

Copyright © 2007, GemStone Systems Inc. All Rights Reserved. Optimize computations with Grid data caching OGF21 Jags Ramnarayan Chief Architect, GemFire.

Case Study: Photo.net March 20, What is photo.net? An online learning community for amateur and professional photographers 90,000 registered users.

Extreme Performance with Oracle Data Warehousing

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide

13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.

© Bharati Vidyapeeths Institute of Computer Applications and Management, New Delhi © Bharati Vidyapeeths Institute of Computer Applications and.

Chapter 10: Designing Databases

An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.

Database System Concepts and Architecture

1. SQL Server 2014 In-Memory by Design Arthur Zubarev June 21, 2014.

Chapter 13 The Data Warehouse

SSRS 2008 Architecture Improvements Scale-out SSRS 2008 Report Engine Scalability Improvements.

Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.

Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:

Multi-granular, multi-purpose and multi-Gb/s monitoring on off-the-shelf systems TELE9752 Group 3.

Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.

Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:

Engine Design: Stream Operators Everywhere Theodore Johnson AT&T Labs – Research Contributors: Chuck Cranor Vladislav Shkapenyuk.

Components and Architecture CS 543 – Data Warehousing.

Physical Database Monitoring and Tuning the Operational System.

Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.

Chapter 14 The Second Component: The Database.

Applications : Network Monitoring Theodore Johnson AT&T Labs – Research Contributors: Chuck Cranor Vladislav Shkapenyuk Oliver.

Data Warehousing: Defined and Its Applications Pete Johnson April 2002.

Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.

Lecture On Database Analysis and Design By- Jesmin Akhter Lecturer, IIT, Jahangirnagar University.

HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.

TM 7-1 Copyright © 1999 Addison Wesley Longman, Inc. Physical Database Design.

Heartbeat Mechanism and its Applications in Gigascope Vladislav Shkapenyuk (speaker), Muthu S. Muthukrishnan Rutgers University Theodore Johnson Oliver.

Sensor Database System Sultan Alhazmi

OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.

Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia

C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.

6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.

MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.

Jennifer Rexford Princeton University MW 11:00am-12:20pm Measurement COS 597E: Software Defined Networking.

Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.

INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.

Chapter 9 Database Systems © 2007 Pearson Addison-Wesley. All rights reserved.

Microsoft Management Seminar Series SMS 2003 Change Management.

MGA Duplica Replication Tool. 1. High Availability and Avoidance of Data Loss  Replicate to alternate databases 2. Split activities across databases.

Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.

Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.

Yaping Zhu with: Jennifer Rexford (Princeton University) Aman Shaikh and Subhabrata Sen (ATT Research) Route Oracle: Where Have.

Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.

Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.

SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.

1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.

Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |

Gorilla: A Fast, Scalable, In-Memory Time Series Database

How To Build a Real-time Warehouse Theodore Johnson Vladislav Shkapenyuk April, 2010.

1 Out of Order Processing for Stream Query Evaluation Jin Li (Portland State Universtiy) Joint work with Theodore Johnson, Vladislav Shkapenyuk, David.

Gigascope A stream database for network monitoring

Understanding and Improving Server Performance

Distributed, real-time actionable insights on high-volume data streams

Data Platform and Analytics Foundational Training

Integrating the R Language Runtime System with a Data Stream Warehouse

Open Source distributed document DB for an enterprise

Applying Control Theory to Stream Processing Systems

Data warehouse and OLAP

The Client/Server Database Environment

Exploring Azure Event Grid

SONATA: Query-Driven Network Telemetry

Predictive Performance

Physical Database Design

AWS Cloud Computing Masaki.

Control Theory in Log Processing Systems

Presentation transcript:

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Enabling Real Time Data Analysis Divesh Srivastava, Lukasz Golab, Rick Greer, Theodore Johnson, Joseph Seidel, Vladislav Shkapenyuk, Oliver Spatscheck, Jennifer Yates AT&T Labs - Research

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Evolution of Data Analysis 1980s 1990s2000s 2010s Analysis Offline reports OLAP, ad hoc analysis Streaming queries Real time analysis Drivers Banking, airlines Sales, CRM, Marketing Alerting, Fraud Security, Healthcare Transactional databases Separate DSMS, DW Integrated DSMS+SDW Data warehouses

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Challenge: Enabling Real Time Analysis TroubleshootingNetwork alerts Customer trouble tickets Routing reports Syslogs SONET Performance reports Workflow Traffic Network

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Non-Solution: Traditional Warehouses Troubleshooting Network alerts Customer trouble tickets Routing reports Syslogs SONET Performance reports Workflow Traffic Network Traditional Data Warehouse Troubleshooting

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Solution: Streaming Data Management Troubleshooting Network alerts Customer trouble tickets Routing reports Syslogs SONET Performance reports Workflow Traffic Network Integrated DSMS + SDW

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Goal: Integrated DSMS + SDW Storage – Long term storage of historical data in tables, materialized views – Update tables, materialized views in real time (minutes, not hours) Queries – Aggregate massive volumes (Gbit/sec) of streaming data – Join across multiple data sources – Correlate real-time data with historical data – Real-time alerting (reaction time in minutes, not days)

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Outline Motivation Data stream management systems – GS tool Database management systems: the prequel – Daytona Streaming data warehouses: the sequel – Data Depot + Bistro

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool GS tool is a fast, flexible data stream management system – High performance at speeds up to OC768 (2 x 40 Gbits/sec) – GSQL queries support SQL-like functionality Monitoring platform of choice for AT&T networks Developed at AT&T Labs-Research – Collaboration between database and networking research

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool: GSQL Queries GSQL queries support – Filtering, aggregation – Merges, joins – Tumbling windows Arbitrary code support – UDFs (e.g., LPM), UDAFs GSQL query paradigm – Streams-in, stream-out – Permits composability

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Example: TCP SYN Flood Detection Attack characteristic: exploits 3-way TCP handshake Attack detection: correlate SYN, ACK packets in TCP stream define { query_name toomany_syn; } select A.tb, (A.cnt – M.cnt) outer_join from all_syn_count A, matched_syn_count M where A.tb = M.tb define { query_name all_syn_count; } select S.tb, count(*) as cnt from tcp_syn S group by S.tb define { query_name matched_syn_count; } select S.tb, count(*) as cnt from tcp_syn S, tcp_ack A where S.sourceIP = A.destIP and S.destIP = A.sourceIP and S.sourcePort = A.destPort and S.destPort = A.sourcePort and S.tb = A.tb and (S.sequence_number+1) = A.ack_number group by S.tb

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool : Scalability GS tool is a fast, flexible data stream management system – High performance at speeds up to OC768 (2 x 40 Gbits/sec) Scalability mechanisms – Two-level architecture: query splitting, pre-aggregation – Distribution architecture: query-aware stream splitting – Unblocking: reduce data buffering – Sampling algorithms: meaningful load shedding

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Performance of Example Deployment GS tool is configured to track – IP addresses – Latency and loss statistics – ICMP unreachable messages GS tool monitors at peak times – users – > 1 million packets/second – > 1.4 Gbit/sec

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool: Two-Level Architecture Low-level queries perform fast selection, aggregation – Significant data reduction – Temporal clustering in IP data High-level queries complete complex aggregation NIC Ring Buffer Low High App Low

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool: Low-Level Aggregation Fixed number of group slots, fixed size slot for each group – No malloc at run-time Direct-mapped hashing Optimizations – Limited hash chaining reduces eviction rate – Slow eviction of groups when epoch changes Fixed-size slots Fixed number of slots groupaggregate data Eviction on collision

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool: Query Splitting select tb, destIP, sum(sumLen) from SubQ group by tb, destIP having sum(cnt) > 1 define { query_name SubQ; } select tb, destIP, sum(len) as sumLen, count(*) as cnt from TCP where protocol = 6 and destPort = 25 group by time/60 as tb, destIP define { query_name smtp; } select tb, destIP, sum(len) from TCP where protocol = 6 and destPort = 25 group by time/60 as tb, destIP having count(*) > 1

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool: Data Reduction Rates Significant data reduction through low-level aggregation Typical applications – App1: example deployment – App2: detailed accounting in 1 minute granularity – App3: detailed accounting, P2P detection and TCP throughput monitoring

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Distributed GS Tool Problem: OC768 monitoring needs more than one CPU – 2x40 Gb/sec = 16M pkts/sec – Need to debug splitter, router Solution: split data stream, process query, recombine partitioned query results For linear scaling, splitting needs to be query-aware splitter GS1GS2GSn Gigabit Ethernet High speed (OC768) stream

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Query-Unaware Stream Splitting GS 1 flows hflows GS n flows U round robin flows define { query_name flows; } select tb, srcIP, destIP, count(*) from TCP group by time/60 as tb, srcIP, destIP define { query_name hflows; } select tb, srcIP, max(cnt) from flows group by tb, srcIP

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. GS Tool : Query-Aware Stream Splitting define { query_name flows; } select tb, srcIP, destIP, count(*) from TCP group by time/60 as tb, srcIP, destIP define { query_name hflows; } select tb, srcIP, max(cnt) from flows group by tb, srcIP GS 1 flows hflows GS n flows hflows U hash(srcIP)

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Query-Aware Stream Splitting at Scale

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Distributed GS Tool: First Prototype

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Outline Motivation Data stream management systems – GS tool Database management systems: the prequel – Daytona Streaming data warehouses: the sequel – Data Depot + Bistro

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Daytona Daytona is a highly scalable and robust data management system – Organizes and stores large amounts of data and indices on disk – Enables concise, high-level expression of sophisticated queries – Provides answers to sophisticated queries quickly – Manages data in a concurrent, crash-proof environment Data management system of choice for AT&Ts largest databases – Production quality system since 1989 Developed at AT&T Labs-Research

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Daytona: Cymbal Queries High-level, multi-paradigm programming language – Full capability of SQL data manipulation language – First-order symbolic logic (including generalized transitive closure) – Set theory (comprehensions) Sophisticated bulk data structures – Tables with set- and list-valued attributes on disk – Scalar- and tuple-valued multi-dimensional associative arrays – Boxes (in-memory collections) with indexing and sorting Synergy of complex processing and data access

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Daytona: Scalability August 2010 – ~ 1PB norm. data volume – 1.25T records in largest table Scalability mechanisms – Compilation architecture – Horizontal partitioning – Data compression – SPMD parallelization

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Typical DBMS Server Architecture DBMS server process provides – File system – Networking – Threads – Scheduling – Memory management – Caching – … – Query processing Queries Answers DBMS Server Operating System Queries Answers Client BClient A

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Compilation Architecture for Speed Highly customized compilation – SQL Cymbal C – Enables using shared libraries No DBMS server processes – Query executable uses OS Answer Daytona Executable Operating System Parameters

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Horizontal Partitioning for Capacity TABLE FILES PARTITIONED BY BILLS Month Region 3-10 NW 4-10 NW 4-10 SE /u1/file1 /u2/file2 /u3/file3 Unbounded data storage – 1.25T records in 135K files Fast bulk data adds/deletes – Can maintain windows Disk load balancing Serves as another indexing level – Partition directory

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Outline Motivation Data stream management systems – GS tool Database management systems: the prequel – Daytona Streaming data warehouses: the sequel – Data Depot + Bistro

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. What do you do with all the Data? Network alerts Customer trouble tickets Routing reports Syslogs SONET Performance reports Workflow Traffic Network Streaming Data Warehouse

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. What is a Streaming Data Warehouse? Conventional data warehouse – Provides long term data storage and extensive analytics facilities – Supports deeply nested levels of materialized views – Updates to tables and materialized views performed in large batch Streaming data warehouse – Like a conventional data warehouse, but supports continuous, real time update of tables and materialized views – Like a DSMS, but provides long term data storage and supports deeply nested levels of materialized views

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot Simplify stream warehousing – Manage Daytona warehouses Cascaded materialized views – Time-based partitioning – Real-time + historical tables View update propagation – Update scheduling Concurrency control

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Time-Based Partitioning Time-based partitioning of raw tables, materialized views – Daytona horizontal partitions – Roll in at leading edge – Roll off at trailing edge Set partition size to be the real-time update increment – Avoid index rebuilding

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: RT + Historical Tables Issue: RT and historical tables are optimized differently – Small partitions for RT tables – Large partitions for historical Solution: newest part of table is RT, oldest part is historical – Combined RT+historical table – Transparent query access to data in combined table New data Historical tableRT table Combined RT+historical table

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Update Propagation Update propagation through partition dependencies Overload situation common – Need update scheduling Basic algorithm – Determine source partitions of a derived partition – Recompute derived partition if a source partition changes

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Partition Dependencies CREATE TABLE PROBES_BLACK2RED AS SELECT TimeStamp, Source, Destination FROM PROBES_BLACK P WHERE NOT EXISTS (SELECT B.Timestamp, B.Source, B.Destination FROM PROBES_BLACK B WHERE B.Source = P.Source AND B.Destination = P.Destination AND B.Timestamp = P.Timestamp – 1) PROBES_BLACK PROBES_BLACK2RED

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Update Scheduling Make-style protocol is wrong – Track last-update timestamp – Update if source is newer Vector-timestamp style model – Record all dependencies – Eventual consistency easy – Mutual consistency hard – Trailing edge consistency is the best compromise B1B2 D1D2 D X X X X X

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Consistency Provides eventual consistency – All data depends on raw data – Convergence after all updates get propagated What about before eventually – Leading edge consistency: use all the data that is available – Trailing edge consistency: use only stable data S1S2D Leading Edge Trailing Edge

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Depot: Cascaded Views Example Application to measure loss and delay intervals in real time – Raw data every 15 min, 97% of update propagation within 5 min

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Bistro Weakest link: when are base tables ready for a refresh? – Every 5 min 2.5 min delay Bistro: data feed manager – Pushes raw data from sources to clients with minimal delay – Provides delivery triggers – Monitors quality, availability – Manages resources Russian быстро means quickly Feeds Subscribers Raw files

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Data Feed Management Before Bistro Data feed management has largely been an ad hoc activity – Shell scripts invoked using cron, heavy usage of rsync and find Cron jobs – Propagation delays, can step on previous unfinished scripts – No prioritized resource management Rsync – Lack of notification on client side (no real-time triggers) – Clients must keep identical directory structure and time window – No systemic performance and quality monitoring

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Bistro: Scalability, Real-Time Features Maintain database of transferred raw files – Avoid repeated scans of millions of files (e.g., find) – Different clients can keep differently sized time windows of data Intelligent transfer scheduling – Deal with long periods of client unavailability – Parallel data transfer trying to maximize the locality of file accesses Real-time triggers – Notify clients about file delivery, batch notifications

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Real-Time Darkstar Enabled by Daytona, Data Depot, Bistro As of June 2010 – 183 data feeds, 607 tables – 30.9 TB data Real-time analysis applications – PathMiner: analyze problems on path between 2 endpoints – NICE: mine for root causes of network problems Darkstar

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Summary What have we accomplished? – Built some cool systems: Daytona, Data Depot, GS tool, Bistro – Enabled very high-speed streaming analysis using GS tool – End-to-end solution for SDW using Daytona, Data Depot, Bistro Whats next? – Existing systems continue to evolve in scale and functionality – New systems being built (e.g., Data Auditor for monitoring quality) – Integrated DSMS + SDW: challenging research problems

© 2010 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property. Parting Thoughts Enabling real time data analysis is critical – Once data is collected, it should be available for analysis right away – Need to support artifacts of data analysis in the database Need to have an end-to-end solution for RT data management – Our focus has been on managing data once it is in the database – We should do research on the entire pipeline from data generation to data management to data usage Exciting journey: should keep us busy for quite a while!