Time Series Data Repository


Time Series Data Repository (TSDR)
Presentation transcript:

Time Series Data Repository
Contributions from: Cisco, Dell, Tata Consultancy Services, xFlow Research, and OpenNMS
#ODSummit

The Power of Time Series Data

What is Time Series Data
Time series data is a sequence of data points with time stamps:
- Measurements
- Log files
- Events generated from machines or software
A huge amount of time series data is generated every day by:
- Cloud infrastructures
- Software applications
- Network equipment
- Security appliances
- IoT devices

Why we need time series data analysis
The power of time series data analysis lies in leveraging what happened in the past (historical view) together with what is happening now (real-time view) to predict what is going to happen next (predictive data analysis) and to take proactive actions (prescriptive data analysis with automation).
Time series data analysis has been successful in many areas, including:
- Financial markets
- Weather forecasting
- Economics
- Health care
- Insurance
The goal of TSDR in ODL is to apply time series data analysis to SDN.
Big data technologies make time series data analysis possible on high-velocity data.

Example Use Case – Traffic congestion prediction with automated control

Other example data-driven applications
- Traffic classification
- Congestion control
- Traffic pattern prediction
- Traffic redirection with route analysis
- Network issue event prediction
- Security and auditing analysis
- Troubleshooting network problems
- Resource optimization
- Network performance analysis

TSDR Objectives in OpenDaylight

TSDR goals in OpenDaylight
- To help with the scalability and performance of the ODL controller
  - In Helium, time series data such as OpenFlow stats was only available from the InMemory data store.
  - In Helium, OpenFlow stats data started to drop from the InMemory data store after three seconds in large deployment scenarios.
  - TSDR leverages the Stats Plane concept to separate time series data processing from the control plane and data plane.
- To enable and encourage data-driven applications built on the ODL controller
  - For example, a traffic pattern prediction and reconfiguration app could be built on top of the ODL controller and TSDR.
- To help create an intelligent, 'smart' controller
  - Various data-driven applications leverage data from TSDR and feed the analytics results back to the SDN controller for dynamic flow configuration.

To realize SDN Stats Plane using TSDR
- Separates statistics collection and storage from the control plane.
- Generic, extensible, and elastic architecture framework supporting various types of time series data.
- Creates a new data-driven application platform for SDN.
[Architecture diagram. Component labels: Third Party Analytics Engine; Control Plane; Data; Stats; SNMP Collector; Notification Collector; sFlow Collector; Syslog Collector; Data Collection Service; Data Storage Service; TSDR Data Model; TSDR Persistence Service; HBase Plugin; Cassandra Plugin; H2 Plugin; Data Query Service; Data Aggregation Service; Data Purging Service; Northbound API; Software Defined Network]

To provide a generic platform for time series data
- A data collection framework
  - To incorporate a broad range of data collectors for different types of time series data.
  - To facilitate open integration by specifying polling, pushing, and notification interfaces for time series data collection.
- A common data model
  - To transform different types of time series data into a common data representation format.
- A scalable and pluggable data repository
  - To store large amounts of time series data.
  - To allow different types of data stores to be plugged in.
- A generic open integration API
  - For integration with third-party analytics engines.
- An optimized time series data maintenance solution
  - Periodic data aggregation and purging solutions optimized for time series data.
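The pluggable repository idea can be illustrated with a minimal sketch. TSDR itself is implemented in Java on MD-SAL; the Python below is not its API. Every name here (`Record`, `DataStore`, `InMemoryStore`, `PersistenceService`) is hypothetical and only shows how a persistence layer could delegate to interchangeable data store plugins behind one common record format.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Record:
    """One data point in a common representation (hypothetical fields)."""
    category: str     # e.g. "FlowStats"
    key: str          # identifies the measured object
    timestamp: float  # seconds since the epoch
    value: float


class DataStore(ABC):
    """Hypothetical plugin interface that each data store would implement."""

    @abstractmethod
    def store(self, record: Record) -> None:
        ...

    @abstractmethod
    def query(self, category: str, start: float, end: float) -> List[Record]:
        ...


class InMemoryStore(DataStore):
    """Toy plugin standing in for a real backend such as HBase or Cassandra."""

    def __init__(self) -> None:
        self._records: List[Record] = []

    def store(self, record: Record) -> None:
        self._records.append(record)

    def query(self, category: str, start: float, end: float) -> List[Record]:
        matches = [
            r for r in self._records
            if r.category == category and start <= r.timestamp <= end
        ]
        return sorted(matches, key=lambda r: r.timestamp)


class PersistenceService:
    """Callers depend on this layer only; the plugin behind it can be swapped."""

    def __init__(self, plugin: DataStore) -> None:
        self._plugin = plugin

    def store(self, record: Record) -> None:
        self._plugin.store(record)

    def query(self, category: str, start: float, end: float) -> List[Record]:
        return self._plugin.query(category, start, end)
```

Swapping the backend then means constructing `PersistenceService` with a different `DataStore` subclass, while collectors and query clients stay unchanged.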

To enable advanced analytics for business optimization
- Descriptive time-based data analytics on different data sources
  - Leveraging the common time series data model.
  - Leveraging time stamps that are common in the data model.
  - Leveraging integration with third-party data analytics engines or visualization tools.
- Predictive and prescriptive data analytics
  - Automated pattern discovery.
  - Event prediction based on time series data analytics.
  - Automated correlation among multiple data sources.
  - Prescriptive actions based on the advanced analytics results.
  - By integration with advanced data analytics engines.
- Automation based on analytics results
  - Automation actions triggered from analytics results for SDN controller optimization.
  - Integration with the ODL controller for reconfiguration and redirection of traffic flows.

To combine real-time and historical analytics
- Streaming data processing for real-time data analysis
  - Apply streaming data processing technologies for real-time data analysis.
  - Apply advanced data analytics on real-time streaming data.
  - Enable real-time automated actions for business optimization.
- Scalable data storage for historical view
  - Capture large amounts of streaming data within a limited time window.
  - Support active queries against the large time series data repository with reasonable response time.
- Feedback of historical data analytics results into real-time automation
  - Provide the capability of feeding historical data analytics results back into the real-time automation engine.
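One simple way to picture the feedback from historical analytics into real-time processing is a threshold learned from stored data and applied to a live stream. The sketch below is an assumption for illustration only: the slides do not name an algorithm, and the mean-plus-k-standard-deviations rule, the function names, and the sample values are all hypothetical.

```python
from statistics import mean, pstdev


def baseline_from_history(history, k=3.0):
    """Derive an alert threshold from historical samples.

    Threshold = mean + k * population standard deviation (an assumed rule).
    """
    return mean(history) + k * pstdev(history)


def detect(stream, threshold):
    """Yield (timestamp, value) pairs from the live stream that exceed the threshold."""
    for timestamp, value in stream:
        if value > threshold:
            yield timestamp, value


# Historical view: link utilization samples read back from the repository.
history = [10, 12, 11, 9, 10, 8]
threshold = baseline_from_history(history)

# Real-time view: new samples arriving from a collector.
live = [(1, 11), (2, 14), (3, 13.5)]
alerts = list(detect(live, threshold))
```

In a real deployment the alerts would feed an automation engine rather than a list; that integration is outside the scope of this sketch.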

TSDR Capabilities and Architecture Framework Roadmap
[Diagram legend: Control Flow, Data Flow]

TSDR in OpenDaylight architecture

TSDR Integrated Architecture in ODL
- TSDR data services, including Data Collection, Data Storage, Data Query, Data Purging, and Data Aggregation, are MD-SAL services.
- The Data Collection service receives time series data published on MD-SAL by MD-SAL southbound plugins.
- The Data Collection service communicates with the Data Storage service to store the data into TSDR.
- TSDR data services access TSDR data stores, such as the HBase data store, through the generic TSDR Data Persistence Layer.

Functions and Capabilities delivered in Beryllium Release – (1)
- Data Collection
  - A notification-based data collector to collect OpenFlow stats in the network.
- Common Data Model
  - The first version of the time series data model, which incorporates measurements and log entries.
- Data Storage
  - TSDR persistence layer with SPI.
  - Two TSDR data stores: HBase (NoSQL) and H2 (SQL).
  - Note: a single-node HBase deployed on the same host as the ODL controller is supported in Lithium.
- Pluggable Data Stores
  - HBase
  - Cassandra
  - HSQLDB

Functions and Capabilities delivered in Beryllium Release – (2)
- Data Query
  - "tsdr:list" command to query the data from TSDR data stores.
      tsdr:list {Category} [StartTime] [EndTime]
      Example: tsdr:list FlowStats '07/20/2015 08:00:00 AM' '07/20/2015 08:15:00 AM'
  - This command gives the latest 1000 records from the TSDR data store that match the data category and time range.
  - Data Query Service to support REST APIs for retrieving data from TSDR.
- Grafana visualization integration
  - Grafana adapter in TSDR Northbound to support the visualization of the data stored in TSDR.
- Data Purging Service
  - Periodically purges the data stored in TSDR.
  - Configuration available for the user to change the retention time, purging time, and frequency of purging.
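The query and purging behavior described above can be modeled in a few lines. This is a sketch of the semantics only, not the TSDR implementation: the tuple layout, the function names, and the newest-first ordering are assumptions, while the time format and the 1000-record cap come from the slide.

```python
from datetime import datetime, timedelta

TIME_FORMAT = "%m/%d/%Y %I:%M:%S %p"  # matches '07/20/2015 08:00:00 AM'
MAX_RECORDS = 1000                    # cap stated for tsdr:list


def parse_time(text):
    """Parse a time argument in the format used by the tsdr:list example."""
    return datetime.strptime(text, TIME_FORMAT)


def list_records(records, category, start=None, end=None, limit=MAX_RECORDS):
    """Return the latest `limit` records matching the category and time range.

    `records` is an iterable of (category, timestamp, value) tuples
    (a hypothetical layout). Results are ordered newest first (assumed).
    """
    matches = [
        r for r in records
        if r[0] == category
        and (start is None or r[1] >= start)
        and (end is None or r[1] <= end)
    ]
    matches.sort(key=lambda r: r[1], reverse=True)
    return matches[:limit]


def purge(records, now, retention):
    """Keep only records whose timestamp falls within the retention window."""
    cutoff = now - retention
    return [r for r in records if r[1] >= cutoff]
```

For example, `list_records(data, "FlowStats", parse_time('07/20/2015 08:00:00 AM'), parse_time('07/20/2015 08:15:00 AM'))` mirrors the command shown on the slide, and calling `purge` on a schedule models a purging service with a configurable retention time.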

TSDR Common Data Model
The TSDR common data model in ODL Lithium captures two types of time series data:
- Measurements
- Log entries
The common data model also supports two data granularities:
- Fine-grained raw data
- Aggregated roll-up data
Characteristics of the design:
- Generic
- Extensible
- Scalable
- Performance optimized
OpenFlow stats implementation delivered based on this data model:
- Flow Stats
- Interface Stats
- Group Stats
- Flow Table Stats
- Queue Stats
- Meter Stats
- NetFlow
- SNMP
- SYSLOG
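The relationship between the two granularities can be sketched as a roll-up step that groups fine-grained points into fixed time buckets. The bucket width, the choice of summary statistics, and the function name are assumptions for illustration; the slides do not specify how TSDR computes its aggregates.

```python
from collections import defaultdict


def roll_up(points, bucket_seconds):
    """Aggregate raw (timestamp_seconds, value) points into fixed-width buckets.

    Returns one summary dict per non-empty bucket, ordered by bucket start.
    """
    buckets = defaultdict(list)
    for timestamp, value in points:
        bucket_start = int(timestamp // bucket_seconds) * bucket_seconds
        buckets[bucket_start].append(value)
    return [
        {
            "start": start,
            "count": len(values),
            "min": min(values),
            "max": max(values),
            "avg": sum(values) / len(values),
        }
        for start, values in sorted(buckets.items())
    ]


# Fine-grained raw data: (timestamp in seconds, measured value).
raw = [(0, 1.0), (30, 3.0), (60, 10.0), (119, 20.0), (120, 5.0)]
rolled = roll_up(raw, bucket_seconds=60)
```

Storing `rolled` alongside, or instead of, aged raw points is what lets long retention periods stay affordable.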

TSDR Roadmap

TSDR in ODL AI/ML initiatives for advanced analytics
- Enable AI/ML on both historical and real-time data paths.
- Many use cases would require both offline and online ML on the time series data.
- External events could be additional input for accurate machine learning results.
- Feed the results back to the SDN control path for automatic traffic steering and policy placement.
- Well-defined interfaces among the components, towards future standardization of advanced analytics in SDN.

TSDR in Boron release and beyond
- IoTDM integration for IoT sensor data
  - Provides support for a new Binary data type.
- New data store support (Elasticsearch)
  - Plug in Elasticsearch as the fourth type of data store.
- Multiple data store support at runtime
  - Support multiple data stores running at runtime, so that different types of data can be pushed to different data stores.
- JDBC driver and SQL parser on northbound of TSDR
  - Provide a JDBC driver from TSDR northbound for third-party analytics tool integration.
- Data Aggregation
- Performance and Scalability Testing and Benchmarking
- Performance Enhancement
- Advanced Analytics PoC
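Pushing different data types to different stores at runtime amounts to a routing table in front of the stores. The sketch below is hypothetical throughout: the `Router` class, the type labels, and the use of plain lists as stand-in stores are assumptions, not TSDR's design.

```python
class Router:
    """Route each record to the store registered for its data type."""

    def __init__(self, default_store):
        self._routes = {}
        self._default = default_store

    def register(self, data_type, store):
        """Bind a data type label to a store; can be called at runtime."""
        self._routes[data_type] = store

    def push(self, data_type, record):
        """Append the record to the matching store, or to the default one."""
        store = self._routes.get(data_type, self._default)
        store.append(record)
        return store


# Plain lists stand in for real data stores in this sketch.
metrics, logs, fallback = [], [], []
router = Router(default_store=fallback)
router.register("METRIC", metrics)
router.register("LOG", logs)

router.push("METRIC", {"name": "FlowStats", "value": 42})
router.push("LOG", "link down on port 3")
router.push("BINARY", b"\x00\x01")  # no route registered, goes to the default
```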

TSDR Links and references in OpenDaylight
- TSDR Main page
  https://wiki.opendaylight.org/view/TSDR:Main
- TSDR Related Presentations and Demos
  https://www.youtube.com/watch?v=6CK4a9Cz7mo
  https://www.youtube.com/watch?v=itFJav91P94
- TSDR Installation Guide and User Guide
  https://github.com/opendaylight/docs/blob/master/manuals/getting-started-guide/src/main/asciidoc/tsdr/tsdr-installation-guide.adoc
  https://github.com/opendaylight/docs/blob/master/manuals/user-guide/src/main/asciidoc/tsdr/tsdr-user-guide.adoc
- TSDR and ODL AI/ML Work Group
  https://wiki.opendaylight.org/view/Weekly_Meetings_with_Recording

TSDR Beyond OpenDaylight

TSDR use cases beyond OpenDaylight
With ODL being adopted and used in various customer scenarios, TSDR could be used in the following possible use cases:
- IoT applications
- NFV use cases
- Software Defined Data Center analysis and automation

Other TSDR example use cases
- Using TSDR in IoT data analytics applications
  - With its generic, open, elastic, and scalable architecture, TSDR could fit into a multi-level IoT data analytics architecture.
- Using TSDR for NFV data analysis and automation
  - As a generic, model-based time series data platform, TSDR could be used for data analytics in an NFV architecture, optionally deployed together with an SDN controller.
- Using TSDR in software defined data center analytics
  - For example, DDoS attack detection and automated protection against such attacks.

Demo

TSDR Demo Set up

Visualization of data from TSDR data stores

Thank You