Concept demo System dashboard. Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization.

Slides:



Advertisements
Similar presentations
Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Advertisements

Visual Scripting of XML
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
General introduction to Web services and an implementation example
Messaging and Data integration models are used in Phase I of the MCAS project to connect set of message enabled services into a workflow for efficient.
SOAP.
Technical BI Project Lifecycle
Management Information Systems, Sixth Edition
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
© 2004, The Trustees of Indiana University 1 OneStart Workflow Basics Brian McGough, Manager, Systems Integration, UITS Ryan Kirkendall, Lead Developer.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Software Architecture Patterns (2). what is architecture? (recap) o an overall blueprint/model describing the structures and properties of a "system"
Peoplesoft: Building and Consuming Web Services
About the Presentations The presentations cover the objectives found in the opening of each chapter. All chapter objectives are listed in the beginning.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
TIBCO Designer TIBCO BusinessWorks is a scalable, extensible, and easy to use integration platform that allows you to develop, deploy, and run integration.
Hadoop: The Definitive Guide Chap. 8 MapReduce Features
HADOOP ADMIN: Session -2
DSpace XML UI Project Texas A&M University Digital Initiatives, Research and Technology Scott Phillips, Cody Green, Alexey Maslov, Adam Mikeal, Brian Surratt,
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
A First Program Using C#
JSP Standard Tag Library
Chapter 11 Databases.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
XForms: A case study Rajiv Shivane & Pavitar Singh.
Overview of Previous Lesson(s) Over View  ASP.NET Pages  Modular in nature and divided into the core sections  Page directives  Code Section  Page.
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
Object and component “wiring” standards This presentation reviews the features of software component wiring and the emerging world of XML-based standards.
Zhonghua Qu and Ovidiu Daescu December 24, 2009 University of Texas at Dallas.
Fundamentals of Information Systems, Fifth Edition
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
Chapter 7: Database Systems Succeeding with Technology: Second Edition.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
Web Programming : Building Internet Applications Chris Bates CSE :
Supporting High- Performance Data Processing on Flat-Files Xuan Zhang Gagan Agrawal Ohio State University.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
SaveUML System design. System overview Possible...
Messaging and Data integration models are used in Phase I of the MCAS project to connect a set of message- enabled services into a workflow for efficient.
Introduction to the Adapter Server Rob Mace June, 2008.
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 14 Slide 1 Object-oriented Design.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
Enterprise Integration Patterns CS3300 Fall 2015.
Metadata Mòrag Burgon-Lyon University of Glasgow.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
JAVA BEANS JSP - Standard Tag Library (JSTL) JAVA Enterprise Edition.
Web Services from 10,000 feet Part I Tom Perkins NTPCUG CertSIG XML Web Services.
Effects of Visualization and Interface Design on User Comprehensibility of Composite Data Asheem Chhetri, Apoorv Wairagade, Mahesh Gorantla, Hanye Xu,
Web Technologies Lecture 10 Web services. From W3C – A software system designed to support interoperable machine-to-machine interaction over a network.
Apache PIG rev Tools for Data Analysis with Hadoop Hadoop HDFS MapReduce Pig Statistical Software Hive.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
Lecture 11 Introduction to R and Accessing USGS Data from Web Services Jeffery S. Horsburgh Hydroinformatics Fall 2013 This work was funded by National.
Keith Chadwick 1 Metric Analysis and Correlation Service. CD Seminar.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
J2EE Platform Overview (Application Architecture)
What is BizTalk ?
Hadoop.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Metric Correlation and Analysis Service (MCAS)
Metric Correlation and Analysis Service (MCAS)
Supporting High-Performance Data Processing on Flat-Files
OPeNDAP/Hyrax Interfaces
Presentation transcript:

Concept demo System dashboard

Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization model Plotting Service Usage Examples

Concept demo of the “troubleshooting dashboard” Understand compound state of the system in past by cross analyzing independent (and possibly many) data sources. The goal of the analysis is to spot trends and pattern in the change of the system state to develop some sort of systematic response procedure. Challenge variety of resources available to supply needed data. No common format or place where that data can be fetched Otherwise, grep and awk through log files Factor, scale,compare and display data on the fly

Time series Time series element describes state of the system at time T. Series show the evolution of system state (or its aspects) Time series are generated by the data source We want dashboard to index as many data sources as possible without enforcing strict content schema Use “dimensions” as the language to describe system aspects Time series element contains list of dimensions which collectively define an aspect. To optimize time series search queries, the dashboard will need to know the data source “schema” ( list of supportable dimensions)

Concept components Brian’s Bockelman gratia graphing and reporting tool Excellent resources which renders and makes available accounting data in XML. dCache memory plots by D. Litvintsev RRD toolkit (for visual reporting) Common graphing package for time series data Time series factorization tool. Developed by me. Implements rules for finding commonalities between different time series elements SMTP, Mime message, graphical dashboard A way to deliver results to user

Where Mule fits in ? Defines common data format to describe various system activities We’ve used time series of numerical values Defines work flow of services interacting with each other in accordance with user’s initial request Implements aggregation of data Integrates results into common reporting tools

Data source(s) Our scenario Data source Web frontend Log files Gratia dCache i.providers Fuzzy troubleshooter request Client Request results Render WareHouse Warehouse Service MapReduce [MULE] Data factorization and aggregation Request splitter Time. S factorization Aggregator

General design considerations Break up work flow into components that must access and maintain smallest possible, transient context Standalone service, pass through transformations Understand means of accepting, transforming and dispatching requests Map components into UMO models Map acceptance, transformation and dispatch to Mule endpoints, transformers and routers.

UI: Accepting requests Model We need a model that describes how we want to accept and dispatch out user requests to out “troublshooting/dashboard system”. In Mule, model may perform complex business related tasks or be simply pass though component linking input and output transformation logic. While both the model and Mule transformers “transform” data, the principal trait of the model is ability to maintain and use object state

UI: Accepting requests Model: input endpoints Define independent endpoints within the model " transformers=« MimeToString XMLToObjectt« /> Web Std in Each endpoint accepts data in a different format and uses specific transformer to converge to common type used down the chain.

Dispatching requests Our use case is to accept the request and multiplex that request to a set of data source providers for lookup. Use correlation to assemble results later Translate user query into the context specific to the data source

Data source(s) model Data source model adapts particular type of the data source. Gratia, dCache billing data, log mining, cache of the previously retrieved data Example: Gratia access though Brian B. web interface Use Mule RestServiceWrapper to accept messages and proxy them to a REST WEB service. For simplicity we don’t do any parsing or filtering here.

Data source input and output endpoints Input Output Content of the Brian’s web page (which is XML) is transformed to XML representing common time series data. That data is transformed to a java specific serialized XML representation and the translated to a vector of internal time series objects.

Aggregation/Factorization model The Mule pass though model is used to implement aggregation and factorization functional pieces. Data is aggregated at the input of the model Data is factorized at the output of the model No internal processing is needed.

Aggregation Time data aggregator waits and collects pieces from all data sources selected at the dispatch stage. TimeDataAggregator is a very simple class the defines a rule of how vectors of time series data should be joined into a new data type of higher level which instructs further steps of factorization and rendering.

Factorization We want to produce collection of plots from the input data array such that each plot has the same plotting instructions yet built from a subset of supplied data. Identify similarities across data array and generate set of independent reporting instances.

Mule mapping for factorization GroupBySplitterRouter implements router splitter interface. It uses message context to dissect factorization instructions and splits the message into pieces relevant only for each independent report (graph) Uses correlation to enable optional assembly of independent reports into a top level summary

PlottingService A stub for time series renderer May be replaced with any other render. JasperReport for ex. Model accepts time series messages with opaque instruction on how that time series must be rendered. In our case – plain RRD command. Model outputs UserPlotResponse that contains initial request along with the URL of the report file

UserPlotResponse aggregation Each UserPlotResponse is aggregated by the component responsible for notifying user with results of his request Uses correlation as set by factorization router-splitter

Example Your request to Mule stdin endpoint (for simplicity) Random id factor(Vos,$VO) testDimension=Running Jobs; RRD( AREA:testDimension#FF0000 ) Factor instructions define grouping by value of the VO field of the dataset (note : uses all data source data - no initial filtering of the data source, for simplicity ) RRD command follows the syntax of the RRD tools cmd line

Example output on your mailbox ……

Example 2 Your request sent to the SAME Mule stdin endpoint Random id factor(runningJobs,Running Jobs) testDimension=VO(dzero); testDimension1=VO(cms); RRD( LINE2:testDimension#FF0000 :D0 LINE2:testDimension1#0000FF:CMS) Factor defines class of time series that have “Running Jobs” dimension (which are all in the example) RRD expression plots CMS Vs D0 (true story)

Output in your mail box

capasityDim=Capacity; utilizationDim=Utilization; RRD( CDEF:diff=capasityDim,utilizationDim,- AREA:capasityDim#FF0000:capasity(Mb) AREA:utilizationDim#0000FF:utilization(Mb) AREA:diff#00FF00:difference(Mb))