Tracking a Soccer Game with Big Data Srinath Perera Director of Research, WSO2 Member, Apache Software

Slides:



Advertisements
Similar presentations
Your All in 1 Logistics Solution. The Problem Problems that your business might be facing Delivery delay / mistake Facilities are not used effectively.
Advertisements

Solving Manufacturing Equipment Monitoring Through Efficient Complex Event Processing Tilmann Rabl, Kaiwen Zhang, Mohammad Sadoghi, Navneet Kumar Pandey,
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Grand Challenge: The BlueBay Soccer Monitoring Engine Hans-Arno Jacobsen Kianoosh Mokhtarian Tilmann Rabl Mohammad.
Transforming Business with Advanced Analytics: Introducing the New Intel® Xeon® Processor E7 v2 Family Seetha Rama Krishna Director, APAC HPC Solutions.
© Hortonworks Inc Daniel Dai Thejas Nair Page 1 Making Pig Fly Optimizing Data Processing on Hadoop.
GPS Fleet Solutions for Business Managing your fleet just got a whole lot easier! Managing Director –
1 NETE4631 Cloud deployment models and migration Lecture Notes #4.
The NewSQL database you’ll never outgrow Taming the Big Data Fire Hose John Hugg Sr. Software Engineer, VoltDB.
MessageOps Monitor. Communication apps are mission critical But how do you ensure high service levels when they run in the cloud?
A Fast Growing Market. Interesting New Players Lyzasoft.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Scalable Realtime Analytics with declarative, SQL like, Complex Event Processing Scripts Srinath Perera Director, Research WSO2 Apache Member
Chapter 5. Database Aspects of Location-Based Services Lee Myong Soo Mobile Data Engineering Lab. Dept. of.
Dunja Mladenić Marko Grobelnik Jožef Stefan Institute, Slovenia.
Cloud Connected Fitness
Create Real-Time Awareness with Business Activity Monitoring iCEP – FIS September 2008 Sinan Sen Forschungszentrum Informatik - FZI, Karlsruhe,
A Billiards Point of Sale Application Christopher Ulmer CS 470 Final Presentation.
The Pulse of UCF James Doty EEL 6788 University of Central Florida 19 April 2010.
C7:Complex Event Processing Making Sense of Sensor Network Events in Real Time John Doherty Senior Presales Consultant.
SET THE DISRUPTIVE SCENE David Minton Director of The Leisure Database Company.
Next-Generation IDS: A CEP Use Case in 10 Minutes 3rd Draft – November 8, nd Event Processing Symposium Redwood Shores, California Tim Bass, CISSP.
John Plummer Technical Specialist Data Platform Microsoft Ltd StreamInsight Complex Event Processing (CEP) Platform.
Financial Information Management Operations, BI, and Analytics Stefano Grazioli.
Snow Plow Tracking and GIS
…optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Network Computing Laboratory A programming framework for Stream Synthesizing Service.
What is Big Query?.
Your GPS Fleet Solutions Experts Managing your fleet just got a whole lot easier!
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Goal Reports | All Rights Reserved Provider of the world’s best soccer evaluation software.
The Future of Traffic Records as envisioned by… Carol Flannagan, Ph.D. Interim Director, UMTRI October 27, 2015 Traffic Records Forum.
Guidance Tool for Implementation of Traffic Incident Management Performance Measures (NCHRP 07-20) December 16, 2014 Brian Hoeft Freeway and Arterial System.
Empowering analytics through data capture Cary Moretti, CTO | HockeyTech Ottawa Hockey Analytics, Carleton University, Ottawa the world’s premier provider.
Information Systems in Organizations Managing the business: decision-making Growing the business: knowledge management, R&D, and social business.
Internet of Things. Creating Our Future Together.
Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.
Microsoft Ignite /28/2017 6:07 PM
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Azure Stream Analytics
Connected Infrastructure
IoT Business Maturity Model 1. Operational efficiency
In quest of the operational database for real-time environmental monitoring and early warning systems Bartosz Baliś, Marian Bubak, Daniel Harezlak, Piotr.
Streaming Analytics & CEP Two sides of the same coin?
Parcel Tracking Solution Parcel Tracking What to look for Architecture
Sport Specfic Assigment: Performa Sports
Researcher and Lecturer Assistant at FCI Assiut University
Solving DEBS Grand Challenge with WSO2 CEP
Connected Infrastructure
Enabling Scalable and HA Ingestion and Real-Time Big Data Insights for the Enterprise OCJUG, 2014.
LEO Kinesis More Kafka-like Blaine Nielsen
Remote Monitoring solution
Malwarebytes Crashes during Scan Window
COS 518: Advanced Computer Systems Lecture 11 Michael Freedman
Automation in IMS Can it help the shrinking talent pool
Federico Perrero – Plant Manager
Big Data Analytics: Visualizing the Future in the Aftermarket
Cornell Notes Identify the Problem Gather Information
Flow Map Identify the Problem Gather Information State a Hypothesis
C7: Complex Event Processing
Big Data Young Lee BUS 550.
Taming the Big Data Fire Hose
Soccer Team Gameplay Metrics
MIS2502: Data Analytics Things You Can Do With Data & Information Architecture Zhe (Joe) Deng
Performance And Scalability In Oracle9i And SQL Server 2000
Analytics, BI & Data Integration
COS 518: Advanced Computer Systems Lecture 12 Michael Freedman
Lu Tang , Qun Huang, Patrick P. C. Lee
Customer 360.
SDMX meeting Big Data technologies
Presentation transcript:

Tracking a Soccer Game with Big Data Srinath Perera Director of Research, WSO2 Member, Apache Software

Outline Big Data and CEP Tracking a Soccer Game Making it real Conclusion Photo by John Trainoron Flickr Licensed under CC

A day in your life  Think about a day in your life? -What is the best road to take? -Would there be any bad weather? -How to invest my money? -How is my health?  There are many decisions that you can do better if only you can access the data and process them / / CC licence

Real Time Analytics  Most first usecases were of batch processing, which take minutes if not hours to run  Many insights you rather have sooner -Tomorrows weather -Heart condition -Traffic congestion -Natural disaster -Stockmarket Crash

Realizing Real-time Analytics  Processing Data on the fly, while storing a minimal amount of information and responding fast (from <1 ms to few seconds)  Idea of Event streams -A series of events in time  Enabling technologies -Stream Processing (Storm, S4) -Complex Event processing

Internet of Things  Currently physical world and software worlds are detached  Internet of things promises to bridge this -It is about sensors and actuators everywhere -In your fridge, in your blanket, in your chair, in your carpet.. Yes even in your socks -Google IO pressure mats

Vision of the Future  Sensors everywhere  Data collected from everywhere, analyzing, optimizing, and helping (and hopefully not taking over)  Analytics and Internet of things.. Immersive world  Big data and real-time analytics will be crucial. How far are we from realizing that?

What would take to build such a world?  Sensors and actuators (Motes?)  Fast interoperable event systems (MQTT?)  Powerful query languages (CEP?)  Powerful control systems and decision systems

Data Processing Landscape

Complex Event Processing

Complex Event Processing Operators  Filters or transformations (process a single event) - from Ball[v>10] select.. insert into..  Windows + aggregation (track window of events: time, length) - from Ball#window.time(30s) select avg(v)..  Joins (join two event streams to one) - from Ball#window.time(30s) as b join Players as p on p.v < b.v  Patterns (state machine implementation) - from Ball[v>10], Ball[v 10] select..  Event tables (map a database as an event stream) - Define table HitV (v double) using.. db info..

Sport (Soccer) Usecases?  Dashboard on game status  Alarms about critical events in the game  Real-time game analysis and predictions about the next move  Updates/ stats etc., on mobile phone with customized offers  Study of game and players effectiveness  Monitor players health and body functions

DEBS Challenge Soccer game, players and ball has sensors (DESB Challenge 2013) sid, ts, x,y,z, v,a Use cases: Running analysis, Ball Possession and Shots on Goal, Heatmap of Activity WSO2 CEP (Siddhi) did 100K+ throughput

Usecase 1: Running Analysis  Main idea: detect when speed thresholds have passed define partition player by Players.id; from s = Players [v 11], t = Players [v > 1 and v <= 11]+, e = Players [v 11] select s.ts as tsStart, e.ts as tsStop,s.id as playerId, ‘‘trot" as intensity, t [0].v as instantSpeed, (e.ts - s.ts )/ as unitPeriod insert into RunningStats partition by player;

Usecase 2: Ball possession  Ball possession (you possess the ball from time you hit it until someone else hit it or ball leaves the ground)

Usecase 3: Heatmap of Activity  Show where actions happened (via cells defined by a grid of 64X100 etc.), need updates once every second  Can solved via cell change boundaries, but does not work if one player stays more than 1 sec in the same cell. So need to join with a timer.

Usecase 4: Detect kicks on the goal  Main Idea: Detect kicks on the ball, calculate direction after 1m, and keep giving updates as long as it is in right direction

Results for DEBS Scenarios

Making it Real

CEP High Availability

Complex Event ProcessingStream Processing SQL like language Supports powerful temporal operators (e.g. windows, event patterns) Focus on speed Harder to scale e.g. WSO2 CEP, Streambase, Esper Operators connected in a network, but you have to write the logic Distributed by design Focus on reliability (do not loose messages), has transactions e.g. Storm, S4

Scaling CEP: Pipeline

Scaling Operators: e.g. Distributed Join

Siddhi Storm Bolt (next Major Release)  We have written a Siddhi bolt that would let users run distributed Siddhi Queries using Storm SiddhiBolt siddhiBolt1 = new SiddhiBolt(.. siddhi queries.. ); SiddhiBolt siddhiBolt2 = new SiddhiBolt(.. siddhi queries.. ); TopologyBuilder builder = new TopologyBuilder(); builder.setSpout("source", new PlayStream(), 1); builder.setBolt("node1", siddhiBolt1, 1).shuffleGrouping("source", "PlayStream1");.. builder.setBolt("LeafEacho", new EchoBolt(), 1).shuffleGrouping("node1", "LongAdvanceStream");.. cluster.submitTopology("word-count", conf, builder.createTopology());

Conceptual Architecture

Data Collection Can receive events via SOAP, HTTP, JMS,.. WSO2 Events is blazing fast (400K events TPS) Default Agents and you can write custom agents. Agent agent = new Agent(agentConfiguration); publisher = new AsyncDataPublisher( "tcp://localhost:7612",.. ); StreamDefinition definition = new StreamDefinition(STREAM_NAME, VERSION); definition.addPayloadData("sid", STRING);... publisher.addStreamDefinition(definition);... Event event = new Event(); event.setPayloadData(eventData); publisher.publish(STREAM_NAME, VERSION, event);

Business Activity Monitor

Think CEP AND Hive, not OR  Real power comes when you can join the batch and real-time processing  E.g. if velocity of the ball after a kick is different from season average by 3 times of season’s standard deviation, trigger event with player ID and speed -Hard to detect velocity after the kick with Hive/ MapReduce, but much easier with CEP -Complicated to calculate season average with CEP, as we end up with huge windows.

Other Applications  System/ Device Management  Fleet/ Logistic Management  Fraud Detection  Targeted/ Location Sensitive Marketing  Smart Grid Control  Geo Fencing  …

Conclusion  We are heading for a deeply integrated world with real-time detection and actions -We have technology to do this now. E.g. (DEBS usecases) -Power of CEP -Use real-time and batch processing in tandem  All the software we discussed are Open source under Apache License. Visit  Like to integrate with us, help, or join? Come to booth 901, get in touch via or