Download presentation
Presentation is loading. Please wait.
Published byGerald Goodwin Modified over 9 years ago
1
Monitoring Streams- A New Class of Data Management Applications Presented by Qing Cao at CS@UVA
2
2/30 Table of contents Introduction Aurora System Model Aurora Optimization Real-Time Operation Details Critique Conclusion Discussion throughout the talk
3
3/30 Introduction Scenario RFID tagged Components Armed various sensors RPM, temperature, pressure, oil status, … Pressure Sensor Brightness Sensor User ID and Status
4
4/30 Auto Service Database 4G Wireless Network Service center GPS Repair center Home visit service Notify Instead of Query
5
5/30 Scenario Summary Data Streams rather than Static Data Paradigm shift from HADP to DAHP Can traditional Database be used to ha- ndle this kind of scenarios? According to the authors, NO!
6
6/30 Comparison Monitoring Application Traditional DBMS Typical model Data Active Human Passive Data Passive Human Active Managing History of values required Very hard or inefficient Approximate query result requiredNot supported Real-time requirement requiredNot supported
7
7/30 So… Quote: The primary goal of the Aurora project is to build a single infrastructure that can efficiently and seamlessly meet the requirements of such demanding applications. To this end, we are currently critically rethinking many existing data management and processing issues, as well as developing new proactive data processing concepts and techniques.
8
8/30 Implementation - trigger Data Stream Output ??? DBMS Data Submitter Messaging Systems Query register CHALLENGE CHALLENGE CHALLENGE CHALLENGE CHALLENGE Trigger : they are not scalable Data stream : Not in RealTime Update query : millions update in short time burst Query management : often update new triggers or queries requested by 3 rd party History of values : no scalable way to support latest location of the car CHALLENGE Optimization : Is it helpful doing massive optimization during high load? CHALLENGE QoS : can not ensure service for premium customers
9
9/30 Implementation - middleware Data Stream Output ??? DBMS Data Submitter Messaging Systems Query register query Query Processor CHALLENGE QoS : can not ensure service for premium customers Query management : has to use new query language Data stream : sometimes lost or delivered lately History of values : no scalable way to find latest location of the car Optimization : Can not benefit from query optimization Update query : millions update in short time burst CHALLENGE Resource usage : are we efficiently using the system? CHALLENGE
10
10/30 Implementation - Aurora Data Stream Output DBMS Data Submitter Messaging Systems Query register CHALLENGE query Query Processor CHALLENGE QoS : can not ensure service for premium customers Query management : has to use new query language Data stream : sometimes lost or delivered lately History of values : no scalable way to find latest location of the car Optimization : Can not benefit from query optimization Update query : millions update in short time burst CHALLENGE Data stream : new stream processing architecture Update queries : new stream processing architecture History of the values : new stream processing architecture Optimization : run-time optimization Query management : intuitive stream algebra and GUI QoS : specified by application administrator & load shedding CHALLENGE Resource usage : are we efficiently using the system? Resource usage : train scheduling & feed back from/to QoS
11
11/30 System model of Aurora External data source User application Operator boxes data flow Continuous & ad hoc queries Historical Storage Aurora System QoS spec Query spec Application administrator
12
12/30 Implementation - Aurora Data Stream Output Buffer manager Storage Manager Persistent Store Q1Q1 Q2Q2 QmQm Q1Q1 Q2Q2 QnQn Scheduler Load Shedder QoS Monitor Catalog Box Processors σμσμ Router inputs outputs
13
13/30 Aurora Query Semantics Traditional Structured Query Language Declarative query on static data Aurora Data flow model for data stream Application manager will construct queries using GUI Stream Query Algebra Queries are processed by SQuAl operators on the data stream
14
14/30 Operators Discussion Slide Tumble Latch Resample Filter Drop Map GroupBy MAP+GROUPBY = CASE
15
15/30 Query model b1b2b3 b4 b5b6 b7 b8b9 app QoS spec continuous query view ad-hoc query Connection point Storage
16
16/30 Optimization Dynamic continuous query optimization Inserting projections Combining boxes Reordering boxes Ad hoc query optimization 1 st stage : replace implementation (Filter/Join) 2 nd stage : same as continuous query
17
17/30 RunTime Operation QoS Data Structure Storage Management Real-time Scheduling Load Shedding
18
18/30 Whole Structure Revisited Data Stream Output Buffer manager Storage Manager Persistent Store Q1Q1 Q2Q2 QmQm Q1Q1 Q2Q2 QnQn Scheduler Load Shedder QoS Monitor Catalog Box Processors σμσμ Router inputs outputs
19
19/30 Aurora from Above...... App QoS...... App QoS.................. App QoS
20
20/30 Runtime Operation Scheduling: Minimizing Per Tuple Processing Overh ead Train Scheduling: A B …xyz A (x)A (y)A (z)B (A (x))B (A (y))B (A (z)) = Scheduler Action AB …xyz B (A (x))B (A (y))B (A (z)) Box Trains: A B …xyz A (z, y, x) B (A (z), A (y), A (x)) Tuple Trains:
21
21/30 Performance
22
22/30 Disucssion Solution approach Rethink about everything for the requirements Query model Data flow style query specification and QoS Optimization Dynamic runtime optimization Train scheduling QoS specification based resource management
23
23/30 Discussion Can it works in a distributive manner? Aurora project What is the final result? After intensive searching of the tens of papers published on this subject, I finally finds what was implemented:
24
24/30 The final Result The Aurora stream-processing engine. Aurora is currently operational. It consists of some 100K lines of C++ and Java and runs on both Unix- and Linux-based platf- orms.
25
25/30 Graphical Interface
26
26/30 GUI for an Example
27
27/30 Critique The overall approaches lacks in novelty, e.g. stream operators are ad-hoc. The overall result is not impressing. The project output is no more than a toy java program. Papers published lack in originality, depth, and overlap too much.
28
28/30 Conclusion Aurora is a large project that aims at stream query based engine design. Various new approaches are presented. No comparison results found in any paper. What do you think?
29
29/30 Extra on Aurora Aurora is the Latin word for "dawn". A polar light (caused by solar wind and seen near the poles). The collective noun for a group of polar bears. Several aircraft. Several vessels. Several Companies. In space: An asteroid, discovered by J. C. Watson, in september 6, 1867. The Aurora Programme, a strategy of the European Space Agency. In fiction: A superhero in the Marvel Universe. One of the Spacer worlds in Isaac Asimov's fiction One of at least four distinct music groups: a UK house group, also known as Aurora UK; a California-based ambient group; a contemporary Christian R&B group; a Mexican Latin music band. The name of the game engine that runs Neverwinter Nights, the toolset is called the Aurora to olset because of this. AND the aurora system as presented today.
30
30/30 THANK YOU!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.