Aurora Proponent Team Wei, Mingrui Liu, Mo Rebuttal Team Joshua M Lee Raghavan, Venkatesh.

Slides:



Advertisements
Similar presentations
Computer Systems & Architecture Lesson 2 4. Achieving Qualities.
Advertisements

Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
1 11. Streaming Data Management Chapter 18 Current Issues: Streaming Data and Cloud Computing The 3rd edition of the textbook.
The Design of the Borealis Stream Processing Engine Daniel J. Abadi1, Yanif Ahmad2, Magdalena Balazinska1, Ug ̆ur C ̧ etintemel2, Mitch Cherniack3, Jeong-Hyon.
A Data Stream Management System for Network Traffic Management Shivnath Babu Stanford University Lakshminarayanan Subramanian Univ. California, Berkeley.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
The Design of the Borealis Stream Processing Engine Brandeis University, Brown University, MIT Magdalena BalazinskaNesime Tatbul MIT Brown.
Static Optimization of Conjunctive Queries with Sliding Windows over Infinite Streams Presented by: Andy Mason and Sheng Zhong Ahmed M.Ayad and Jeffrey.
Load Shedding in a Data Stream Manager Kevin Hoeschele Anurag Shakti Maskey.
1 Continuous Queries over Data Streams Vitaly Kroivets, Lyan Marina Presentation for The Seminar on Database and Internet The Hebrew University of Jerusalem,
An Abstract Semantics and Concrete Language for Continuous Queries over Streams and Relations Presenter: Liyan Zhang Presentation of ICS
Quality-Of-Service (QoS) Panel Mitch Cherniack Brandeis David Maier OGI Rajeev Motwani Stanford Johannes GehrkeCornell Hari BalakrishnanMIT SWiM, Stanford.
Stream Processing Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems March 30, 2005.
Monitoring Streams -- A New Class of Data Management Applications Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack Brandeis.
Caching for Mobile Devices Kinsen Choy. Purpose Reduce network related costs for mobile devices. Different needs than desktop machines. – Limited energy.
1 PODS 2002 Motivation. 2 PODS 2002 Data Streams data sets Traditional DBMS – data stored in finite, persistent data sets data streams New Applications.
Improving the Accuracy of Continuous Aggregates & Mining Queries Under Load Shedding Yan-Nei Law* and Carlo Zaniolo Computer Science Dept. UCLA * Bioinformatics.
1 Load Shedding in a Data Stream Manager Slides edited from the original slides of Kevin Hoeschele Anurag Shakti Maskey.
Monitoring Streams -- A New Class of Data Management Applications Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack Brandeis.
SWIM 1/9/20031 QoS in Data Stream Systems Rajeev Motwani Stanford University.
Panel on Stream Query Languages The Aurora View Stan Zdonik Brown University.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
Efficient Query Evaluation over Temporally Correlated Probabilistic Streams Bhargav Kanagal, Amol Deshpande ΗΥ-562 Advanced Topics on Databases Αλέκα Σεληνιωτάκη.
Using Probabilistic Models for Data Management in Acquisitional Environments Sam Madden MIT CSAIL With Amol Deshpande (UMD), Carlos Guestrin (CMU)
Mirek Riedewald Department of Computer Science Cornell University Efficient Processing of Massive Data Streams for Mining and Monitoring.
CSC271 Database Systems Lecture # 30.
Sensor Data Management: Challenges and (some) Solutions Amol Deshpande, University of Maryland.
Monitoring Streams- A New Class of Data Management Applications Presented by Qing Cao at
MONITORING STREAMS: A NEW CLASS OF DATA MANAGEMENT APPLICATIONS DON CARNEY, U Ğ UR ÇETINTEMEL, MITCH CHERNIACK, CHRISTIAN CONVEY, SANGDON LEE, GREG SEIDMAN,
The Design of the Borealis Stream Processing Engine CIDR 2005 Brandeis University, Brown University, MIT Kang, Seungwoo Ref.
Next-Generation IDS: A CEP Use Case in 10 Minutes 3rd Draft – November 8, nd Event Processing Symposium Redwood Shores, California Tim Bass, CISSP.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Query Processing, Resource Management, and Approximation in a Data Stream Management System.
Chapter 10: Stream-based Data Management Title: Retrospective on Aurora Authors: Hari Balakrishnan, et. al.
Master’s Thesis (30 credits) By: Morten Lindeberg Supervisors: Vera Goebel and Jarle Søberg Design, Implementation, and Evaluation of Network Monitoring.
A new model and architecture for data stream management.
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia
Secure Sensor Data/Information Management and Mining Bhavani Thuraisingham The University of Texas at Dallas October 2005.
Aurora – system architecture Pawel Jurczyk. Currently used DB systems Classical DBMS: –Passive repository storing data (HADP – human-active, DBMS- passive.
Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management Author: Raul Castro Fernandez, Matteo Migliavacca, et al.
OPERETTA: An Optimal Energy Efficient Bandwidth Aggregation System Karim Habak†, Khaled A. Harras‡, and Moustafa Youssef† †Egypt-Japan University of Sc.
Tracking Irregularly Moving Objects based on Alert-enabling Sensor Model in Sensor Networks 1 Chao-Chun Chen & 2 Yu-Chi Chung Dept. of Information Management.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
Data Stream Management Systems
Load Shedding Techniques for Data Stream Systems Brian Babcock Mayur Datar Rajeev Motwani Stanford University.
Load Shedding in Stream Databases – A Control-Based Approach Yicheng Tu, Song Liu, Sunil Prabhakar, and Bin Yao Department of Computer Science, Purdue.
1/22/08 RTR Project Presentation to TPTF RTR Project Michael Daskalantonakis & Brian Cook.
Aurora Group 19 : Chu Xuân Tình Trần Nhật Tuấn Huỳnh Thái Tâm Lec: Associate Professor Dr.techn. Dang Tran Khanh A new model and architecture for data.
A new model and architecture for data stream management.
Aurora: a new model and architecture for data stream management Daniel J. Abadi 1, Don Carney 2, Ugur Cetintemel 2, Mitch Cherniack 1, Christian Convey.
Adaptive Ordering of Pipelined Stream Filters Babu, Motwani, Munagala, Nishizawa, and Widom SIGMOD 2004 Jun 13-18, 2004 presented by Joshua Lee Mingzhu.
Control-Based Load Shedding in Data Stream Management Systems Yicheng Tu and Sunil Prabhakar Department of Computer Sciences, Purdue University April 3,
1 Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter Tucker This work.
Monitoring Streams -- A New Class of Data Management Applications based on paper and talk by authors below, slightly adapted for CS561: Don Carney Brown.
REED : Robust, Efficient Filtering and Event Detection in Sensor Network Daniel J. Abadi, Samuel Madden, Wolfgang Lindner Proceedings of the 31st VLDB.
Control-based Quality Adaptation in Data Stream Management Systems (DSMS) Yicheng Tu†, Song Liu‡, Sunil Prabhakar†, and Bin Yao‡ † Department of Computer.
Energy Efficient Data Management in Sensor Networks Sanjay K Madria Web and Wireless Computing Lab (W2C) Department of Computer Science, Missouri University.
Control-Based Load Shedding in Data Stream Management Systems Yicheng Tu and Sunil Prabhakar Department of Computer Sciences, Purdue University April 3,
Streaming Semantic Data COMP6215 Semantic Web Technologies Dr Nicholas Gibbins –
S. Sudarshan CS632 Course, Mar 2004 IIT Bombay
The Design of an Acquisitional Query Processor For Sensor Networks
Advanced Database Management System
Data Stream Management System (DSMS)
Load Shedding Techniques for Data Stream Systems
Load Shedding in Stream Databases – A Control-Based Approach
Streaming Sensor Data Fjord / Sensor Proxy Multiquery Eddy
Sam Madden MIT CSAIL With Amol Deshpande (UMD), Carlos Guestrin (CMU)
Adaptive Query Processing (Background)
An Analysis of Stream Processing Languages
Presentation transcript:

Aurora Proponent Team Wei, Mingrui Liu, Mo Rebuttal Team Joshua M Lee Raghavan, Venkatesh

AURORA Proponent Introduction Motivation Aurora System Architecture Aurora System Query model Conclusion

Introduction:Traditional DBMSs Passive reporitory: Human-Active, DBMS- Passive(HADP) model The current of state of the data is important: Previous data needs to be extracted form the log Triggers and alerts as second-class citizens Perfect synchronization of data elements and exact query answers No real-time services from applications

Introduction: Monitoring apps Monitoring applications are applications that monitor continuous streams of data Active repository: DBMS-Active, Human-Passive (DAHP) model History of the data is important: Not only the current state but also the previous history Triggers and alerts as the first-class citizens Missing or imprecise data, and approximate query answers Real-time services required by applications

Introducation: Monitoring apps Target Applications :military financial analysis tracking other real-time applications

Car Navigation System Data( e.g., the location of the car) comes from external sources History of the data is required( e.g., display a trajectory of your car in the past 20 minutes) Trigger and alert oriented: an alert for the driver when the car is approaching to an intersection The location of the car is not always perfectly transmitted due to interferences etc..

Motivation-DSMS Data Stream Management Systems Streams: continuous data feeds from sources. ect. sensors and satellites Monitoring applications track the data from numerous streams, filtering them for signs of abnormal activity and processing them for purposes of aggregation, reduction and correlation

Management Requirements DBMS 1.Data processing results issuing transactions and queries 2.Manages data in its tables 3.Provide exact answers to exact queries and is blind to real-time deadlines 4.Optimizations of all queries in the same way 5.The norm is pull-based queries DSMS 1.Monitoring and alerting humans for abnormal activities 2.Processing of data that is bounded by finite window of values and not over unbounded past 3.Respond to real-time deadlines and provide reasonable approximations to queries 4.Benefits from Application specific optimization criteria (QoS) 5.The norm is push-based data processing

What is Aurora Deals with large numbers of data streams Users built queries out of a small set of operators (boxes) Supports combination of (boxes) for better answers Aurora-Better support monitoring application; Stream processing QoS functions Operators: filtering, mapping, windowed aggregate, join Timeout

Aurora –Architecture Continuous stream data comes Flow through a set of operators Output to application or materialized Multiple streams can be merged

Aurora System: Query model Supports continuous queries Supports Ad-hoc queries

Aurora QoS Model Each output is supplied with a QoS specification QoS is captured by three functions A latency graph A loss-tolerance graph A value-based graph

QoS

Aurora optimization Dynamic continuous query optimization optimizer selects a portion of the network to optimize by: Inserting projections. Combining boxes. Reordering boxes.

QoS data structures Response times – output tuples should be produced in a timely fashion, as otherwise QoS will degrade as delays get longer. Tuple drops – if tuples are dropped to shed load, then the QoS of the affected outputs will deteriorate. Values produced – QoS clearly depends on whether or not important values are being produced.

QoS

Load Shedding Load shedding by dropping tuples Minimizing the overall performance degradation as a result of static analysis Semantic load shedding by filtering tuples Semantic load shedding based on value- based QoS information if available

Conclusion Aurora is a new rising star in DBMS More demand for monitoring applications Future directions: Aurora* for distributed processing More efficient data handing algorithm for missing and/or imprecise data that is common in sensor network

AURORA Rebuttal User Issues –The graphical workflow style specification could be cumbersome in real life use –An extended SQL may be easier for users