Dieter Gawlick, Oracle October, 2005 (GGF15 in Boston)

Slides:



Advertisements
Similar presentations
© 2006 Open Grid Forum INFOD Extended Specifications OGF21, Seattle, WA, USA
Advertisements

The Top 10 Reasons Why Federated Can’t Succeed And Why it Will Anyway.
1 11. Streaming Data Management Chapter 18 Current Issues: Streaming Data and Cloud Computing The 3rd edition of the textbook.
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
Event Processing Use Case: Automated Security Pricing Event Processing Symposium November
CS 501: Software Engineering Fall 2000 Lecture 16 System Architecture III Distributed Objects.
1 PODS 2002 Motivation. 2 PODS 2002 Data Streams data sets Traditional DBMS – data stored in finite, persistent data sets data streams New Applications.
Streaming Data, Continuous Queries, and Adaptive Dataflow Michael Franklin UC Berkeley NRC June 2002.
Copyright © 2006, Open Geospatial Consortium, Inc., All Rights Reserved. The OGC and Emergency Services: GML for Location Transport & Formats & Mapping.
IBM Research – Thomas J Watson Research Center | March 2006 © 2006 IBM Corporation Events and workflow – BPM Systems Event Application symposium Parallel.
Kevin Hudson Oracle Corporation October Evolution of Oracle from Application to Infrastructure.
1 Serge Abiteboul - Monitoring 1 Monitoring of distributed applications (in P2P) Serge Abiteboul, Pierre Bourhis, Bogdan Marinoiu, INRIA Saclay and Université.
Next-Generation IDS: A CEP Use Case in 10 Minutes 3rd Draft – November 8, nd Event Processing Symposium Redwood Shores, California Tim Bass, CISSP.
20411B 8: Installing, Configuring, and Troubleshooting the Network Policy Server Role Presentation: 60 minutes Lab: 60 minutes After completing this module,
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
Data Management Conference Introducing SQL Server 2008 R2 Mark Linton Director of WW Marketing SQL Server Business Group
Information System Development Courses Figure: ISD Course Structure.
Event Processing A Perspective From Oracle Dieter Gawlick, Shailendra Mishra Oracle Corporation March,
Wellstorm Development Connecting Real Time Data to Everything Hugh Winkler May 11, 2006.
Data Stream Management Systems
Aum Sai Ram Security for Stream Data Modified from slides created by Sujan Pakala.
Web 2.0 and Telco 2.0 n Service orientation is about inter-operability and federation across domains. l Domain silos defeat the purpose. l User wants one.
1 Makes Mobile WiMAX Simple Netspan Overview Andy Hobbs Director, Product Management 5 th October 2007.
Text TCS INTERNAL Oracle PL/SQL – Introduction. TCS INTERNAL PL SQL Introduction PLSQL means Procedural Language extension of SQL. PLSQL is a database.
CERN 21 January 2005Piotr Nyczyk, CERN1 R-GMA Basics and key concepts Monitoring framework for computing Grids – developed by EGEE-JRA1-UK, currently used.
Streaming Semantic Data COMP6215 Semantic Web Technologies Dr Nicholas Gibbins –
Leading the pervasive adoption of grid computing for research and industry © 2005 Global Grid Forum The information contained herein is subject to change.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Data Streams COMP3017 Advanced Databases Dr Nicholas Gibbins –
CE design report Luigi Zangrando
DOWeR Detecting Outliers in Web Service Requests Master’s Presentation of Christian Blass.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CS457 Introduction to Information Security Systems
OGSA Information Abdeslem Djaoui OGSA Information, OGSA-WG #11
OGSA Information Management Services status
OGF PGI – EDGI Security Use Case and Requirements
What is sql?.
LMAP BoF 1. ISP use case 2. Framework
INFOD Use Case Scenario
COMP3211 Advanced Databases
OGSA Data Architecture WG Data Transfer Discussion
Semantic Event-based Service Oriented Architecture
Distributed database approach,
Every Good Graph Starts With
R-GMA as an example of a generic framework for information exchange
Work plan revisited Activity 3 Impact Activity 4 Management
The Client/Server Database Environment
INFOD-WG Implementation
SECURITY INFORMATION AND EVENT MANAGEMENT
The Design of an Acquisitional Query Processor For Sensor Networks
Environment-Aware Reputation Management for Ad Hoc Networks
The Top 10 Reasons Why Federated Can’t Succeed
Teaching slides Chapter 8.
Service Oriented Architecture for Cloud Based Travel Reservation Software as a Service Comp 684 – Rayna Burgess.
Database Systems: Design, Implementation, and Management
Taking business online October 2008
The Past, The Present, and The Future
Data Security in Local Networks using Distributed Firewalls
MGS 4020 Business Intelligence Ch 1 – Introduction to DSS Jun 7, 2018
Introduction to Stream Computing and Reservoir Sampling
RSS RSS is an acronymn for Really Simple Syndication or Rich Site Summary. RSS (noun) - an XML format for distributing news headlines on the Web. RSS.
Software and Software Engineering
Knowledge Byte In this section, you will learn about:
Confidentiality, Integrity, Nonrepudiation
Ponder policy toolkit Jovana Balkoski, Rashid Mijumbi
Digital Libraries and Online Education
Drawn from TAPI: oimt.2019.ND TapiStreaming.mht
Presentation transcript:

Dieter Gawlick, Oracle 03-06 October, 2005 (GGF15 in Boston) Notes About Streams Dieter Gawlick, Oracle 03-06 October, 2005 (GGF15 in Boston)

Examples of Streams Network monitoring and traffic engineering Sensor networks, RFID tags Telecom call records Financial applications Web logs and click-streams Manufacturing processes Queues RSS

Streams are Everywhere Application Application Application Application Stream File/DB File/DB Stream Sensor Data Actuators Stream Stream Feeds Feeds Stream Propagations Propagations A virtual stream; potentially based on a CQ A materialized stream Stream

Definition … a body of water, confined within a bed and banks and having a detectable current – What you get from Google Continuous, unbounded, rapid, time-varying streams of data elements – Jennifer Widom A semi ordered and ever growing set of related data – Anonymous … make up your own definition …

Specifications of Streams Data model Data structure of elements (NVP, CSV, SQL99, XML, RDF, ..) - the easy part Relation between elements has to be added Sequencing, other Review presentation from Stanford Streams project and others Data access Continuous Queries; e.g., Subscriptions, CQL Single elements – well understood Windows, joins, aggregates – (very) limited understanding Any language has to deal with evolution of streams Ad-hoc data access assumes a static/stable data source Ad-hoc access Current languages are very weak on temporal support

Specifications of Streams Client model and tools We need an additional client pattern Use consumption of queues as guide (Exactly once) consumption E.g., make sure that ‘new’ data are processed exactly once even in the presence of application and system failures State management - to protect against failures E.g., prevent extended locking – commit after consumption of new data element Management (logical properties) Retention Other

The Creation of Streams Creation of (initial) streams Applications Fixed Based on parameters Always, on demand Temporal states Continuous query must be able to identify events Creation of (derived) streams by evaluation of other streams: Single element (classical subscription) Windows – CQL like languages Joining of streams – CQL like language with extensions Joining stream with non-stream data – tricky (temporal reference) Many policies See next slide

Distribution and consumption of Streams Messaging, RSS, … Propagation Consumption Single client, group of clients Policies Best effort, auditable, non-reputable Fair According to community policies According to constraints defined by publisher and consumer Best effort Non transactional, transactional (exactly once)

Operational Characteristics Scalability/performance Large number of (streams) elements, publishers, consumers (clients) Reliability/Recoverability Streams may represented business interaction; e.g., may represent most valuable business (data) asset Security Should be aligned with database/file security; e.g., fine grain, context oriented Transactional Elements of some streams need to be created, propagated consumed exactly once Fair Everyone gets information at the same time … and more

Streams and Events The elements of streams are messages Message report about events Event: A change of a state that is of interest Event and message specification Newton Model: Events and messages are pre-efined – take what you get, select a subset Heisenberg Model: Events and messages can be defined by subscriber/consumer Requires temporal support Requires a language to support event/message specification It is a significant challenge to identify important events and related messages