1 Querying the Physical World Son, In Keun Lim, Yong Hun.

Slides:



Advertisements
Similar presentations
Split Databases. What is a split database? Two databases Back-end database –Contains tables (data) only –Resides on server Front-end database –Contains.
Advertisements

Database Architectures and the Web
Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Chapter 13 (Web): Distributed Databases
Institute for Software Science – University of ViennaP.Brezany 1 Databases and the Grid Peter Brezany Institute für Scientific Computing University of.
Data - Information - Knowledge
Network Operating Systems Users are aware of multiplicity of machines. Access to resources of various machines is done explicitly by: –Logging into the.
Distributed Database Management Systems
Overview Distributed vs. decentralized Why distributed databases
A Hierarchical Energy-Efficient Framework for Data Aggregation in Wireless Sensor Networks IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 55, NO. 3, MAY.
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
What is adaptive web technology?  There is an increasingly large demand for software systems which are able to operate effectively in dynamic environments.
University of Pennsylvania 11/21/00CSE 3801 Distributed File Systems CSE 380 Lecture Note 14 Insup Lee.
Distributed Databases
Database System Architectures  Client-server Database System  Parallel Database System  Distributed Database System Wei Jiang.
Query Processing in Mobile Databases
Introduction to Cyberspace
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
SensIT PI Meeting, January 15-17, Self-Organizing Sensor Networks: Efficient Distributed Mechanisms Alvin S. Lim Computer Science and Software Engineering.
Lecture slides prepared for “Computer Security: Principles and Practice”, 3/e, by William Stallings and Lawrie Brown, Chapter 5 “Database and Cloud Security”.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
CORE 2: Information systems and Databases CENTRALISED AND DISTRIBUTED DATABASES.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Sensor Network Databases1 Overview: Chapter 6  Sensor Network Databases  Sensor networks are conceptually a distributed DB  Store collected data  Indexes.
1 Moshe Shadmon ScaleDB Scaling MySQL in the Cloud.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Sensor Database System Sultan Alhazmi
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Querying Large Databases Rukmini Kaushik. Purpose Research for efficient algorithms and software architectures of query engines.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
Distributed Virtual Environments Introduction. Outline What are they? DVEs vs. Analytic Simulations DIS –Design principles Example.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
Lecture 22: Client-Server Software Engineering
CERN – European Organization for Nuclear Research Administrative Support - Internet Development Services CET and the quest for optimal implementation and.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
1 Distributed Databases BUAD/American University Distributed Databases.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
MBA 664 Database Management Systems Dave Salisbury ( )
Querying The Internet With PIER Nitin Khandelwal.
Chapter 6.  Problems of managing Data Resources in a Traditional File Environment  Effective IS provides user with Accurate, timely and relevant information.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Web Service-Based Remote Monitoring System for Smart Home Space Sheng Cai Joshua Ferguson Xinhui Hu Wei Wu Project for CSE535 Mobile Computing.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
W. Hong & S. Madden – Implementation and Research Issues in Query Processing for Wireless Sensor Networks, ICDE 2004.
Chapter 1 Database Access from Client Applications.
1 Querying the Physical World Cornell University Event Detection Services Using Data Service Middleware in Distributed Sensor Networks University.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
REED : Robust, Efficient Filtering and Event Detection in Sensor Network Daniel J. Abadi, Samuel Madden, Wolfgang Lindner Proceedings of the 31st VLDB.
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Database Management System Architecture 2004, Spring Pusan National University.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Troubleshooting Dennis Shasha and Philippe Bonnet, 2013.
Operating Systems Distributed-System Structures. Topics –Network-Operating Systems –Distributed-Operating Systems –Remote Services –Robustness –Design.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
Managing Data Resources File Organization and databases for business information systems.
Data-centric view of sensornets: An Overview Puru Kulkarni Vijay Sundaram Bhuvan Urgaonkar.
Table General Guidelines for Better System Performance
Introduction to NewSQL
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
MANAGING DATA RESOURCES
Chapter 17: Database System Architectures
Table General Guidelines for Better System Performance
Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing Zaharia, et al (2012)
Performance And Scalability In Oracle9i And SQL Server 2000
Database System Architectures
Distributed Edge Computing
Presentation transcript:

1 Querying the Physical World Son, In Keun Lim, Yong Hun

2 Outline  Introduction  Kinds of Queries  Warehousing vs. Distributed approach  Device Database System  Query Execution Plans  Conclusion

3 Future Computing Environments  The widespread deployments of devices  sensors, actuators, and mobile devices  with power, memory, and communication capabilities  devices are interconnected and accessible from local and wide area networks  What is a device?  physical object with computing and communication capabilities

4 Kinds of Queries  Historical Queries  typically aggregate queries over historical data  “For each rainfall sensor, display the average level of rainfall for 1999”  Snapshot queries  concerns the device network at a given point in time  “Retrieve the current rainfall level for all sensors in Tompkins County”  Long-running queries  concern the device network over a time interval  “For the next five hors, retrieve every 30 seconds the rainfall level for all sensors in Tompkins County”

5 Warehousing approach  Traditional query processing  A warehousing approach – data are extracted in a predefined way and store in a centralized DB system DBMS Query requesters Query Result Raw data -ATM Machine -Airplane/Ship/Train -Etc NETWORK

6 Warehousing approach(cont’d)  Dissociates access to devices from the query workload  Transfer large amounts of raw data from devices to the database server  Wastes valuable resources (network resources, processing power)  Modern devices include processing capabilities that could be used to process data locally

7 Distributed Approach  New way for query processing  Small-scale mobile devices will have self-computing power, memory, and communication capabilities Meta- Information Query requesters Query Result Sub query -ATM Machine -Airplane/Ship/Train -Etc NETWORK Sub result Query processor

8 Distributed Approach(cont’d)  The query determines the data that are extracted from remote sites  The query determines where possibly portions of queries are executed on devices  Device Database System  access devices directly when processing queries

9 Device Database System  What is a device?  A mini-server supporting functions  Functions like acquiring data  Functions like actuating (Triggers)  Device types  Synchronous devices returns results immediately  Asynchronous devices may move, may be unconnected, therefore returns results after an arbitrary period of time

10 Representation of Functions  User Representation : ADT  modeling each type of device in the network as an ADT  the public interface of the ADT corresponds to the specific functions supported by the device  Internal Representation : Virtual Relation  tabular relation of a function

11 Virtual Relation  Attributes  Inputs to the function  Arguments to the function  Output of the function  Timestamp of the function  Properties  records are never updated or deleted  is naturally partitioned over the network => a distributed DB

12 Query Execution Plan  Performance metrics  Throughput and response time  Resource usage and reaction time  Notation  Joins are between R and VRs  Rs are centrally located tables (like a list of sensors)  VRs are distributed (like a VR representing a device function)

13 Traditional Query Execution Plan  Materialize VR  Do Join at Front End  Centralized  Bottleneck, not scalable

14 Query Execution Plan A  Virtual records are produced once on each site for a snapshot query  Virtual records are produced repetitively for a long-running query  Whenever you get a new records, do a join

15 Query Execution Plan B  Semi-join : send joining attribute to devices  Use processing power of devices more  let them do the join  Send back only valid records  Reduced network traffic

16 Query Execution Plan C  Just send the condition  Again do the join at the center

17 Experiments 1  Nodes outside the target region  Plan C predictably shows no consumption (the transfer of the condition is negligible)  Plan B amortizes  Plan A accumulates

18 Experiment 2  Nodes inside the target region  Plan C shows growth, similar to Plan A  Plan B grows much more  The difference is the cost of performing a selection – low compared to cost of sending data.

19 Conclusions  Flexible  Scalable  Optimizer assume global knowledge  Query plans must adapt dynamically and be robustness

20 Future Expectations  A lot of self-computing devices which have their own memory, communication capability will exist in the future world (already?)  Refrigerator, Television, Vending Machine…  Hot issues are going to be  How to maintain device meta-information with a d ecentralized approach (P2P?)  Which information should be stored carefully at the data warehouse and which device should be accessed  Information retrieval will be more important  How to make query execution plans