Data dissemination in wireless computing environments

Slides:



Advertisements
Similar presentations
Dynamic Generation of Data Broadcasting Programs for Dynamic Generation of Data Broadcasting Programs for a Broadcast Disk Array in a Mobile Computing.
Advertisements

Dissemination-based Data Delivery Using Broadcast Disks.
Multicasting in Mobile Ad hoc Networks By XIE Jiawei.
Introduction to Database Systems1 Records and Files Storage Technology: Topic 3.
Query Execution, Concluded Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 18, 2003 Some slide content may.
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
Data Integrity Proofs in Cloud Storage Sravan Kumar R, Ashutosh Saxena Communication Systems and Networks (COMSNETS), 2011 Third International Conference.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Hash-Based Indexes Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY.
Hash-based Indexes CS 186, Spring 2006 Lecture 7 R &G Chapter 11 HASH, x. There is no definition for this word -- nobody knows what hash is. Ambrose Bierce,
1 Hash-Based Indexes Module 4, Lecture 3. 2 Introduction As for any index, 3 alternatives for data entries k* : – Data record with key value k – –Choice.
Hash-Based Indexes The slides for this text are organized into chapters. This lecture covers Chapter 10. Chapter 1: Introduction to Database Systems Chapter.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11.
Quick Review of Apr 10 material B+-Tree File Organization –similar to B+-tree index –leaf nodes store records, not pointers to records stored in an original.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11.
Interpolating the Air for Optimizing Wireless Data Broadcast Fotis Tsakiridis, Panagiotis Bozanis, Dimitrios Katsaros Dept. of Computer & Communication.
Fall 2008Parallel Query Optimization1. Fall 2008Parallel Query Optimization2 Bucket Sizes and I/O Costs Bucket B does not fit in the memory in its entirety,
Chapter 11 (3 rd Edition) Hash-Based Indexes Xuemin COMP9315: Database Systems Implementation.
Copyright 2003Curt Hill Hash indexes Are they better or worse than a B+Tree?
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree.
Index tuning Hash Index. overview Introduction Hash-based indexes are best for equality selections. –Can efficiently support index nested joins –Cannot.
1 Balancing Push and Pull for Efficient Information Discovery in Large-Scale Sensor Networks Xin Liu, Qingfeng Huang, Ying Zhang CS 6204 Adv Top. in Systems-Mob.
Spatial Indexing I Point Access Methods. PAMs Point Access Methods Multidimensional Hashing: Grid File Exponential growth of the directory Hierarchical.
Processing Data in External Storage CS Data Structures Mehmet H Gunes Modified from authors’ slides.
Modern Information Retrieval
1 Hash-Based Indexes Yanlei Diao UMass Amherst Feb 22, 2006 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
Data Broadcast in Asymmetric Wireless Environments Nitin H. Vaidya Sohail Hameed.
1 Hash-Based Indexes Chapter Introduction  Hash-based indexes are best for equality selections. Cannot support range searches.  Static and dynamic.
1 Hash-Based Indexes Chapter Introduction : Hash-based Indexes  Best for equality selections.  Cannot support range searches.  Static and dynamic.
A New Broadcasting Technique for An Adaptive Hybrid Data Delivery in Wireless Mobile Network Environment JungHwan Oh, Kien A. Hua, and Kiran Prabhakara.
T. Imielinski, S. Viswanathan, and B.R. Badrinath Presented by Qinhai Xia Data on Air: Organization and Access.
Sorting and Query Processing Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 29, 2005.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 5, 6 of Elmasri “ How index-learning turns no student.
Massively Distributed Database Systems Broadcasting - Data on air Spring 2014 Ki-Joune Li Pusan National University.
Spring Efficient Dissemination of Enterprise Summary Data to Mobile Clients Mohamed A. Sharaf University of Pittsburgh.
Broadcast Protocols to Support Efficient Retrieval from Databases by Mobile Users By Anindya Datta, et al. Presented by Matt Miller February 20, 2003.
Chapter 11 Indexing & Hashing. 2 n Sophisticated database access methods n Basic concerns: access/insertion/deletion time, space overhead n Indexing 
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Hashing and Hash-Based Index. Selection Queries Yes! Hashing  static hashing  dynamic hashing B+-tree is perfect, but.... to answer a selection query.
Lecture 15- Parallel Databases (continued) Advanced Databases Masood Niazi Torshiz Islamic Azad University- Mashhad Branch
Storage Structures. Memory Hierarchies Primary Storage –Registers –Cache memory –RAM Secondary Storage –Magnetic disks –Magnetic tape –CDROM (read-only.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11 Modified by Donghui Zhang Jan 30, 2006.
Introduction to Database, Fall 2004/Melikyan1 Hash-Based Indexes Chapter 10.
1.1 CS220 Database Systems Indexing: Hashing Slides courtesy G. Kollios Boston University via UC Berkeley.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Indexed Sequential Access Method.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 10.
Feb 5, ECET 581/CPET/ECET 499 Mobile Computing Technologies & Apps Data Dissemination and Management 2 of 3 Lecture 7 Paul I-Hai Lin, Professor Electrical.
ASPLOS’02 Presented by Kim, Sun-Hee.  Technology trends ◦ The rate of frequency scaling is slowing down  Performance must come from exploiting concurrency.
Data Scheduling for Multi-item and transactional Requests in On-demand Broadcast Nitin Pabhu Vijay Kumar MDM 2005.
Massively Distributed Database Systems Broadcasting - Data on air Spring 2015 Ki-Joune Li Pusan National University.
Indexing By: Arnold Mesa. Indexing You can think of an index to a file like a catalogue to a library.
Chapter 5 Record Storage and Primary File Organizations
Chapter 5 Ranking with Indexes. Indexes and Ranking n Indexes are designed to support search  Faster response time, supports updates n Text search engines.
Distributed Skip Air Index for Smart Broadcasting in Intelligent Transportation Systems Leandros Maglaras and Dimitrios Katsaros Department of Computer.
CS 540 Database Management Systems
Data Dissemination and Management (2) Lecture 10
Are they better or worse than a B+Tree?
Hash-Based Indexes Chapter 11
Chapter 11: Indexing and Hashing
Outline Midterm results summary Distributed file systems – continued
B+-Trees and Static Hashing
Hash-Based Indexes Chapter 10
Indexing and Hashing Basic Concepts Ordered Indices
CSE 4340/5349 Mobile Systems Engineering
Index tuning Hash Index.
Hash-Based Indexes Chapter 11
Chapter 11 Instructor: Xin Zhang
Chapter 11: Indexing and Hashing
Data Dissemination and Management (2) Lecture 10
Presentation transcript:

Data dissemination in wireless computing environments

Introduction Characteristics of wireless computing environments Limited wireless channel bandwidth Unreliable transmission Asymmetric communication environments Limited effective battery lifespan Threat to security High cost

Design issues in wireless data dissemination Efficient wireless bandwidth utilization Efficient and effective scheduling strategies at the server Energy-efficient data access for battery-powered portable devices Support for disconnection Support for secured and reliable transmission

Models for information dissemination Point to point Push-based Pull-based Hybrid Static dynamic

Data broadcast scheduling Organization of broadcast for push-based broadcast system Access time Tuning time Broadcast program Flat program Skewed non-flat program Regular non-flat program Broadcast cycle

Generation of broadcast programs Flat program The union of all objects that needed by clients are broadcast The average access time is the same for all objects 20/80 rule Broadcast the frequently accessed objects more regularly than those that are less popular Naïve approach Probabilistically pick an object for transmission Problems?

Optimal broadcast program Copies of an object are equally spaced For any two objects x and y, fx/fy= fx: number of copies of item x in a broadcast cycle qx: access probabilities of item x It’s not always possible to generate such broadcast program

Broadcast disk Data is split into n partitions Data with similar access frequency is put in the same partition Partitions with larger access frequencies will be broadcast more often than those with smaller access frequencies

Scheduling strategies for pull-based broadcast system Methods FCFS LWF MRF R*W Performance metrics Responsiveness Scheduling overhead Robustness Fairness

Indexing on air Basic protocol for retrieving broadcast data

Flat broadcast programs with indexes (1,m) index A complete index is broadcast m times during a bcast All buckets have an offset to the beginning of the next index segment discussion High average access time Good tuning time Consideration Is it need to replicate the complete index between successive data blocks?

Tree-based index A data file is associated with a B+ tree index structure Broadcast media is a sequential medium, the data file and index must be flattened Preorder traversal First k levels of the index will be partially replicated in the broadcast, and the remaining levels will not be replicated All non-replicated buckets contain pointers that will direct the search to the next copy of its replicated ancestors

Hash-based index Data are hashed into a set of partitions Partitions may have different sizes (nonuniform distribution) Hpartition(k) : determines the partition that that object k belong to Hhash: determines the hash bucket that contains the shift pointer The gap between hash buckets is given by the size of the smallest partition Hhash = 1+(hpartition-1)*gap

Signature-based index An abstraction of the information stored in an object or a file K-levels of signature The higher the level is, the coarser the granularity of the grouping is Each integrated signature is broadcast before the corresponding group of objects. To reduce the number of false drops, the hashing functions used in generating the signatures at different levels should be different.

Flexible indexing scheme Split a sorted list of objects into several equal-sized segments At the beginning of each segment, there is a control index Global index Local index

Discussion

Broadcast program generation for skewed data access Access frequencies can be exploited to design index methods that further minimize the average number of index probes Two kinds of approach Imbalanced tree approach Non-flat broadcast programs with indexes

Non-flat broadcast programs with indexes Segment-level index Similar to the (1,m) index scheme Broadcast a full index at the beginning of each segment The broadcast program is generated under broadcast disks Problems The cost to find the index may be very large

Distributed indexing Each segment index is split into sub-indexes that are distributed within its corresponding segment