NASA Space Communications Symposium

Slides:



Advertisements
Similar presentations
Examples of Physical Query Plan Alternatives
Advertisements

Data Bits Models Classes & Schemes Rows & Tables Keys Associations $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 Final DataBit.
COMP 5138 Relational Database Management Systems Semester 2, 2007 Lecture 5A Relational Algebra.
CS4432: Database Systems II
Query Optimization CS634 Lecture 12, Mar 12, 2014 Slides based on “Database Management Systems” 3 rd ed, Ramakrishnan and Gehrke.
D ATABASE S YSTEMS I R ELATIONAL A LGEBRA. 22 R ELATIONAL Q UERY L ANGUAGES Query languages (QL): Allow manipulation and retrieval of data from a database.
ItCompress: An Iterative Semantic Compression Algorithm
Search Engines and Information Retrieval
Progress Report on Continuous Data Stream Management  Mining Frequent Itemsets over Data Streams  Music Virtual Channel Presented by: Dr. Yi-Hung Wu.
1 8. Safe Query Languages Safe program – its semantics can be at least partially computed on any valid database input. Safety is tied to program verification,
Project Title Project Investigators Project Duration: (E.g., 3 years; currently in year 2, or x months if this is a better representation of project time)
Peter Dinda Department of Computer Science Northwestern University Beth Plale Department.
Data Mining – Intro.
Copyright © 2016 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Overview of Distributed Data Mining Xiaoling Wang March 11, 2003.
Search Engines and Information Retrieval Chapter 1.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Query Evaluation Chapter 12: Overview.
Page 1 Semantic Data Compression Techniques for NASA and Mobile Computing Databases Principal Investigators: G. Ozsoyoglu, Z.M. Ozsoyoglu Case Western.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
DANIEL J. ABADI, ADAM MARCUS, SAMUEL R. MADDEN, AND KATE HOLLENBACH THE VLDB JOURNAL. SW-Store: a vertically partitioned DBMS for Semantic Web data.
Analysis of algorithms Analysis of algorithms is the branch of computer science that studies the performance of algorithms, especially their run time.
Page 1 Biosensor Networks Principal Investigators: Frank Merat, Wen H. Ko Task Number: NAG Case Western Reserve University September 18, 2002 NASA.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Query Execution Section 15.1 Shweta Athalye CS257: Database Systems ID: 118 Section 1.
PROCESSING OF DATA The collected data in research is processed and analyzed to come to some conclusions or to verify the hypothesis made. Processing of.
Page 1 Remote Interaction With Machines Principal Investigator: Vincenzo Liberatore Task Number: NAG Case Western Reserve University September 18,
Introduction to Query Optimization, R. Ramakrishnan and J. Gehrke 1 Introduction to Query Optimization Chapter 13.
Page 1 Active MEMS for Wireless and Optical Space Communications Principal Investigators: Frank Merat, Stephen Phillips Task Number: NAG Case Western.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Query Optimization Chapter 13.
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
Page 1 Task Title Placed Here Principal Investigator: Enter Name Task Number: Y0T3869 (enter your task number) Enter Your Affiliation (GRC, University,
Jennifer Widom Relational Databases The Relational Model.
CS-508 Databases and Data Mining By Dr. Noman Hasany.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
SketchVisor: Robust Network Measurement for Software Packet Processing
Information Retrieval in Practice
CPSC-310 Database Systems
Oracle Advanced Analytics
Creating Database Objects
CPSC 603 Database Systems Lecturer: Laurie Webster II, Ph.D., P.E.
Machine Learning with Spark MLlib
Data Mining – Intro.
Capability-Sensitive Query Processing on Internet Sources
How To Build a Compressed Bitmap Index
Chapter 1 Introduction.
CSCI5570 Large Scale Data Processing Systems
IS301 – Software Engineering Dept of Computer Information Systems
Chapter 1 Introduction.
Introduction to Query Optimization
Relational Algebra Chapter 4, Part A
Jiawei Han Department of Computer Science
Chapter 15 QUERY EXECUTION.
Introduction lecture1.
Introduction to Database Systems
Examples of Physical Query Plan Alternatives
Chair of Tech Committee, BetterGrids.org
Active MEMS for Wireless and Optical Space Communications
Database.
Relational Databases The Relational Model.
Relational Databases The Relational Model.
Query Processing B.Ramamurthy Chapter 12 11/27/2018 B.Ramamurthy.
The Relational Algebra
Relational Algebra Chapter 4, Sections 4.1 – 4.2
Query Execution Presented by Jiten Oswal CS 257 Chapter 15
Probabilistic Databases
By: Ran Ben Basat, Technion, Israel
Introduction to Database Systems
Creating Database Objects
Pattern Analysis Prof. Bennett
Presentation transcript:

NASA Space Communications Symposium Semantic Data Compression Techniques for Mobile Computing and Stream Data Principal Investigators: G Ozsoyoglu, Z.M. Ozsoyoglu Task Number: NAG3-2578 Case Western Reserve University September 18, 2002

Semantic Data Compression Project Overview Start Date: 8/1/2001 End date: 3/31/2003 Querying Compressed Tables: Designing compression-aware query languages Compromise between query expressive power and compression efficiency Querying Compressed Data Streams: Real-time, one-pass-only stream querying and compression efficiency

Semantic Data Compression Enterprise Relevance and Impact Enterprise Relevance: Table and stream data occur frequently in computer networks, distributed mobile networks, and telecommunication networks such as the Earth Science Enterprise, Space Science Enterprise, Mars Network, and Space-Based Internets of NASA. Compression and querying of stream data is directly applicable to NASA projects. Impact: Databases will be compressed on a “query-need” basis. Query engines will be aware of the compression employed and perform efficient querying.

Milestones - Technical Accomplishments and Schedules Task Title Placed Here Milestones - Technical Accomplishments and Schedules Due Date Milestone Description Tech Accomplishments 1 2 10/2001 10/2002 Survey table compression techniques. Compression-aware query processing algorithms Report generated. In progress. Schedule Status Schedule Deviation 1 Completed 2 On schedule

A large number of compression techniques. Syntactic compression: Compress byte strings. Semantic Compression: Employ data semantics in approximating data; Answer queries with a guaranteed upper bound on the error of approximation. Representative tuples and outliers (row-wise relationships) Classification and regression trees (column-wise rel.s) Employ attribute domain information.

Given a compressed database DB and query Q, Evaluate Q on DB without decompressing DB; decompress output. Best for existing query engines; low compression ratio. By first decompressing selected relations/columns. Cost: Rewriting tables before Q evaluation. By decompressing tuple components (selectively) during query evaluation. Cost: On a per-query basis. Requires query engine changes, fast random decompression. Algebraic Laws: Commutativity Op(DeCmp(T)) =? DeCmp(Op(T))

Semantic Data Compression Funding Issues This is a research initiation project with a two-year funding of $35,869. There are no funding issues.