On-the-fly Visualization of Scientific Geospatial Data Using Wavelets

Slides:



Advertisements
Similar presentations
Números.
Advertisements

University Paderborn 07 January 2009 RG Knowledge Based Systems Prof. Dr. Hans Kleine Büning Reinforcement Learning.
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
PDAs Accept Context-Free Languages
20% In-house rental/New Third Party 21% TP upgrade & Resign potential 10% of our possible servicing customers That exposes 17% of our customers Approx.
EuroCondens SGB E.
Worksheets.
Reinforcement Learning
1 When you see… Find the zeros You think…. 2 To find the zeros...
Western Public Lands Grazing: The Real Costs Explore, enjoy and protect the planet Forest Guardians Jonathan Proctor.
Add Governors Discretionary (1G) Grants Chapter 6.
CALENDAR.
1 1  1 =.
1  1 =.
CHAPTER 18 The Ankle and Lower Leg
Summative Math Test Algebra (28%) Geometry (29%)
Learning to show the remainder
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
The 5S numbers game..
突破信息检索壁垒 -SciFinder Scholar 介绍
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
The basics for simulations
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Factoring Quadratics — ax² + bx + c Topic
EE, NCKU Tien-Hao Chang (Darby Chang)
© 2010 Concept Systems, Inc.1 Concept Mapping Methodology: An Example.
Mental Math Math Team Skills Test 20-Question Sample.
The Pecan Market How long will prices stay this high?? Brody Blain Vice – President.
Money Math Review.
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
1 Prediction of electrical energy by photovoltaic devices in urban situations By. R.C. Ott July 2011.
Dynamic Access Control the file server, reimagined Presented by Mark on twitter 1 contents copyright 2013 Mark Minasi.
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
Progressive Aerobic Cardiovascular Endurance Run
Biology 2 Plant Kingdom Identification Test Review.
Visual Highway Data Select a highway below... NORTH SOUTH Salisbury Southern Maryland Eastern Shore.
Area under curves Consider the curve y = f(x) for x  [a, b] The actual area under the curve is units 2 The approximate area is the sum of areas.
Intercollegiate FRCS Update
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
When you see… Find the zeros You think….
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
Before Between After.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
ROMS User Workshop, October 2, 2007, Los Angeles
Numeracy Resources for KS2
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Detecting Spam Zombies by Monitoring Outgoing Messages Zhenhai Duan Department of Computer Science Florida State University.
Group Meeting Presented by Wyman 10/14/2006
Static Equilibrium; Elasticity and Fracture
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Resistência dos Materiais, 5ª ed.
& dding ubtracting ractions.
Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.
Take any two digit number between 11 to Add the numbers (Example: Number 54 : 5+4=9). 2. Now subtract the answer from original number (i.e. 54-9=45,
A Data Warehouse Mining Tool Stephen Turner Chris Frala
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Introduction Embedded Universal Tools and Online Features 2.
What impact does the address have on the tribe?
úkol = A 77 B 72 C 67 D = A 77 B 72 C 67 D 79.
Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.
Optimal Workload-Based Weighted Wavelet Synopsis
1 Wavelets for Efficient Querying of Large Multidimensional Datasets Wavelets for Efficient Querying of Large Multidimensional Datasets Cyrus Shahabi University.
Constructing Optimal Wavelet Synopses Dimitris Sacharidis Timos Sellis
1 C. Shahabi Mining Multidimensional Databases Mining Multidimensional Databases Cyrus Shahabi University of Southern California Dept. of Computer Science.
Dense-Region Based Compact Data Cube
Chapter 15: Wavelets (i) Fourier spectrum provides all the frequencies
Presentation transcript:

On-the-fly Visualization of Scientific Geospatial Data Using Wavelets GeoDA On-the-fly Visualization of Scientific Geospatial Data Using Wavelets Cyrus Shahabi, Farnoush Banaei-Kashani, Kai Song Introduce: self (postdoc, USC), paper (title and GeoDA), coauthors On behalf of Kai, Bear with me

Outline Motivation and Problem Definition Our Solution: GeoDA Summary Underlying Technology Background: Discrete Wavelet Transform WOLAP Prototype System Development Summary Future Work

USC-JPL SURP Project Cyrus Shahabi and Farnoush Banaei-Kashani USC-JPL collaboration under a SURP project Earth Science Data Visualization Cyrus Shahabi and Farnoush Banaei-Kashani Information Laboratory (InfoLab) University of Southern California (USC) Los Angeles, CA 90089 [shahabi,banaeika]@usc.edu http://infolab.usc.edu Yi Chao and Peggy Li Climate, Oceans, and Solid Earth Science Section Jet Propulsion Laboratory (JPL) Pasadena, CA 91109 [yi.chao,peggy.li]@jpl.nasa.gov http://science.jpl.nasa.gov/COSE/

Earth Science Data Visualization Without Re-scaling Example: SST as data (dimensions and measure attribute), and average as the query Generalize Color-coded map: based on resolution each pixel is a aggregate query covering data of the corresponding area Interactive: Range selection Range Selection With Re-scaling

Earth Science Data Visualization Range Selection Range Re-scaling Aggregated query over latitude, longitude and/or time

Off-line vs. On-the-fly Visualization Off-line Visualization Pre-selected range (and resolution) Visualization by query pre-computation On-the-fly Visualization On-the-fly range (and resolution) selection Visualization by on-the-fly query computation to support dynamic data

Outline Motivation and Problem Definition Our Solution: GeoDA Summary Underlying Technology Background: Discrete Wavelet Transform WOLAP Prototype System Development Summary Future Work Give overview of solution that enables on-the-fly visualization here

Discrete Wavelet Transform 80 70 60 90 37 67 50 50 a {1/2, 1/2} {1/2, -1/2} 75 63 12 -15 75 52 50 5 -15 -15 75 51 1 =DWT(a) â =Wa â 63 12 A transform where the generated data has interesting properites Describe wavelet by example Assume one dimensional dataset How to generate? What are properties? Coefficients are representatives and they are ordered based on their energy Compression as well Multi-resolution view: Compression! 75 60 90 36 66 50 a′ 80 70 37 67 50 60 90 75 51 63 75 52 50 63 12 5 -15 1 â * For simplification, assume {1/2, 1/2} and {1/2, -1/2} as filters instead of the Haar filters {1/2, 1/2} and {1/2, -1/2}.

Wavelets in Databases Our work (WOLAP)2: Others’ work1: Query Compression Reason: fast response time Define range-sum query as dot product of query vector and data vector At the query time, we have the knowledge of what is important to the pending query More opportunities: Progressive results Data-independent approximation Others’ work1: Data Compression Reason: save space? Implicit reason: queries deal with smaller datasets and hence faster Problems: Only approximate results! Very data-dependant Different error rates for different queries 1 See Vitter-CIKM'98, Vitter-SIGMOD'99, Agrawal-CIKM'00, Garofalakis-VLDB'00 2 See Schmidt-PODS‘02, Schmidt-EDBT‘02, Jahangiri-SIGMOD’05

WOLAP Example Original Wavelet* Result=504 Result=178.19*2.83=504 80 70 60 90 37 67 50 80 70 60 90 37 67 50 80 70 60 90 37 67 50 â 178.19 33.94 7.07 -21.21 2 178.19 33.94 7.07 -21.21 2 178.19 33.94 7.07 -21.21 2 1 2.83 Result=504 Result=178.19*2.83=504 (Parseval Theorem) 1 1.73 -.35 -1 .5 .71 Result=304 Result=178.19*1.73+33.94*(-.35)+2*.5 =304 (Parseval Theorem) ~303 (99% accuracy!) O(N) << O(log N) * Here we assume the actual Haar filter: {1/2, 1/2} and {1/2, -1/2}

WOLAP Query Complexity: O(log n) 1 1.4 1.4 1.4 1.4 1.4 1.4 0.7 0.7 1.0 2.0 2.0 1.5 -1 0.5 2.1 2.5 -.7 0.4 3.3 -.3 3.3 -.3 -.7 0.4 -1 0.5 0.7 So summarize WOLAP Assuming that the query is of size N: Theorem 1: Using “lazy wavelet transform” (computing only on the boundaries of the selected range), one can transform any polynomial range-aggregate query in O(log N) to wavelet domain. Theorem 2: The query has O(log N) non-zero values in wavelet domain.

Related Work Agrawal-SIGMOD'97 Abbadi-ICDE'99 Abbadi-Dawak'00 Literature Support not only for queries but for update And more: progressiveness, e.g. N=domain size for each dimension d=number of dimensions

Outline Motivation and Problem Definition Our Solution: GeoDA Summary Underlying Technology Background: Discrete Wavelet Transform WOLAP Prototype System Development Summary Future Work

WOLAP Query Engine (ProDA) GeoDA Architecture Google Map Mashup Presentation Tier WOLAP Query Engine (ProDA) Plotting Tools Query Tier Wavelet Datacubes Data Tier Text Files NC Files

Helena Data Helene Dataset Helene Datacube 10+ dimensions (selected longitude and latitude) 100+ Variables (selected SST) 1km by 1km resolution, daily samples, world-wide 36000  18000 data points per sample (~1/3 of which are null) Helene Datacube Dimensions: Latitude, Longitude Variable: SST

Presentation Tier Implementation Progressive Visualization Cross-language development – JavaScript, C#, ASP.NET AJAX Multi-thread programming Progressive Visualization Demo: Interface Flexible ranges Rescaling GeoDA

Outline Motivation and Problem Definition Our Solution: GeoDA Summary Underlying Technology Background: Discrete Wavelet Transform WOLAP Prototype System Development Summary Future Work

Summary We devised a framework for on-the-fly visualization of large-scale scientific datasets. We designed and exploited a fast range-aggregate query processing technique, WOLAP, that enables on-the-fly visualization. WOLAP supports the family of polynomial range-aggregate queries. We developed a prototype system, GeoDA, as a proof-of-concept based on the designed visualization framework and query processing technique.

Future Work Supporting dynamic datasets by extending WOLAP to handle append of the data stream in wavelet domain. Enhancing WOLAP via caching, to enable group/batch aggregate queries.

Q & A