NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu.

Slides:



Advertisements
Similar presentations
Enter the data range in the Input range box
Advertisements

© Copyright 2008 All rights reserved 2 VO-India Project Started in 2002 as a collaboration between IUCAA and Persistent Systems Ltd. Part of International.
Database Performance Tuning and Query Optimization
Metadata in the TAP context (1) The Problem: learn about which tables, tablesets,... are available from a TAP server for each of the tables / tablesets,
Biostatistics Unit 3 Graphs 1. Grouped data Data can be grouped into a set of non- overlapping, contiguous intervals called class intervals (Excel calls.
Stem and leaf diagrams and box plots Statistics. Draw a stem and leaf diagram using the set of data below
Document Image Processing
CDS 301 Fall, 2009 Image Visualization Chap. 9 November 5, 2009 Jie Zhang Copyright ©
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Modal Class Exercise. The masses, in grams, of 100 potatoes are shown in the table below: Mass (m) gramsFrequency 0 < m  < m  < m 
Reporting In Visual Studio 2005 An in-depth look at Reporting Internals Vijay Shandilya Software Engineer.NET Elite Team Aithent Technology.
Ch. 2: The Art of Presenting Data Data in raw form are usually not easy to use for decision making. Some type of organization is needed Table and Graph.
Evaluation of MineSet 3.0 By Rajesh Rathinasabapathi S Peer Mohamed Raja Guided By Dr. Li Yang.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Mary K. Olson PS Reporting Instance – Query Tool 101.
Computer Science 101 Web Access to Databases SQL – Extended Form.
IManage – New Table Option For better sorting, filtering and sorting.
SQL Server Reporting Services London Database Developer Forum Anoop Patel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
With Microsoft Office 2007 Introductory© 2008 Pearson Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Office 2007 Introductory.
Elasticsearch in Dashboard Data Management Applications David Tuckett IT/SDC 30 August 2013 (Appendix 11 November 2013)
The Nav Bar. Nav is short for Navigation. Having a Navigation Bar makes searching for information easier on those accessing your page. Here are some common.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
SQLXPress Visual Query Tuner Copyright © 2014 Merlon Software Corporation.
Analyzing Data For Effective Decision Making Chapter 3.
Ashwani Roy Understanding Graphical Execution Plans Level 200.
Esri UC2013. Technical Workshop. Technical Workshop 2013 Esri International User Conference July 8–12, 2013 | San Diego, California Caching Imagery Using.
Best Practices for Designing Effective Map Services Tanu Hoque.
Columns run horizontally in tables and rows run from left to right.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
Microsoft Office 2007 Intermediate© 2008 Pearson Prentice Hall1 PowerPoint Presentation to Accompany GO! With Microsoft ® Office 2007 Intermediate Chapter.
With Microsoft Office 2007 Intermediate© 2008 Pearson Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Office 2007 Intermediate.
Html Tables Basic Table Markup. How Tables are Used For Data Display Tables were originally designed to display and organize tabular data (charts, statistics,
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
HPDC 2013 Taming Massive Distributed Datasets: Data Sampling Using Bitmap Indices Yu Su*, Gagan Agrawal*, Jonathan Woodring # Kary Myers #, Joanne Wendelberger.
CIS234A- Lecture 7 Instructor Greg D’Andrea. Tables A table can be displayed on a Web page either in a text or graphical format. A text table: – contains.
T-SQL: Simple Changes That Go a Long Way DAVE ingeniousSQL.com linkedin.com/in/ingenioussql.
Crossmatch: the NRAO Cross-identifcation Service Jared Crossley Data Archive Access Meeting - Socorro, NM - 5 April 2010.
Japanese Virtual Observatory (JVO) National Astronomical Observatory of Japan (NAOJ) Contact Address: Demo 1: GRID-based database federation.
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
What is Firefly (1) A web UI framework for web applications
1 Chapter 9 Tuning Table Access. 2 Overview Improve performance of access to single table Explain access methods – Full Table Scan – Index – Partition-level.
Smart Catalog App Content legend To be read in conjunction with: KL-GE Smart Catalog Content Specifications.docx Kevin G. Lee /// Feb. 26, 2014.
1 SUI/T Team – Technical lead: Xiuqin Wu – Science Lead: David Ciardi – System Engineer/Architect: John Rector, Trey Roby – System Design Scientist: Gregory.
Multivariable Critical Points
Database Systems, 8 th Edition SQL Performance Tuning Evaluated from client perspective –Most current relational DBMSs perform automatic query optimization.
What’s Next 2015 H2 – Improve cube algorithm Cube by segments, 30%-50% faster Build delay down to tens of minutes – Streaming cubing Analyze real-time.
Diving into Query Execution Plans ED POLLACK AUTOTASK CORPORATION DATABASE OPTIMIZATION ENGINEER.
Analyzing Data Week 1. Types of Graphs Histogram Must be Quantitative Data (measurements) Make “bins”, no overlaps, no gaps. Sort data into the bins.
CHAPTER 7 LESSON B Creating Database Reports. Lesson B Objectives  Describe the components of a report  Modify report components  Modify the format.
Chris Index Feng Shui Chris
Stem and leaf diagrams and box plots
PDAC Roadmap from SUIT point of view
T-SQL: Simple Changes That Go a Long Way
Database Performance Tuning &
Haim Kaplan and Uri Zwick
Database Performance Tuning and Query Optimization
Implementation of PX-Web at Statistics Norway
Volume 129, Issue 2, Pages (April 2007)
Database systems Lecture 6 – Indexes
Query Functions.
Chapter 11 Database Performance Tuning and Query Optimization
Investigating Focal Adhesion Substructures by Localization Microscopy
LSST, the Spatial Cross-Match Challenge
Number Summaries and Box Plots.
SQL Server Indexing for the Client Developer
Presentation transcript:

NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu

2003 2MASS Point Source Catalog 0.5 billion rows > 100 columns 2013 AllWISE Source Catalog 0.75 billion rows > 300 columns

Gum31, AllWISE Source Catalog, 0.5d search. Data are selected in each of the 3 views.

Sky area: box with center , and length 5400 arcsec. CatalogRows, Columns (short form default) Space on disk (ascii IPAC Table) AllWISE Source Catalog 30,000 rows, 47 columns13MB / 9B per cell COSMOS Cassata morphology Catalog 230,000 rows,15 columns62MB / 18B per cell Spitzer Source List250,000 rows, 148 columns416MB / 11B per cell Table covers one page at a time. Image overlay and plot should cover all rows. How do we visualize this much data?

Points on top of each other - hard to distinguish - hard to interpret - can be aggregated Plot area: 400 x 400 px 2 Symbol size: 5 x 5 px 2 160,000 px 2 / 25 px 2 = ,000 catalog rows are plotted with 5960 square symbols

 Data aggregation technique  Used by statistical packages (R or SDSS)  2-d histogram; shade represent N p in bin  Outlier preserving

Color-color diagram created from AllWISE Source Catalog. 1 degree cone search. Lockman Hole. 46,475 data points from are represented by 1,598 bins.

Same diagram, different shading scheme. Darker – 3.1 times more points.

x:y – aspect ratio N bins – maximum number of bins N x = (int)sqrt( N bins * [x:y] ) N y = (int)sqrt( N bins / [x:y] ) binsize x = (x max – x min ) / N x + pad x binsize y = (y max – y min ) / N y + pad y

SERVER SIDE CLIENT SIDE Reduces transferred data size Used for larger tables (> 30,000 rows) Reduces rendered data size Common plot operations – zoom, select – do not require server call Used for smaller tables (up to 30,000 rows)

1. Retrieve data from low-level query and data service 2. Apply dynamic [current table] filters 3. Apply current sorting order 4. Aggregate data for visualization stream table processing – one row at a time cache intermediate results cache intermediate results fix plot aspect ratio fix plot aspect ratio Policies

Filtering from image overlay. How to find matching rows?  Aggregation parameters must be preserved!

 Aggregation parameters  X, Y names or expressions  Minimum values: x min, y min  Step sizes: binsize x, binsize y  For each aggregated value  Bin index  Number of points

 Binning is efficient aggregation technique  Use client-side binning for smaller tables  Preserve aggregation parameters to move between aggregated and full data  Process one row at a time / cache on server  Fix aspect ratio on client

NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu