Presentation is loading. Please wait.

Presentation is loading. Please wait.

NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu.

Similar presentations


Presentation on theme: "NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu."— Presentation transcript:

1 NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu

2 2003 2MASS Point Source Catalog 0.5 billion rows > 100 columns 2013 AllWISE Source Catalog 0.75 billion rows > 300 columns

3 Gum31, AllWISE Source Catalog, 0.5d search. Data are selected in each of the 3 views.

4 Sky area: box with center 150.12, +2.21 and length 5400 arcsec. CatalogRows, Columns (short form default) Space on disk (ascii IPAC Table) AllWISE Source Catalog 30,000 rows, 47 columns13MB / 9B per cell COSMOS Cassata morphology Catalog 230,000 rows,15 columns62MB / 18B per cell Spitzer Source List250,000 rows, 148 columns416MB / 11B per cell Table covers one page at a time. Image overlay and plot should cover all rows. How do we visualize this much data?

5 Points on top of each other - hard to distinguish - hard to interpret - can be aggregated Plot area: 400 x 400 px 2 Symbol size: 5 x 5 px 2 160,000 px 2 / 25 px 2 = 6400 230,000 catalog rows are plotted with 5960 square symbols

6  Data aggregation technique  Used by statistical packages (R or SDSS)  2-d histogram; shade represent N p in bin  Outlier preserving

7 Color-color diagram created from AllWISE Source Catalog. 1 degree cone search. Lockman Hole. 46,475 data points from are represented by 1,598 bins.

8 Same diagram, different shading scheme. Darker – 3.1 times more points.

9 x:y – aspect ratio N bins – maximum number of bins N x = (int)sqrt( N bins * [x:y] ) N y = (int)sqrt( N bins / [x:y] ) binsize x = (x max – x min ) / N x + pad x binsize y = (y max – y min ) / N y + pad y

10 SERVER SIDE CLIENT SIDE Reduces transferred data size Used for larger tables (> 30,000 rows) Reduces rendered data size Common plot operations – zoom, select – do not require server call Used for smaller tables (up to 30,000 rows)

11 1. Retrieve data from low-level query and data service 2. Apply dynamic [current table] filters 3. Apply current sorting order 4. Aggregate data for visualization stream table processing – one row at a time cache intermediate results cache intermediate results fix plot aspect ratio fix plot aspect ratio Policies

12 Filtering from image overlay. How to find matching rows?  Aggregation parameters must be preserved!

13  Aggregation parameters  X, Y names or expressions  Minimum values: x min, y min  Step sizes: binsize x, binsize y  For each aggregated value  Bin index  Number of points

14  Binning is efficient aggregation technique  Use client-side binning for smaller tables  Preserve aggregation parameters to move between aggregated and full data  Process one row at a time / cache on server  Fix aspect ratio on client

15 NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu


Download ppt "NASA/IPAC Infrared Science Archive Tatiana Goldina, Loi Ly, Trey Roby, Xiuqin Wu."

Similar presentations


Ads by Google