Download presentation
Presentation is loading. Please wait.
Published byAllison Kelly Modified over 9 years ago
1
Open Earth Framework Dealing with file formats, data semantics, and other gotchas Dave Nadeau John Moreland
2
Want to... Connect lots of tools together Data collection + processing + visualization No one tool does it all Data exchange conventions are needed
3
Want to... Plot earthquakes as dots on a map Paste a gravity map atop a terrain Slice 3D tomography below a terrain Do this all at once
4
Have to... Work with lots of data types Lists, grids, geometry, time series, hierarchical, overlapping, multichannel,... Stitch together software Ideally use open source libraries But usually have to write new code too
5
Semantics & formats Define: Semantics are the meaning & structure of data File formats store it Both must be standardized So software & users can depend upon it But...
6
Lots of gotchas... A few of our pet peeves
7
Lack of standards Sometimes no suitable standard exists Have to resort to ad hoc standards Text files Custom software README’s and code comments /* * Column 1 = latitude (degrees) * Column 2 = longitude (radians) * Column 3 = depth (furlongs) * Column 4 = age (dog years) */ /* * Column 1 = latitude (degrees) * Column 2 = longitude (radians) * Column 3 = depth (furlongs) * Column 4 = age (dog years) */
8
Often not enough semantics in files CSV has no coordinate space or standard columns GXF has no field units NetCDF has no geoscience conventions Still need README’s Incomplete standards Lat Lon Flork Yeem Snorf Wiffle Bloop 23.5 43.1 -18.5 A37 $ ✔ #! ☛☀ ☻ Ω ☢☂ 23.2 44.8 -27.5 Ö8⅓ ❸✸✠ ✂☎✇ ✈ ℅©™ 24.1 45.7 -8.9 ß4½ § ➥ ξ ❃ ‡₪ ⊗ ∞ Lat Lon Flork Yeem Snorf Wiffle Bloop 23.5 43.1 -18.5 A37 $ ✔ #! ☛☀ ☻ Ω ☢☂ 23.2 44.8 -27.5 Ö8⅓ ❸✸✠ ✂☎✇ ✈ ℅©™ 24.1 45.7 -8.9 ß4½ § ➥ ξ ❃ ‡₪ ⊗ ∞
9
Incomplete standards Usually missing provenance Author? Contact? Creation date? How collected and processed? Need to track these to understand source, value, and give credit Author: Joe’s pizza and geodata emporium Credit: Joe and his brother Zeek Date: last wednesday after lunch Methods: Counted Zeek’s paces (size 12 shoe) Author: Joe’s pizza and geodata emporium Credit: Joe and his brother Zeek Date: last wednesday after lunch Methods: Counted Zeek’s paces (size 12 shoe)
10
Proprietary standards Rarely publicly documented Few “official” ESRI format specs Mostly reverse engineered Can change at any time Format supports tool, not community Changes “encourage” buying new software version 3.0 += version 3.0.0.1
11
Misused standards Improperly used features Odd field names and units Z scale/up/down assumptions != format’s defaults Non-descriptive titles, descriptions, & authors Missing optional features Missing field names, units, scales Missing coordinate system
12
Misused standards GeoTIFF is misunderstood Stores geolocated data, not just an image BadGood
13
Complex standards GeoSciML does “everything” Hard to use it just a little bit
14
Inefficient standards Text: 2x to 3x expansion of binary data 4 Gbytes becomes 8 to 12 Gbytes XML: another 2x to 3x expansion 4 Gbytes becomes 16 to 36 Gbytes BinaryXML text
15
Un-indexed standards Missing table of contents Needed to: Find data you want, skipping the rest Do so repeatedly and efficiently Do so when whole data set won’t fit in memory
16
Suggestions... A few of our thoughts
17
Use existing standards Lots to choose between NetCDF or HDF for lists and 2D/3D grids GeoTIFF for image overlays Use them if suitable Already documented Already debugged Often widely supported
18
Prefer open standards Free Community-driven No vendor lock-in Lots of software already
19
Follow conventions Add all the metadata at creation time Too easy to forget to add it later No more README’s needed They’re easily lost or out of date
20
Create conventions Make new conventions if necessary Standard names for fields “Height”, “Depth”, “Elevation”, or “Z”? Standard field units Meters or kilometers? Standard field use Positive Z up or down?
21
Simplify standards Use subsets of standards GeoSciML packages NetCDF conventions
22
Use efficient standards Prefer binary Such as NetCDF, HDF, GeoTIFF Prefer those with a table of contents Such as NetCDF, HDF, GeoTIFF
23
What we’re doing... Developing a software framework for Data handling Processing Visualization
24
Open Earth Framework Open source, modular, portable Java, threaded, 3D accelerated Applications & construction kits Interactive & batch tools Integrates with other software Doesn’t assume one “right” way to do anything
25
Open Earth Framework Collection of construction kits (libraries) Few dependencies Combine with other software Build your own applications, batch tools, and web services Several pre-built applications
26
Open Earth Framework Data handling construction kits Standard & common formats Format conversion File “completion” to restore missing info Web service construction kits Standard & custom protocols Mix, match, and extend
27
Open Earth Framework Visualization construction kits 2D & 3D Lots of layers Common and new geoscience visual representations Control over colors, shapes, etc. Interactive Mix, match, and extend
28
Open Earth Framework User interface construction kits 3D canvases Control panels Common menus, toolbars, dialogs Mix, match, and extend
29
Open Earth Framework Terrains and overlays Data draped over terrain Color, shading, & transparency control
30
Open Earth Framework Tomography Sectioning planes & isosurfaces Color, shading, & transparency control
31
Open Earth Framework Dots, lines, and shapes Above, atop, or below terrain Color, size, & transparency control
32
Open Earth Framework All at once
33
Open Earth Framework Ongoing development Alpha release... soon? We want your input... Formats, vis techniques, processing, interaction, etc.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.