A Declarative Domain-Free Approach for Querying and Generating Visualizations Nicholas Del Rio 1 1 Committee Chair:Dr. Paulo Pinheiro 1 Dr. Vladik Kreinovich.

Presentation transcript:

A Declarative Domain-Free Approach for Querying and Generating Visualizations
Nicholas Del Rio (1)
Committee Chair: Dr. Paulo Pinheiro (1)
Dr. Vladik Kreinovich (1), Dr. Rodrigo Romero (1), Dr. Aaron Velasco (2)
(1) Dept. of Computer Science and (2) Dept. of Geological Science

Outline
1. The Visualization Toolkit Deluge
2. Visualization Queries
3. A Toolkit-Centric Model Supporting Queries
4. Answering Visualization Queries
5. Applications
6. User Study
7. Conclusion

At AGU 2012…
Thousands of scientists were generating visualizations using a variety of toolkits.

Multiple Toolkits
Some scientists are proficient with more than one toolkit.
[Chart: percentage of users by number of toolkits]

The Benefit of Multiple Toolkits
Near neighbor vs. surface gridding techniques
3D views: isosurfaces vs. point plot
There are many ways to visualize a single dataset.
Multiple visualizations provide different perspectives of the data and thus different insights.

Challenges of using Multiple Toolkits

VTK (Java) pipeline fragment:
    dos.DebugOn();
    dos.SetExecuteMethod(this, "loadFieldData");
    dos.Update();
    // Grid the scattered points with Shepard's method
    vtkShepardMethod sm = new vtkShepardMethod();
    sm.DebugOn();
    sm.SetInputConnection(aa.GetOutputPort());
    sm.SetSampleDimensions(40,42,1);
    sm.SetMaximumDistance(0.2);
    sm.SetModelBounds(-109,-107,33,34,0,1);
    sm.Update();
    // Extract a volume of interest (a single z-slice)
    vtkExtractVOI ev = new vtkExtractVOI();
    ev.SetInputConnection(sm.GetOutputPort());
    ev.SetVOI(0,40,0,42,0,0);
    ev.Update();
    // Generate 10 contour values across the scalar range
    vtkContourFilter contours = new vtkContourFilter();
    contours.SetInputConnection(ev.GetOutputPort());
    contours.DebugOn();
    contours.GenerateValues(10, ev.GetOutput().GetScalarRange());
    contours.Update();

GMT (shell) pipeline fragment:
    # Drop the header row and keep the first three columns
    awk 'BEGIN {FS=" "} {if (NR > 1) print $1,$2,$3}' $infile > $tmpfile
    # Use the supplied region, or compute it from the data
    if [ $calc_region -eq 0 ] ; then
        region=$5
    else
        minmaxvals=`minmax -C $tmpfile`
        set -- $minmaxvals
        region="${1}/${2}/${3}/${4}"
    fi
    nearneighbor -R$region -I$gridspacing -S$searchradius -G$gridfile $tmpfile
    colorsFile=$workspace/colors.cpt
    makecpt -C$colorPallet -T$colorrange > $colorsFile
    grdimage $infile -J$projection -P -B$boundaryAnnotationInterval -C$colorsFile > $outfile

Disparities:
Supported operators
Supported parameters
Data models (2D vs. 3D)
Languages
Portability

Similarities:
Modular (pipeline based)
Pipelines serve as the visualization specification

Goal
Provide an abstraction that scientists can use to specify visualizations declaratively.
Proposed Layer: Declarative Specification; Abstraction Layer (interpret, assemble, execute)
Toolkit Layer: Conversion Operators, Transformation Operators, Mapping Operators, Viewing Operators
Graphics and Data Layer: Graphics Libraries (e.g., OpenGL), Data Streaming Libraries
Performance Layer: Parallel Libraries (MPI, OpenMP)

Approach
Apply the query-answering paradigm supported by DBMSs to visualization:
1. Abstract visualization pipelines in the form of declarative requests (visualization queries)
2. Construct knowledge bases of visualization operators
3. Develop methods for translating the abstractions into pipelines (query answering)
Example query:
VISUALIZE AS isosurfaces IN firefox WHERE FORMAT = csv AND TYPE = gravity AND interval = 5 AND xRotation = 10

Proposed Usage Pattern
The system may generate other visualizations of the same dataset from a variety of toolkits:
VISUALIZE AS * IN web-browser WHERE FORMAT = csv AND TYPE = gravity

Visualization Query Language
A visualization query specifies a visualization in a machine-readable and declarative form:
VISUALIZE AS isosurfaces IN web-browser WHERE FORMAT = binaryFloatArray AND TYPE = griddedTime AND zRot = 45 AND numConts = 35
Annotated parts of the query: Source Data; Visualization Abstraction and ViewerSet (VISUALIZE AS isosurfaces IN web-browser); Data Characterization (FORMAT, TYPE); Parameter Bindings (zRot, numConts)
Can also use a wildcard (VISUALIZE AS *) for explorative scenarios
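To make the query structure concrete, here is a minimal Python sketch of a parser for queries shaped like the slide's example. It is not the VisKo parser: the function name `parse_query`, the returned dictionary layout, and the rule that every WHERE key other than FORMAT and TYPE is a parameter binding are all assumptions made for illustration.

```python
import re

# Assumed query shape, taken from the slide examples:
#   VISUALIZE AS <abstraction> IN <viewer set> WHERE k = v AND k = v ...
QUERY_RE = re.compile(
    r"^\s*VISUALIZE\s+AS\s+(?P<abstraction>\S+)\s+IN\s+(?P<viewer>\S+)"
    r"(?:\s+WHERE\s+(?P<where>.+))?\s*$",
    re.IGNORECASE | re.DOTALL,
)


def parse_query(text):
    """Split a visualization query into abstraction, viewer, data description, parameters."""
    match = QUERY_RE.match(text)
    if match is None:
        raise ValueError("not a visualization query")

    bindings = {}
    where = match.group("where")
    if where:
        for clause in re.split(r"\s+AND\s+", where.strip(), flags=re.IGNORECASE):
            key, sep, value = clause.partition("=")
            if not sep:
                raise ValueError(f"malformed clause: {clause!r}")
            bindings[key.strip()] = value.strip()

    # Assumption: FORMAT and TYPE characterize the data; all other keys are parameters.
    data = {k.upper(): bindings.pop(k) for k in list(bindings)
            if k.upper() in ("FORMAT", "TYPE")}
    return {
        "abstraction": match.group("abstraction"),  # "*" acts as the wildcard
        "viewer": match.group("viewer"),
        "format": data.get("FORMAT"),
        "type": data.get("TYPE"),
        "parameters": bindings,
    }


query = parse_query(
    "VISUALIZE AS isosurfaces IN web-browser "
    "WHERE FORMAT = binaryFloatArray AND TYPE = griddedTime "
    "AND zRot = 45 AND numConts = 35"
)
```

Parameter values are kept as strings here; a fuller implementation would presumably type them against the operator descriptions.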

Queries versus Pipelines
Visualization abstraction (query): VISUALIZE AS views:2D_ContourMap IN viewersets:PDFViewer
Query answering translates the query into a pipeline.

Using MVE-based Toolkits
Modular Visualization Environment (MVE) based toolkits provide building blocks from which to compose visualizations.
A sequence of visualization operators is known as a pipeline.

Data Flow/State Pipeline Structure
Op 1: vtkDataObjectToDataSetFilter
Op 2: vtkShepardMethod
Op 3: vtkExtractVOI
Op 4: vtkContourFilter
Op 5: vtkPolyDataMapper
Models compared: Data Flow Model (Haber and McNabb, 1990) and Data State Model (Chi, 1998)
[Diagram labels: Value, Data Gathering, Mapping, Rendering, View; the Visualization Abstraction is specified in the query]

Other Toolkit Models
Table columns: Class, Model, Ops., Params., Data, Tasks, Vis.
Transform-Centric: Data State, Data Flow, Lattice
Data-Centric: Mackinlay, Zhou
User-Centric: Task-by-Data, Domino
Hybrid: Duke Ontology
Legend: no modeling; 1 = coarse level modeling; 2 = fine level modeling
We extend Data Flow and borrow from Data State, Zhou, and Duke.

Our Model
Operator types: Format Converter, Type Transformer, Data Filterer, Data Enricher/Gatherer, Mapper, Viewer
Data is described as Format[Type]
1. Operator-based perspective (Data State)
2. Data Enricher/Gatherer specialties
3. Input/output in terms of format[type] (Mackinlay and MIME)
4. Mapper (Data Flow)
5. Operators as web services
6. Injects an optional Format Converter after Mappers
7. Fuses Renderer with Viewer, inspired by Data State
[Diagram labels: Mapper mapsTo Visualization Abstraction (e.g., treemap, isosurfaces); implBy Web Service]

Model Limitations
Interactive viewers
Composite operators (i.e., operators that perform both gridding and filtering)
– Need to model these from a singular perspective
Multi-faceted operators (i.e., operators that serve different functions based on input data or parameter settings)
– Need to model these from multiple perspectives (multiple descriptions)
– Might employ rules in future versions (e.g., if 2D input, then 2D output)

A VTK Pipeline in terms of our Model
Data states along the pipeline: CSV [owl:Thing gravity-data], SSV [gravity-data], XML [vtkPolyData], XML [vtkImageData3D], XML [vtkImageData2D], XML [vtkPolyData]
Operator roles shown: Converter, Transformer, Mapper, Viewer
Observation 1: Format converters are type agnostic; they ingest/output owl:Thing and propagate the type.
Observation 2: Type polymorphism; scalable MIME.
Observation 3: Dimension reduction is not explicitly specified; it is inferred through type requirements.
Observation 4: Formats and types are defined in ontologies, fostering interoperability.

Populating the Knowledge Base
Users register operators by describing their functions in terms of our model:
Classify: converter, transformer, filter, mapper, or viewer
Specify: visualization abstraction if mapper
Specify: input/output format[type]
The resulting operator descriptions form a search graph:
Nodes are operators
Arcs are data (i.e., format[type])
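The registration step can be sketched in Python as follows. This is only an illustration of operators as nodes and format[type] values as arcs; the class names `OperatorDescription` and `KnowledgeBase`, the validation rules, and the three example entries (including the roles, format[type] strings, and the abstraction name assigned to the VTK classes) are assumptions, not the actual VisKo knowledge base.

```python
from dataclasses import dataclass
from typing import List, Optional

ROLES = {"converter", "transformer", "filter", "mapper", "viewer"}


@dataclass(frozen=True)
class OperatorDescription:
    name: str
    role: str                          # one of ROLES
    input: str                         # consumed data, written as format[type]
    output: Optional[str] = None       # produced data; viewers produce none
    abstraction: Optional[str] = None  # visualization abstraction, mappers only


class KnowledgeBase:
    def __init__(self) -> None:
        self.operators: List[OperatorDescription] = []

    def register(self, op: OperatorDescription) -> None:
        """Check the description against the model, then add it to the graph."""
        if op.role not in ROLES:
            raise ValueError(f"unknown role: {op.role}")
        if op.role == "mapper" and op.abstraction is None:
            raise ValueError("a mapper must name its visualization abstraction")
        if op.role != "viewer" and op.output is None:
            raise ValueError("only viewers may omit an output")
        self.operators.append(op)

    def successors(self, op: OperatorDescription) -> List[OperatorDescription]:
        """Operators reachable over one arc: their input equals op's output."""
        if op.output is None:
            return []
        return [o for o in self.operators if o.input == op.output]


# Hypothetical registrations, loosely modeled on the VTK classes named in the slides.
kb = KnowledgeBase()
shepard = OperatorDescription(
    "vtkShepardMethod", "transformer", "XML[vtkPolyData]", "XML[vtkImageData3D]")
extract = OperatorDescription(
    "vtkExtractVOI", "transformer", "XML[vtkImageData3D]", "XML[vtkImageData2D]")
contour = OperatorDescription(
    "vtkContourFilter", "mapper", "XML[vtkImageData2D]", "XML[vtkPolyData]",
    abstraction="2D-contour-map")
for description in (shepard, extract, contour):
    kb.register(description)
```

Matching arcs by exact string equality is the simplest possible choice; the slides indicate that formats and types live in ontologies, so a real matcher would likely reason over subtypes instead.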

Answering Visualization Queries
Visualization queries specify:
Source format[type]
Target visualization abstraction
Target viewer
Example query:
VISUALIZE AS 3d-point-plot IN firefox WHERE FORMAT = csv AND TYPE = gravity-data
Search algorithm:
Specialized depth-first traversal
Tailored for ensuring pipeline model structure
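A depth-first search of this kind might look like the Python sketch below. It is an assumption-laden simplification, not the specialized traversal from the slides: operators are plain dictionaries, the only structural rule enforced is "exactly one mapper, with the requested abstraction, before the viewer", and every operator name and format[type] string in the example is hypothetical.

```python
def answer_query(operators, source, abstraction, viewer):
    """Return every operator sequence that shows `source` as `abstraction` in `viewer`."""
    results = []

    def dfs(data, path, seen, mapped):
        for op in operators:
            if op["input"] != data:
                continue
            if op["role"] == "viewer":
                # A pipeline is complete only if a mapper has already run.
                if mapped and op["name"] == viewer:
                    results.append(path + [op["name"]])
                continue
            is_mapper = op["role"] == "mapper"
            if is_mapper and (mapped or abstraction not in ("*", op["abstraction"])):
                continue  # second mapper, or wrong visualization abstraction
            state = (op["output"], mapped or is_mapper)
            if state in seen:
                continue  # already on this path; avoid cycling
            dfs(op["output"], path + [op["name"]], seen | {state}, state[1])

    dfs(source, [], {(source, False)}, False)
    return results


# Hypothetical knowledge base: names and format[type] strings are illustrative only.
OPERATORS = [
    {"name": "gridder", "role": "transformer",
     "input": "CSV[gravity-data]", "output": "NetCDF[grid]", "abstraction": None},
    {"name": "contour-mapper", "role": "mapper",
     "input": "NetCDF[grid]", "output": "PS[image]", "abstraction": "contour-map"},
    {"name": "raster-mapper", "role": "mapper",
     "input": "NetCDF[grid]", "output": "PS[image]", "abstraction": "raster-map"},
    {"name": "ps2pdf", "role": "converter",
     "input": "PS[image]", "output": "PDF[image]", "abstraction": None},
    {"name": "pdf-viewer", "role": "viewer",
     "input": "PDF[image]", "output": None, "abstraction": None},
]

one = answer_query(OPERATORS, "CSV[gravity-data]", "contour-map", "pdf-viewer")
every = answer_query(OPERATORS, "CSV[gravity-data]", "*", "pdf-viewer")
```

With the wildcard abstraction the same search returns one pipeline per mapper that can reach the viewer, which corresponds to the explorative use of `VISUALIZE AS *`.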

Multiple Pipeline Results
Source: gravity-data[CSV]
Viewers: PDF Viewer, JPEG Viewer, FITS Viewer, Web Browser
The query requested that gravity-data[CSV] be viewed in the Web Browser.
Kicking off the DFS yields a forest of pipeline results.

Pipeline Structure (Automata)
A pipeline can be considered a sentence in a pipeline language that describes a visualization. It has an alphabet (i.e., operator types) and a grammar (i.e., structure).
T: Transformer
C: Converter
F: Filter
FT: Post-Filtering Transformer
FC: Post-Filtering Converter
M: Mapper
MC: Post-Mapping Converter
V: Viewer
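The idea of checking a pipeline against a grammar can be illustrated with a small finite automaton in Python. The transition table below is an assumed reading of the alphabet (transformers and converters first, then an optional filter stage, then one mapper, an optional post-mapping converter, and a viewer); the slide's actual automaton diagram did not survive in this transcript, so the real grammar may differ.

```python
# Assumed transitions between pipeline stages; state names are illustrative.
TRANSITIONS = {
    "start":     {"T": "pre", "C": "pre", "F": "filtered", "M": "mapped"},
    "pre":       {"T": "pre", "C": "pre", "F": "filtered", "M": "mapped"},
    "filtered":  {"FT": "filtered", "FC": "filtered", "M": "mapped"},
    "mapped":    {"MC": "converted", "V": "viewed"},
    "converted": {"V": "viewed"},
}
ACCEPTING = {"viewed"}


def is_valid_pipeline(symbols):
    """Return True if the sequence of operator-type symbols is accepted."""
    state = "start"
    for symbol in symbols:
        # No outgoing transition for this symbol means the sentence is rejected.
        state = TRANSITIONS.get(state, {}).get(symbol)
        if state is None:
            return False
    return state in ACCEPTING
```

Under these assumed rules, `["T", "C", "F", "FT", "M", "MC", "V"]` is accepted, while a pipeline with no mapper, such as `["T", "V"]`, is rejected.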

Sharing Visualizations
1. Send image (contents or by URL)
Recipient may be unable to adjust any properties such as contour interval, color tables, projection, and labels
2. Send data
Recipient may not have the tools, capabilities, and expertise to regenerate the visualization from data
3. Send URL of visualization embedded in viewer
These solutions have been implemented only for specific domains, for example OGC
4. Send a VisKo query specifying the visualization
VisKo queries address the limitations above
VISUALIZE AS contour-map IN web-browser WHERE FORMAT = csv AND TYPE = gravity

Evaluation
There are many facets associated with our approach:
Facet | Evaluation | Metric
Model | Validity | Toolkit Coverage
Model | Intuition of Visualization Abstractions | Survey
Performance | Search Algorithm | Complexity
Performance | Pipeline Execution | Execution/Data Transfer Time
Query | Effectiveness | Readability and Writability

User Study
Compare our query language to pipeline languages:
– (Use 1) Is our query language more readable than pipeline code?
– (Use 2) Is our query language more writable than pipeline code?
Control: pipeline specifications
Independent Variable: Pipeline/Query Language
Dependent Variable: Correctness
Readability Trials:
Given | Task | Required
Pipeline/Query, Visualization Options | Identify | A visualization selection
Writability Trials:
Given | Task | Required
Operators/Resources, Visualization | Compose | Pipeline/Query

Independent Variable Isolation
Pre-existing toolkit knowledge:
– Target demographic had experience using an MVE-based toolkit
The language factor:
– Pipelines specified in an abstract form:
1. Operator 1
2. Operator 2
3. Operator 3
4. …
Visualization misinterpretation:
– All visualizations were labeled
The parameter factor:
– No parameters

Results: Tasks
Outcome: failed readability; passed writability
Table 1 (Pipeline vs. Query, paired): Observations 15; Hypothesized Mean Difference 0; df 14. Rows: Mean, Variance, Pearson Correlation, t Stat, P(T<=t) one-tail, t Critical one-tail, P(T<=t) two-tail, t Critical two-tail
Table 2 (Pipeline vs. Query, pooled variance): Observations 15 and 11; Hypothesized Mean Difference 0; df 24. Rows: Mean, Variance, Pooled Variance, t Stat, P(T<=t) one-tail, t Critical one-tail, P(T<=t) two-tail, t Critical two-tail
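The two table layouts correspond to a paired t-test (df = n - 1, hence 14 for 15 observations) and an equal-variance two-sample t-test (df = n1 + n2 - 2, hence 24 for 15 and 11 observations). The Python sketch below shows how the t statistic and degrees of freedom are computed in each case. The function names are assumptions, and no study data is reproduced here: the scores in the usage lines are made-up placeholders chosen only to exercise the functions.

```python
import math
from statistics import mean, variance


def paired_t(a, b):
    """Paired t-test with hypothesized mean difference 0; returns (t, df)."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    t = mean(diffs) / math.sqrt(variance(diffs) / n)
    return t, n - 1


def pooled_t(a, b):
    """Two-sample t-test assuming equal variances; returns (t, df)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    # Pooled variance weights each sample variance by its degrees of freedom.
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df


# Placeholder scores (NOT the study's data), sized to match the reported observation counts.
placeholder_pipeline = [float(i % 4) for i in range(15)]
placeholder_query = [float((i * 2) % 5) for i in range(15)]
_, paired_df = paired_t(placeholder_pipeline, placeholder_query)
_, pooled_df = pooled_t(placeholder_pipeline, placeholder_query[:11])
```

The resulting t statistic is then compared against the critical value for the given df at the chosen significance level, which is what the "t Critical" rows in the tables report.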

Results: Questionnaire
Outcome: passed readability; passed writability
Table 1 (Pipeline vs. Query, paired): Observations 15; Hypothesized Mean Difference 0; df 14. Rows: Mean, Variance, Pearson Correlation, t Stat, P(T<=t) one-tail, t Critical one-tail, P(T<=t) two-tail, t Critical two-tail
Table 2 (Pipeline vs. Query, paired): Observations 15; Hypothesized Mean Difference 0; df 14. Rows: Mean, Variance, Pearson Correlation, t Stat, P(T<=t) one-tail, t Critical one-tail, P(T<=t) two-tail, t Critical two-tail

Future Work

References
1. E. H. Chi and J. Riedl. An operator interaction framework for visualization systems.
2. K. Brodlie and N. M. Noor. Visualization notations, models and taxonomies.
3. D. J. Duke, K. W. Brodlie, and D. A. Duce. Building an ontology of visualization.
4. R. B. Haber and D. A. McNabb. Visualization idioms: A conceptual model for scientific visualization systems.
5. W. L. Hibbard, C. R. Dyer, and B. E. Paul. A lattice model for data display.
6. J. Mackinlay, P. Hanrahan, and C. Stolte. Show Me: Automatic presentation for visual analysis.
7. B. Shneiderman. The eyes have it: A task by data type taxonomy for information visualizations.
8. M. X. Zhou and S. K. Feiner. Data characterization for automatically visualizing heterogeneous information.

Backup Slides

Resources Snippet Formats: Types:

Pipeline Readability
Trial Type 1 (GMT-based)
Instructions: Using the input data and pipeline described below, choose the visualization that would most likely be generated by circling it.
NOTE: Please refrain from leveraging any source outside of the evaluation material presented to you. This includes toolkit manuals of any kind (e.g., versions published on the Web).
Input Data Description:
Data Format: XYZ List (longitude, latitude, scalar-of-interest) in tabular ASCII
Data Type: Unstructured Points
Data Dimensionality: 2
Pipeline: surface.exe, grdImage.exe, ps2pdf.exe
Possible Visualization Outputs (circle the most likely output):
Visualization A: Raster Map
Visualization B: Contour Map
Visualization C: other

Pipeline Writability
Trial Type 2 (GMT-based)
Instructions: Using the input data, list of pipeline operators, and visualization shown below, write the visualization pipeline that would most likely generate the visualization.
NOTE: Please refrain from leveraging any source outside of the evaluation material presented to you. This includes toolkit manuals of any kind (e.g., versions published on the Web).
Input Data Description:
Data Format: XYZE (longitude, latitude, scalar-of-interest, elevation) in tabular ASCII
Data Type: Unstructured Points
Data Dimensionality: 3
Visualization: XY Plot to be viewed in a web browser
Visualization Pipeline (please write down the pipeline that could generate the visualization):

Query Readability
Example: Trial Type 3 (VTK-based)
Instructions: Using the input data and query described below, choose the visualization that would most likely be generated by circling it.
NOTE: Please refrain from leveraging any source outside of the evaluation material presented to you. This includes toolkit manuals of any kind (e.g., versions published on the Web).
Input Data Description:
Data Format: Binary Float Array
Data Type: Gridded Vectors
Data Dimensionality: 3
Query: VISUALIZE AS IN WHERE TYPE = AND FORMAT =
Possible Visualization Outputs (circle the most likely output and justify your selection on the back):
Visualization A: Hedge-Hog
Visualization B: Stream-lines
Visualization C: null

Query Writability
Example: Trial Type 4 (VTK-based)
Instructions: Using the input data, list of visualization resources, and visualization shown below, write the visualization query that would most likely generate the visualization.
NOTE: Please refrain from leveraging any source outside of the evaluation material presented to you. This includes toolkit manuals of any kind (e.g., versions published on the Web).
Input Data Description:
Data Format: Binary Float Array
Data Type: Gridded Vectors
Data Dimensionality: 2
Data Location:
Visualization: Glyphs to be viewed in a web browser
Visualization Query (please write down the query that could generate the visualization):

VisKo Query Submission

VisKo Pipeline Results

Parameter Editing

Visualization Result

Query Examples

Contributing Knowledge through Modules
[Diagram: a Module contains Service source, Service Meta-data generation source, and Resources (types, formats, and vis abstractions). The Module SDK provides Service Libs and Meta-data gen libs. Edge labels: reads, refs, installs, execs, publishes, searches. Server side: Server, Services, Service Meta Data. Client side: Visko-app.]